Thoughts on vad #160
YLQY
started this conversation in
Show and tell
Replies: 1 comment
-
Hi,
Can you please elaborate on this? What is the difference between cropping and disconnecting? Do you have poor performance on edges? Maybe provide a sample audio + hyper-params + the sample chart that is drawn using |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In our experiments, we found that when using vad for speech recognition, it is more inclined to disconnect the audio without cropping the audio. This reduces miscuts of vad. In our Chinese online game scene and TV drama scene, the effect of the former will be 3-4 points better than the latter :)
Beta Was this translation helpful? Give feedback.
All reactions