❓ Help How to classify an 20ms audio ? #92

649459021 · 2021-08-20T07:47:36Z

649459021
Aug 20, 2021

I just want to classify an 20ms audio whether there are people talking.
Could you give me some examples?
Thanks

Aug 20, 2021

Hi,

20ms is a too small of an audio chunk size for our VAD - https://github.com/snakers4/silero-vad#how-vad-works
For very small chunks - please use WebRTC VAD

View full answer

snakers4 · 2021-08-20T07:49:53Z

snakers4
Aug 20, 2021
Maintainer

Hi,

20ms is a too small of an audio chunk size for our VAD - https://github.com/snakers4/silero-vad#how-vad-works
For very small chunks - please use WebRTC VAD

2 replies

649459021 Aug 20, 2021
Author

Or maybe longer audio.
I just want to know how to classify an audio.

snakers4 Aug 20, 2021
Maintainer

Then just split audio into some manageable chunks (i.e. several minutes), batch them, run though the network and plot the results

You will have to fiddle a bit with thresholds to produce a simple reliable 1/0 classifier, but I guess some naive approach like a percentage of audio over certain medium threshold will work

You can push this into examples afterwards

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

❓ Help How to classify an 20ms audio ? #92

{{title}}

Replies: 1 comment 2 replies

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

❓ Help How to classify an 20ms audio ? #92

649459021 Aug 20, 2021

Replies: 1 comment · 2 replies

snakers4 Aug 20, 2021 Maintainer

649459021 Aug 20, 2021 Author

snakers4 Aug 20, 2021 Maintainer

649459021
Aug 20, 2021

Replies: 1 comment 2 replies

snakers4
Aug 20, 2021
Maintainer

649459021 Aug 20, 2021
Author

snakers4 Aug 20, 2021
Maintainer