ASR for long audio files (streaming) with Beam Search/LM #4597
Replies: 3 comments
-
Not really. We do plan on doing a high level API for beam search, but it will not be with a LM. There is no plan for such integration in the next few months either since for streaming efficiently and in production environment we have Nvidia Riva for that. |
Beta Was this translation helpful? Give feedback.
-
Thank you for the answer. Any recommendations for SAD models? |
Beta Was this translation helpful? Give feedback.
-
If you have ground truth labels, then you can use CTC segmentation in Nemo to auto split audio files. Any production grade VAD model can also do it, though I don't know of any open source production grade VAD models, most are research grade. Maybe @fayejf knows |
Beta Was this translation helpful? Give feedback.
-
Hi
Following older answers for closed issues: #2307 and the tutorial: https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/Streaming_ASR.ipynb.
Any updates on Nemo supporting LM with long audio (streaming)?
Beta Was this translation helpful? Give feedback.
All reactions