no code implementations • 29 May 2023 • Abhinav Goyal, Nikesh Garera
Our model achieves a word error rate (WER) of 3. 69% without EOS and 4. 78% with EOS while also reducing the search latency by approximately ~1300 ms (equivalent to 46. 64% reduction) when compared to an independent voice activity detection (VAD) model.
no code implementations • 26 Oct 2022 • Abhinav Goyal, Anupam Singh, Nikesh Garera
Automation of on-call customer support relies heavily on accurate and efficient speech-to-intent (S2I) systems.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3