| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| MLS: A large-scale multilingual dataset for speech research | V Pratap, Q Xu, A Sriram, G Synnaeve, R Collobert | INTERSPEECH 2020, 21st Annual Conference of the International Speech …, 2020 | 285 | 2020 |
| End-to-end ASR: from supervised to semi-supervised learning with modern architectures | G Synnaeve, Q Xu, J Kahn, T Likhomanenko, E Grave, V Pratap, A Sriram, ... | International Conference on Machine Learning - Workshop on Self-supervised …, 2019 | 234 | 2019 |
| Wav2letter++: A fast open-source speech recognition system | V Pratap, A Hannun, Q Xu, J Cai, J Kahn, G Synnaeve, V Liptchinsky, ... | ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 200 | 2019 |
| Robust wav2vec 2.0: Analyzing domain shift in self-supervised pre-training | WN Hsu, A Sriram, A Baevski, T Likhomanenko, Q Xu, V Pratap, J Kahn, ... | INTERSPEECH 2021, 22nd Annual Conference of the International Speech …, 2021 | 183 | 2021 |
| Massively Multilingual ASR: 50 languages, 1 model, 1 billion parameters | V Pratap, A Sriram, P Tomasello, A Hannun, V Liptchinsky, G Synnaeve, ... | INTERSPEECH 2020, 21st Annual Conference of the International Speech …, 2020 | 128 | 2020 |
| Rethinking evaluation in ASR: Are our models robust enough? | T Likhomanenko, Q Xu, V Pratap, P Tomasello, J Kahn, G Avidov, ... | INTERSPEECH 2021, 22nd Annual Conference of the International Speech …, 2020 | 76 | 2020 |
| Scaling speech technology to 1,000+ languages | V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ... | arXiv preprint arXiv:2305.13516, 2023 | 45 | 2023 |
| Scaling up online speech recognition using convnets | V Pratap, Q Xu, J Kahn, G Avidov, T Likhomanenko, A Hannun, ... | INTERSPEECH 2020, 21st Annual Conference of the International Speech …, 2020 | 38 | 2020 |
| Differentiable weighted finite-state transducers | A Hannun, V Pratap, J Kahn, WN Hsu | arXiv preprint arXiv:2010.01003, 2020 | 26 | 2020 |
| Flashlight: Enabling innovation in tools for machine learning | JD Kahn, V Pratap, T Likhomanenko, Q Xu, A Hannun, J Cai, P Tomasello, ... | International Conference on Machine Learning, 10557-10574, 2022 | 18 | 2022 |
| Performance evaluation of offline speech recognition on edge devices | S Gondi, V Pratap | Electronics 10 (21), 2697, 2021 | 13 | 2021 |
| Star temporal classification: Sequence classification with partially labeled data | V Pratap, A Hannun, G Synnaeve, R Collobert | arXiv preprint arXiv:2201.12208, 2022 | 9* | 2022 |
| Performance and efficiency evaluation of ASR inference on the edge | S Gondi, V Pratap | Sustainability 13 (22), 12392, 2021 | 8 | 2021 |
| Word order does not matter for speech recognition | V Pratap, Q Xu, T Likhomanenko, G Synnaeve, R Collobert | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 5 | 2022 |
| Parallel Composition of Weighted Finite-State Transducers | S Sengupta, V Pratap, A Hannun | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 1 | 2022 |
| TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch | J Hwang, M Hira, C Chen, X Zhang, Z Ni, G Sun, P Ma, R Huang, V Pratap, ... | arXiv preprint arXiv:2310.17864, 2023 | | 2023 |
| Customized keyword query suggestions on online social networks | KS Hazra, VP Konduru | US Patent 10,534,815, 2020 | | 2020 |