Follow
Jay Mahadeokar
Jay Mahadeokar
Facebook AI
Verified email at fb.com
Title
Cited by
Cited by
Year
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2432020
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1642022
Transformer-transducer: End-to-end speech recognition with self-attention
CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ...
arXiv preprint arXiv:1910.12977, 2019
1642019
Voicebox: Text-guided multilingual universal speech generation at scale
M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ...
Advances in neural information processing systems 36, 2024
992024
Contextual RNN-T for open domain ASR
M Jain, G Keren, J Mahadeokar, G Zweig, F Metze, Y Saraf
arXiv preprint arXiv:2006.03411, 2020
912020
Deep shallow fusion for RNN-T personalization
D Le, G Keren, J Chan, J Mahadeokar, C Fuegen, ML Seltzer
2021 IEEE Spoken Language Technology Workshop (SLT), 251-257, 2021
712021
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion
D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ...
arXiv preprint arXiv:2104.02194, 2021
692021
Alignment restricted streaming recurrent neural network transducer
J Mahadeokar, Y Shangguan, D Le, G Keren, H Su, T Le, CF Yeh, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2021
662021
RNN-T for latency controlled ASR with improved beam search
M Jain, K Schubert, J Mahadeokar, CF Yeh, K Kalgaonkar, A Sriram, ...
arXiv preprint arXiv:1911.01629, 2019
432019
Prompting large language models with speech recognition abilities
Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
422024
Improved neural language model fusion for streaming recurrent neural network transducer
S Kim, Y Shangguan, J Mahadeokar, A Bruguier, C Fuegen, ML Seltzer, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
242021
Dissecting user-perceived latency of on-device E2E speech recognition
Y Shangguan, R Prabhavalkar, H Su, J Mahadeokar, Y Shi, J Zhou, C Wu, ...
arXiv preprint arXiv:2104.02207, 2021
242021
Dynamic encoder transducer: A flexible solution for trading off accuracy for latency
Y Shi, V Nagaraja, C Wu, J Mahadeokar, D Le, R Prabhavalkar, A Xiao, ...
arXiv preprint arXiv:2104.02176, 2021
142021
Federated domain adaptation for asr with full self-supervision
J Jia, J Mahadeokar, W Zheng, Y Shangguan, O Kalinli, F Seide
arXiv preprint arXiv:2203.15966, 2022
122022
Streaming parallel transducer beam search with fast-slow cascaded encoders
J Mahadeokar, Y Shi, K Li, D Le, J Zhu, V Chandra, O Kalinli, ML Seltzer
arXiv preprint arXiv:2203.15773, 2022
122022
Streaming transformer transducer based speech recognition using non-causal convolution
Y Shi, C Wu, D Wang, A Xiao, J Mahadeokar, X Zhang, C Liu, K Li, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
112022
Memory-efficient speech recognition on smart devices
G Venkatesh, A Valliappan, J Mahadeokar, Y Shangguan, C Fuegen, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
112021
Spatial attention for far-field speech recognition with deep beamforming neural networks
W He, L Lu, B Zhang, J Mahadeokar, K Kalgaonkar, C Fuegen
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
112020
Procter: Pronunciation-aware contextual adapter for personalized speech recognition in neural transducers
R Pandey, R Ren, Q Luo, J Liu, A Rastrow, A Gandhe, D Filimonov, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
92023
Multi-head state space model for speech recognition
Y Fathullah, C Wu, Y Shangguan, J Jia, W Xiong, J Mahadeokar, C Liu, ...
arXiv preprint arXiv:2305.12498, 2023
82023
The system can't perform the operation now. Try again later.
Articles 1–20