Frank Qiaochu Zhang
Title
Cited by
Cited by
Year
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
682020
Streaming transformer-based acoustic models using self-attention with augmented memory
C Wu, Y Wang, Y Shi, CF Yeh, F Zhang
arXiv preprint arXiv:2005.08042, 2020
142020
Deja-vu: Double feature presentation and iterated loss in deep transformer networks
A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
112020
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig
Interspeech 2020, 2020
92020
Weak-Attention Suppression For Transformer Based Speech Recognition
Y Shi, Y Wang, C Wu, C Fuegen, F Zhang, D Le, CF Yeh, ML Seltzer
arXiv preprint arXiv:2005.09137, 2020
82020
Improving RNN transducer based ASR with auxiliary tasks
C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig
2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021
72021
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition
Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer
arXiv preprint arXiv:2010.10759, 2020
62020
Multilingual graphemic hybrid ASR with massive data augmentation
C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig
arXiv preprint arXiv:1909.06522, 2019
62019
Contextualizing asr lattice rescoring with hybrid pointer network language model
DR Liu, C Liu, F Zhang, G Synnaeve, Y Saraf, G Zweig
arXiv preprint arXiv:2005.07394, 2020
42020
Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition
CF Yeh, Y Wang, Y Shi, C Wu, F Zhang, J Chan, ML Seltzer
2021 IEEE Spoken Language Technology Workshop (SLT), 8-14, 2021
22021
Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR
X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 46-51, 2021
22021
Training asr models by generation of contextual information
K Singh, D Okhonko, J Liu, Y Wang, F Zhang, R Girshick, S Edunov, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
22020
Deja-vu: Double feature presentation in deep transformer networks
A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ...
Submitted to ICASSP, 2020
22020
Multilingual asr with massive data augmentation
C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig
arXiv preprint arXiv:1909.06522, 2019
22019
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
Y Wang, Y Shi, F Zhang, C Wu, J Chan, CF Yeh, A Xiao
arXiv preprint arXiv:2010.14665, 2020
12020
The system can't perform the operation now. Try again later.
Articles 1–15