George Saon
Verified email at us.ibm.com
Title
Cited by
Year
Speaker adaptation of neural network acoustic models using i-vectors
G Saon, H Soltau, D Nahamoo, M Picheny
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 55-59, 2013
Cited by: 574 | Year: 2013
Boosted MMI for model and feature-space discriminative training
D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, ...
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
Cited by: 446 | Year: 2008
Deep convolutional neural networks for large-scale speech tasks
TN Sainath, B Kingsbury, G Saon, H Soltau, A Mohamed, G Dahl, ...
Neural Networks 64, 39-48, 2015
Cited by: 377 | Year: 2015
fMPE: Discriminatively trained features for speech recognition
D Povey, B Kingsbury, L Mangu, G Saon, H Soltau, G Zweig
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005
Cited by: 359 | Year: 2005
Maximum likelihood discriminant feature spaces
G Saon, M Padmanabhan, R Gopinath, S Chen
2000 IEEE International Conference on Acoustics, Speech, and Signal …, 2000
Cited by: 277 | Year: 2000
English conversational telephone speech recognition by humans and machines
G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ...
arXiv preprint arXiv:1703.02136, 2017
Cited by: 276 | Year: 2017
Improvements to deep convolutional neural networks for LVCSR
TN Sainath, B Kingsbury, A Mohamed, GE Dahl, G Saon, H Soltau, ...
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 315-320, 2013
Cited by: 199 | Year: 2013
The IBM 2015 English conversational telephone speech recognition system
G Saon, HKJ Kuo, S Rennie, M Picheny
arXiv preprint arXiv:1505.05899, 2015
Cited by: 195 | Year: 2015
The IBM Attila speech recognition toolkit
H Soltau, G Saon, B Kingsbury
2010 IEEE Spoken Language Technology Workshop, 97-102, 2010
Cited by: 160 | Year: 2010
Advances in speech transcription at IBM under the DARPA EARS program
SF Chen, B Kingsbury, L Mangu, D Povey, G Saon, H Soltau, G Zweig
IEEE Transactions on Audio, Speech, and Language Processing 14 (5), 1596-1608, 2006
Cited by: 153 | Year: 2006
The IBM 2004 conversational telephony system for rich transcription
H Soltau, B Kingsbury, L Mangu, D Povey, G Saon, G Zweig
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005
Cited by: 141 | Year: 2005
Large-vocabulary continuous speech recognition systems: A look at some recent advances
G Saon, JT Chien
IEEE Signal Processing Magazine 29 (6), 18-33, 2012
Cited by: 132 | Year: 2012
Direct acoustics-to-word models for English conversational speech recognition
K Audhkhasi, B Ramabhadran, G Saon, M Picheny, D Nahamoo
arXiv preprint arXiv:1703.07754, 2017
Cited by: 117 | Year: 2017
Anatomy of an extremely fast LVCSR decoder
G Saon, D Povey, G Zweig
Ninth European Conference on Speech Communication and Technology, 2005
Cited by: 94 | Year: 2005
Joint training of convolutional and non-convolutional neural networks
H Soltau, G Saon, TN Sainath
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 90 | Year: 2014
Building competitive direct acoustics-to-word models for English conversational speech recognition
K Audhkhasi, B Kingsbury, B Ramabhadran, G Saon, M Picheny
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 87 | Year: 2018
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions
S Thomas, S Ganapathy, G Saon, H Soltau
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 86 | Year: 2014
Data-driven approach to designing compound words for continuous speech recognition
G Saon, M Padmanabhan
IEEE Transactions on Speech and Audio Processing 9 (4), 327-332, 2001
Cited by: 85 | Year: 2001
Feature and model space speaker adaptation with full covariance Gaussians
D Povey, G Saon
Ninth International Conference on Spoken Language Processing, 2006
Cited by: 83 | Year: 2006
Methods and apparatus for performing heteroscedastic discriminant analysis in pattern recognition systems
RA Gopinath, M Padmanabhan, GA Saon
US Patent 6,609,093, 2003
Cited by: 68 | Year: 2003