Multimodal and multiresolution depression detection from speech and facial landmark features M Nasir, A Jati, PG Shivakumar, S Nallan Chakravarthula, P Georgiou Proceedings of the 6th international workshop on audio/visual emotion …, 2016 | 91 | 2016 |
Transfer learning from adult to children for speech recognition: Evaluation, analysis and recommendations PG Shivakumar, P Georgiou Computer speech & language 63, 101077, 2020 | 85 | 2020 |
Improving speech recognition for children using acoustic adaptation and pronunciation modeling. PG Shivakumar, A Potamianos, S Lee, SS Narayanan WOCCI, 15-19, 2014 | 71 | 2014 |
Perception optimized deep denoising autoencoders for speech enhancement. PG Shivakumar, PG Georgiou Interspeech, 3743-3747, 2016 | 48 | 2016 |
Learning from past mistakes: improving automatic speech recognition output via noisy-clean phrase context modeling PG Shivakumar, H Li, K Knight, P Georgiou APSIPA Transactions on Signal and Information Processing 8, 2019 | 26 | 2019 |
Spoken Language Intent Detection Using Confusion2Vec PG Shivakumar, M Yang, P Georgiou Proc. Interspeech 2019, 819--823, 2019 | 18 | 2019 |
Simplified and supervised i-vector modeling for speaker age regression PG Shivakumar, M Li, V Dhandhania, SS Narayanan 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 16 | 2014 |
Confusion2vec: Towards enriching vector space word representations with representational ambiguities PG Shivakumar, P Georgiou PeerJ Computer Science 5, e195, 2019 | 12 | 2019 |
Multimodal Fusion of Multirate Acoustic, Prosodic, and Lexical Speaker Characteristics for Native Language Identification. PG Shivakumar, SN Chakravarthula, PG Georgiou INTERSPEECH, 2408-2412, 2016 | 10 | 2016 |
End-to-end neural systems for automatic children speech recognition: An empirical study PG Shivakumar, S Narayanan Computer Speech & Language 72, 101289, 2022 | 9 | 2022 |
Incremental online spoken language understanding PG Shivakumar, N Kumar, P Georgiou, S Narayanan arXiv preprint arXiv:1910.10287, 2019 | 4 | 2019 |
Behavior gated language models PG Shivakumar, SY Tseng, P Georgiou, S Narayanan arXiv preprint arXiv:1909.00107, 2019 | 3 | 2019 |
RNN Based Incremental Online Spoken Language Understanding PG Shivakumar, N Kumar, P Georgiou, S Narayanan 2021 IEEE Spoken Language Technology Workshop (SLT), 989-996, 2021 | 2 | 2021 |
Confusion2Vec 2.0: Enriching ambiguous spoken language representations with subwords P Gurunath Shivakumar, P Georgiou, S Narayanan PloS one 17 (3), e0264488, 2022 | 1 | 2022 |
Systems and methods for incremental natural language understanding KN Kumar, JR Kennedy, SD Lombardo, PG Shivakumar US Patent 11,195,533, 2021 | 1 | 2021 |
Phone Duration Modeling for Speaker Age Estimation in Children PG Shivakumar, S Bishop, C Lord, S Narayanan arXiv preprint arXiv:2109.01568, 2021 | | 2021 |
Phone Duration Modeling for Speaker Age Estimation in Children P Gurunath Shivakumar, S Bishop, C Lord, S Narayanan arXiv e-prints, arXiv: 2109.01568, 2021 | | 2021 |
RNN based Incremental Online Spoken Language Understanding P Gurunath Shivakumar, N Kumar, P Georgiou, S Narayanan arXiv e-prints, arXiv: 1910.10287, 2019 | | 2019 |
Behavior Gated Language Models P Gurunath Shivakumar, SY Tseng, P Georgiou, S Narayanan arXiv e-prints, arXiv: 1909.00107, 2019 | | 2019 |
Spoken Language Intent Detection using Confusion2Vec P Gurunath Shivakumar, M Yang, P Georgiou arXiv e-prints, arXiv: 1904.03576, 2019 | | 2019 |