Emotional prosody control for speech generation S Sivaprasad, S Kosgi, V Gandhi arXiv preprint arXiv:2111.04730, 2021 | 12 | 2021 |
Reappraising domain generalization in neural networks S Sivaprasad, A Goindani, V Garg, R Basu, S Kosgi, V Gandhi arXiv preprint arXiv:2110.07981, 2021 | 4 | 2021 |
ParrotTTS: Text-to-speech synthesis exploiting disentangled self-supervised representations N Shah, S Kosgi, V Tambrahalli, S Neha, A Nelakanti, V Gandhi Findings of the Association for Computational Linguistics: EACL 2024, 79-91, 2024 | | 2024 |
Adversarial Robustness of Mel Based Speaker Recognition Systems R Srivastava, S Kosgi, S Sivaprasad, N Sahipjohn, V Gandhi 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | | 2023 |
MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting N Shah, V Tambrahalli, S Kosgi, N Pedanekar, V Gandhi arXiv preprint arXiv:2305.11926, 2023 | | 2023 |
Towards building controllable Text to Speech systems K Saiteja International Institute of Information Technology Hyderabad, 2023 | | 2023 |
ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations N Shah, S Kosgi, V Tambrahalli, N Sahipjohn, N Pedanekar, V Gandhi arXiv preprint arXiv:2303.01261, 2023 | | 2023 |
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems S Kosgi, S Sivaprasad, N Pedanekar, A Nelakanti, V Gandhi Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | | 2022 |