Rohan Badlani

Cited by

	All	Since 2019
Citations	379	323
h-index	9	9
i10-index	9	9

Citations per year
Year	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	1	32	23	35	39	36	66	84	63

Computer Science, Stanford University, BITS Pilani

Verified email at cs.stanford.edu

Machine Learning, Deep Learning, Parallel Computing, Data Mining, Conversational AI


Title	Authors	Venue	Cited by	Year
One TTS alignment to rule them all	R Badlani, A Łańcucki, KJ Shih, R Valle, W Ping, B Catanzaro	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	74	2022
Experiments on the DCASE challenge 2016: Acoustic scene classification and sound event detection in real life recording	B Elizalde, A Kumar, A Shah, R Badlani, E Vincent, B Raj, I Lane	arXiv preprint arXiv:1607.06706, 2016	58*	2016
Content-based representations of audio using siamese neural networks	P Manocha, R Badlani, A Kumar, A Shah, B Elizalde, B Raj	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	52	2018
RAD-TTS: Parallel flow-based TTS with robust alignment learning and diverse synthesis	KJ Shih, R Valle, R Badlani, A Lancucki, W Ping, B Catanzaro	ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit …, 2021	50	2021
NELS-Never-Ending Learner of Sounds	BR Benjamin Elizalde, Rohan Badlani, Ankit Shah, Anurag Kumar	NIPS Workshop on Machine Learning for Audio, 2018	34*	2018
An approach for self-training audio event detectors using web data	B Elizalde, A Shah, S Dalmia, MH Lee, R Badlani, A Kumar, B Raj, I Lane	2017 25th European Signal Processing Conference (EUSIPCO), 1863-1867, 2017	28*	2017
Disambiguating sentiment: An ensemble of humour, sarcasm, and hate speech features for sentiment classification	R Badlani, N Asnani, M Rai	W-NUT 2019, 337-345, 2019	22*	2019
Audio flamingo: A novel audio language model with few-shot learning and dialogue abilities	Z Kong, A Goel, R Badlani, W Ping, R Valle, B Catanzaro	arXiv preprint arXiv:2402.01831, 2024	14	2024
Generating and using joint representations of source code	R Badlani, O Lewis, G Evangelopoulos, O Hatalsky, B Ni	US Patent 11,169,786, 2021	10	2021
RAD-MMM: Multilingual multiaccented multispeaker text to speech	R Badlani, R Valle, KJ Shih, JF Santos, S Gururani, B Catanzaro	Proc. Interspeech 2023, 626-630, 2023	9*	2023
P-flow: a fast and data-efficient zero-shot TTS through speech prompting	S Kim, K Shih, JF Santos, E Bakhturina, M Desta, R Valle, S Yoon, ...	Advances in Neural Information Processing Systems 36, 2024	5	2024
Framework for evaluation of sound event detection in web videos	R Badlani, A Shah, B Elizalde, A Kumar, B Raj	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	5	2018
Relation extraction with contextualized relation embedding (CRE)	X Chen, R Badlani	arXiv preprint arXiv:2011.09658, 2020	4	2020
Pattern-based automatic parallelization of representative-based clustering algorithms	S Islam, S Balasubramaniam, S Gupta, S Brajesh, R Badlani, ...	2018 IEEE 5th International Conference on Data Science and Advanced …, 2018	4	2018
Automatic parallelization of representative-based clustering algorithms for multicore cluster systems	S Islam, S Balasubramaniam, S Gupta, S Brajesh, R Badlani, ...	International Journal of Data Science and Analytics 10, 135-159, 2020	3	2020
VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation	R Badlani, A Arora, S Ghosh, R Valle, KJ Shih, JF Santos, B Ginsburg, ...	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	2	2023
Generative modeling for low dimensional speech attributes with neural spline flows	KJ Shih, R Valle, R Badlani, JF Santos, B Catanzaro	arXiv preprint arXiv:2203.01786, 2022	2	2022
DCASE challenge task 1	A Kumar, B Elizalde, A Shah, R Badlani, E Vincent, B Raj, I Lane	Tech. Rep., DCASE2016 Challenge, 2016	2	2016
Synthesizing video from audio using one or more neural networks	MY Liu, K Nagano, S Yeongho, JRVG da Costa, SEO Jaewoo, TC Wang, ...	US Patent App. 17/382,027, 2023	1	2023
Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment	P Neekhara, S Hussain, S Ghosh, J Li, R Valle, R Badlani, B Ginsburg	arXiv preprint arXiv:2406.17957, 2024		2024