CLAP: Learning audio concepts from natural language supervision B Elizalde, S Deshmukh, M Al Ismail, H Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 196 | 2023 |
Detection of COVID-19 through the analysis of vocal fold oscillations M Al Ismail, S Deshmukh, R Singh ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 50 | 2021 |
Pengi: An audio language model for audio tasks S Deshmukh, B Elizalde, R Singh, H Wang Advances in Neural Information Processing Systems 36, 18090-18108, 2023 | 39 | 2023 |
Interpreting glottal flow dynamics for detecting COVID-19 from voice S Deshmukh, M Al Ismail, R Singh ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 35 | 2021 |
Audio Retrieval with WavText5K and CLAP Training S Deshmukh, B Elizalde, H Wang Proc. Interspeech 2023, 2948-2952, 2022 | 33 | 2022 |
Improving weakly supervised sound event detection with self-supervised auxiliary tasks S Deshmukh, B Raj, R Singh Proc. Interspeech 2021, 596-600, 2021 | 22* | 2021 |
Attacker behaviour profiling using stochastic ensemble of hidden Markov models S Deshmukh, R Rade, DF Kazi arXiv preprint arXiv:1905.11824, 2019 | 14 | 2019 |
NaRLE: Natural language models using reinforcement learning with emotion feedback R Zhou, S Deshmukh, J Greer, C Lee arXiv preprint arXiv:2110.02148, 2021 | 12 | 2021 |
Natural language supervision for general-purpose audio representations B Elizalde, S Deshmukh, H Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 11 | 2024 |
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session LM Heller, B Elizalde, B Raj, S Deshmukh arXiv preprint arXiv:2302.09719, 2023 | 10 | 2023 |
Tackling toxic online communication with recurrent capsule networks S Deshmukh, R Rade 2018 Conference on Information and Communication Technology (CICT), 1-7, 2018 | 10 | 2018 |
Prompting audios using acoustic properties for emotion representation H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9* | 2024 |
LoFT: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ... arXiv preprint arXiv:2310.04445, 2023 | 7 | 2023 |
Multi-view learning for speech emotion recognition with categorical emotion, categorical sentiment, and dimensional scores D Tompkins, D Emmanouilidou, S Deshmukh, B Elizalde ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Temporal and stochastic modelling of attacker behaviour R Rade, S Deshmukh, R Nene, AS Wadekar, A Unny Advances in Data Science: Third International Conference on Intelligent …, 2019 | 5 | 2019 |
Training audio captioning models without audio S Deshmukh, B Elizalde, D Emmanouilidou, B Raj, R Singh, H Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
PAM: Prompting Audio-Language Models for Audio Quality Assessment S Deshmukh, D Alharthi, B Elizalde, H Gamper, MA Ismail, R Singh, B Raj, ... arXiv preprint arXiv:2402.00282, 2024 | 2 | 2024 |
Zero-Shot Transfer for Wildlife Bioacoustics Detection Z Miao, B Elizalde, S Deshmukh, J Kitzes, H Wang, R Dodhia, JML Ferres | 1 | 2023 |
Training framework for automated tasks involving multiple machine learning models CY Lee, R Zhou, N Nishikant, SS Deshmukh, JD Greer US Patent App. 17/516,940, 2023 | 1 | 2023 |
Adapting task-oriented dialogue models for email conversations S Deshmukh, C Lee arXiv preprint arXiv:2208.09439, 2022 | 1 | 2022 |