Siddhant Arora
Siddhant Arora
Graduate Student, Carnegie Mellon University
Verified email at
Cited by
Cited by
A survey on graph neural networks for knowledge graph completion
S Arora
arXiv preprint arXiv:2007.12374, 2020
Espnet-slu: Advancing spoken language understanding through espnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Explain, edit, and understand: Rethinking user study design for evaluating model explanations
S Arora, D Pruthi, N Sadeh, WW Cohen, ZC Lipton, G Neubig
Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5277-5285, 2022
SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks
S Shon, S Arora, CJ Lin, A Pasad, F Wu, R Sharma, WL Wu, HY Lee, ...
arXiv preprint arXiv:2212.10525, 2022
Reproducing whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
BERT meets CTC: New formulation of end-to-end speech recognition with pre-trained masked language model
Y Higuchi, B Yan, S Arora, T Ogawa, T Kobayashi, S Watanabe
arXiv preprint arXiv:2210.16663, 2022
Capreolus: A toolkit for end-to-end neural ad hoc retrieval
A Yates, S Arora, X Zhang, W Yang, KM Jose, J Lin
Proceedings of the 13th International Conference on Web Search and Data …, 2020
A study on the integration of pre-trained ssl, asr, lm and slu models for spoken language understanding
Y Peng, S Arora, Y Higuchi, Y Ueda, S Kumar, K Ganesan, S Dalmia, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 406-413, 2023
Two-pass low latency end-to-end spoken language understanding
S Arora, S Dalmia, X Chang, B Yan, A Black, S Watanabe
arXiv preprint arXiv:2207.06670, 2022
Rethinking end-to-end evaluation of decomposable tasks: A case study on spoken language understanding
S Arora, A Ostapenko, V Viswanathan, S Dalmia, F Metze, S Watanabe, ...
arXiv preprint arXiv:2106.15065, 2021
OWSM v3. 1: Better and faster open whisper-style speech models based on e-branchformer
Y Peng, J Tian, W Chen, S Arora, B Yan, Y Sudo, M Shakeel, K Choi, ...
arXiv preprint arXiv:2401.16658, 2024
Token-level sequence labeling for spoken language understanding using compositional end-to-end models
S Arora, S Dalmia, B Yan, F Metze, AW Black, S Watanabe
arXiv preprint arXiv:2210.15734, 2022
Blockwise streaming transformer for spoken language understanding and simultaneous speech translation
K Deng, S Watanabe, J Shi, S Arora
arXiv preprint arXiv:2204.08920, 2022
On embeddings in relational databases
S Arora, S Bedathur
arXiv preprint arXiv:2005.06437, 2020
A comparative study on e-branchformer vs conformer in speech recognition, translation, and understanding tasks
Y Peng, K Kim, F Wu, B Yan, S Arora, W Chen, J Tang, S Shon, P Sridhar, ...
arXiv preprint arXiv:2305.11073, 2023
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
A tale of two regulatory regimes: Creation and analysis of a bilingual privacy policy corpus
S Arora, H Hosseini, C Utz, VK Bannihatti, T Dhellemmes, A Ravichander, ...
LREC proceedings, 2022
CMU’s IWSLT 2023 simultaneous speech translation system
B Yan, J Shi, S Maiti, W Chen, X Li, Y Peng, S Arora, S Watanabe
Proceedings of the 20th International Conference on Spoken Language …, 2023
Creation and analysis of an international corpus of privacy laws
S Gupta, E Poplavska, N O'Toole, S Arora, T Norton, N Sadeh, S Wilson
arXiv preprint arXiv:2206.14169, 2022
Bert meets relational db: Contextual representations of relational databases
S Arora, V Gupta, G Gaur, S Bedathur
arXiv preprint arXiv:2104.14914, 2021
The system can't perform the operation now. Try again later.
Articles 1–20