LipReading with 3D-2D-CNN BLSTM-HMM and word-CTC models DK Margam, R Aralikatti, T Sharma, A Thanda, S Roy, SM Venkatesan arXiv preprint arXiv:1906.12170, 2019 | 16 | 2019 |
Speaker Adaptation for Lip-Reading Using Visual Identity Vectors. PA Kandala, A Thanda, DK Margam, RC Aralikatti, T Sharma, S Roy, ... INTERSPEECH, 2758-2762, 2019 | 7 | 2019 |
Real Time Online Visual End Point Detection Using Unidirectional LSTM. T Sharma, RC Aralikatti, DK Margam, A Thanda, S Roy, PA Kandala, ... INTERSPEECH, 2000-2004, 2019 | 4 | 2019 |
Audio-Visual Decision Fusion for WFST-based and seq2seq Models R Aralikatti, S Roy, A Thanda, DK Margam, PA Kandala, T Sharma, ... arXiv preprint arXiv:2001.10832, 2020 | 1 | 2020 |