Audio word2vec: Unsupervised learning of audio segment representations using sequence-to-sequence autoencoder YA Chung, CC Wu, CH Shen, HY Lee, LS Lee arXiv preprint arXiv:1603.00982, 2016 | 121 | 2016 |
Spoken content retrieval—beyond cascading speech recognition with text retrieval L Lee, J Glass, H Lee, C Chan IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (9), 1389 …, 2015 | 85 | 2015 |
Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations J Chou, C Yeh, H Lee, L Lee arXiv preprint arXiv:1804.02812, 2018 | 74 | 2018 |
Temporal pattern attention for multivariate time series forecasting SY Shih, FK Sun, H Lee Machine Learning 108 (8), 1421-1441, 2019 | 61 | 2019 |
Learning chinese word representations from glyphs of characters TR Su, HY Lee arXiv preprint arXiv:1708.04755, 2017 | 53 | 2017 |
Neural attention models for sequence classification: Analysis and application to key term extraction and dialogue act detection S Shen, H Lee arXiv preprint arXiv:1604.00077, 2016 | 52 | 2016 |
Supervised and unsupervised transfer learning for question answering YA Chung, HY Lee, J Glass arXiv preprint arXiv:1711.05345, 2017 | 47 | 2017 |
Mockingjay: Unsupervised speech representation learning with deep bidirectional transformer encoders AT Liu, S Yang, PH Chi, P Hsu, H Lee ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 42 | 2020 |
Towards machine comprehension of spoken content: Initial toefl listening comprehension test by machine BH Tseng, SS Shen, HY Lee, LS Lee arXiv preprint arXiv:1608.06378, 2016 | 35 | 2016 |
Segmental audio word2vec: Representing utterances as sequences of vectors with applications in spoken term detection YH Wang, H Lee, L Lee 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 34 | 2018 |
Unsupervised learning of audio segment representations using sequence-to-sequence recurrent neural networks YA Chung, CC Wu, CH Shen, HY Lee, LS Lee Proc. Interspeech, 765-769, 2016 | 32 | 2016 |
Interactive spoken document retrieval with suggested key terms ranked by a Markov decision process YC Pan, HY Lee, LS Lee IEEE Transactions on Audio, Speech, and Language Processing 20 (2), 632-645, 2011 | 32 | 2011 |
Learning to encode text as human-readable summaries using generative adversarial networks YS Wang, HY Lee arXiv preprint arXiv:1810.02851, 2018 | 30 | 2018 |
Spoken SQuAD: A study of mitigating the impact of speech recognition errors on listening comprehension CH Li, SL Wu, CL Liu, H Lee arXiv preprint arXiv:1804.00320, 2018 | 29 | 2018 |
Improved spoken term detection with graph-based re-ranking in feature space YN Chen, CP Chen, HY Lee, CA Chan, LS Lee 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 29 | 2011 |
Improving conditional sequence generative adversarial networks by stepwise evaluation YL Tuan, HY Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (4), 788-798, 2019 | 27 | 2019 |
One-shot voice conversion by separating speaker and content representations with instance normalization J Chou, C Yeh, H Lee arXiv preprint arXiv:1904.05742, 2019 | 26 | 2019 |
Enhanced spoken term detection using support vector machines and weighted pseudo examples HY Lee, LS Lee IEEE transactions on audio, speech, and language processing 21 (6), 1272-1284, 2013 | 26 | 2013 |
Tree transformer: Integrating tree structures into self-attention YS Wang, HY Lee, YN Chen arXiv preprint arXiv:1909.06639, 2019 | 24 | 2019 |
Gate activation signal analysis for gated recurrent neural networks and its correlation with phoneme boundaries YH Wang, CT Chung, H Lee arXiv preprint arXiv:1703.07588, 2017 | 24 | 2017 |