Purely sequence-trained neural networks for ASR based on lattice-free MMI. D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ... Interspeech, 2751-2755, 2016 | 1023 | 2016 |
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020 | 331 | 2020 |
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. G Sell, D Snyder, A McCree, D Garcia-Romero, J Villalba, M Maciejewski, ... Interspeech, 2808-2812, 2018 | 252 | 2018 |
Voicebox: Text-guided multilingual universal speech generation at scale M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ... Advances in neural information processing systems 36, 2024 | 179 | 2024 |
An Exploration of Dropout with LSTMs. G Cheng, V Peddinti, D Povey, V Manohar, S Khudanpur, Y Yan Interspeech, 1586-1590, 2017 | 147 | 2017 |
JHU ASpIRE system: Robust LVCSR with TDNNs, iVector adaptation and RNN-LMs V Peddinti, G Chen, V Manohar, T Ko, D Povey, S Khudanpur Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on, 2015 | 134 | 2015 |
Acoustic Modelling from the Signal Domain Using CNNs. P Ghahremani, V Manohar, D Povey, S Khudanpur Interspeech, 3434-3438, 2016 | 121 | 2016 |
Investigation of transfer learning for ASR using LF-MMI trained neural networks P Ghahremani, V Manohar, H Hadian, D Povey, S Khudanpur 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 91 | 2017 |
Semi-supervised training of acoustic models using lattice-free MMI V Manohar, H Hadian, D Povey, S Khudanpur 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 86 | 2018 |
A teacher-student learning approach for unsupervised domain adaptation of sequence-trained ASR models V Manohar, P Ghahremani, D Povey, S Khudanpur 2018 IEEE Spoken Language Technology Workshop (SLT), 250-257, 2018 | 69 | 2018 |
JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning V Manohar, D Povey, S Khudanpur 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 68 | 2017 |
Semi-supervised maximum mutual information training of deep neural network acoustic models. V Manohar, D Povey, S Khudanpur Interspeech, 2630-2634, 2015 | 60 | 2015 |
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ... Proc. CHiME-5, 6-10, 2018 | 54 | 2018 |
A keyword search system using open source software J Trmal, G Chen, D Povey, S Khudanpur, P Ghahremani, X Zhang, ... 2014 IEEE Spoken Language Technology Workshop (SLT), 530-535, 2014 | 52 | 2014 |
Far-Field ASR Without Parallel Data. V Peddinti, V Manohar, Y Wang, D Povey, S Khudanpur Interspeech 9, 1996-2000, 2016 | 51 | 2016 |
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ... Interspeech, 3597-3601, 2017 | 47 | 2017 |
ASR for under-resourced languages from probabilistic transcription MA Hasegawa-Johnson, P Jyothi, D McCloy, M Mirbagheri, GM Di Liberto, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (1), 50-63, 2016 | 41 | 2016 |
Adapting ASR for under-resourced languages using mismatched transcriptions C Liu, P Jyothi, H Tang, V Manohar, M Hasegawa-Johnson, S Khudanpur Acoustics, Speech and Signal Processing (ICASSP 2016), IEEE International …, 2016 | 31 | 2016 |
Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system V Manohar, SJ Chen, Z Wang, Y Fujita, S Watanabe, S Khudanpur ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 29 | 2019 |
Characterizing performance of speaker diarization systems on far-field speech using standard methods M Maciejewski, D Snyder, V Manohar, N Dehak, S Khudanpur 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 27 | 2018 |