Desh Raj

Cited by

	All	Since 2019
Citations	993	976
h-index	15	15
i10-index	17	17

280

140

210

201720182019202020212022202320243 12 19 79 178 242 261 196

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Sanjeev KhudanpurThe Johns Hopkins UniversityVerified email at jhu.edu
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Daniel PoveyChief Speech Scientist, Xiaomi Corp.Verified email at xiaomi.com
Leibny Paola GarciaJohns Hopkins UniversityVerified email at jhu.edu
Zili HuangJohns Hopkins UniversityVerified email at jhu.edu
Jan "Yenda" TrmalAssociate Research Scientist at Johns Hopkins UniversityVerified email at jhu.edu
Zhuo ChenBytedance (formerly Microsoft, Columbia University)Verified email at columbia.edu
David SnyderApple Inc.Verified email at apple.com
Takuya YoshiokaAssemblyAIVerified email at assemblyai.com
Naoyuki KandaMicrosoftVerified email at microsoft.com
Yusuke FujitaLY Corp.Verified email at linecorp.com
Shota HoriguchiNTT CorporationVerified email at ntt.com
Xuankai ChangApple - Carnegie Mellon UniversityVerified email at apple.com
Vimal ManoharMeta Platforms Inc.Verified email at meta.com
Aswin Shanmugam SubramanianMicrosoftVerified email at microsoft.com
Christoph BoeddekerPaderborn UniversityVerified email at mail.upb.de
Zhaoheng NiMeta Reality LabsVerified email at meta.com
Neville RyantUniversity of PennsylvaniaVerified email at ldc.upenn.edu
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)Verified email at google.com
Hakan ErdoganGoogleVerified email at google.com

Desh Raj

Meta AI

Verified email at meta.com - Homepage

Speech Recognition Deep Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020	307	2020
Probing the information encoded in x-vectors D Raj, D Snyder, D Povey, S Khudanpur 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	114	2019
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ... 2021 IEEE spoken language technology workshop (SLT), 897-904, 2021	86	2021
Dover-lap: A method for combining overlap-aware diarization outputs D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021	72	2021
Learning local and global contexts using a convolutional recurrent network model for relation classification in biomedical text D Raj, S Sahu, A Anand Proceedings of the 21st conference on computational natural language …, 2017	48	2017
Sequential multi-frame neural beamforming for speech separation and enhancement ZQ Wang, H Erdogan, S Wisdom, K Wilson, D Raj, S Watanabe, Z Chen, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 905-911, 2021	47	2021
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ... arXiv preprint arXiv:2102.01363, 2021	39	2021
Multi-class spectral clustering with overlaps for speaker diarization D Raj, Z Huang, S Khudanpur 2021 IEEE Spoken Language Technology Workshop (SLT), 582-589, 2021	34	2021
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ... arXiv preprint arXiv:2306.13734, 2023	31	2023
Target-speaker voice activity detection with improved i-vector estimation for unknown number of speaker M He, D Raj, Z Huang, J Du, Z Chen, S Watanabe arXiv preprint arXiv:2108.03342, 2021	31	2021
GPU-accelerated guided source separation for meeting transcription D Raj, D Povey, S Khudanpur arXiv preprint arXiv:2212.05271, 2022	30	2022
Using ASR methods for OCR A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ... 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019	23	2019
Uncertain fuzzy self-organization based clustering: interval type-2 approach to adaptive resonance theory S Majheed, A Gupta, D Raj, FCH Rhee Information Sciences, 2017	21*	2017
Continuous streaming multi-talker asr with dual-path transducers D Raj, L Lu, Z Chen, Y Gaur, J Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	18	2022
The JHU multi-microphone multi-speaker ASR system for the CHiME-6 challenge A Arora, D Raj, AS Subramanian, K Li, B Ben-Yair, M Maciejewski, ... arXiv preprint arXiv:2006.07898, 2020	16	2020
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings Z Huang, D Raj, P García, S Khudanpur ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	15	2023
Analysis of Data Generated from Multidimensional Type-1 and Type-2 Fuzzy Membership Functions D Raj, A Gupta, B Garg, K Tanna, FCH Rhee IEEE Transactions on Fuzzy Systems, 0	12*
Low-latency speech separation guided diarization for telephone conversations G Morrone, S Cornell, D Raj, L Serafini, E Zovato, A Brutti, S Squartini 2022 IEEE Spoken Language Technology Workshop (SLT), 641-646, 2023	8	2023
Injecting text and cross-lingual supervision in few-shot learning from self-supervised models M Wiesner, D Raj, S Khudanpur ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	8	2022
Joint speaker diarization and speech recognition based on region proposal networks Z Huang, M Delcroix, LP Garcia, S Watanabe, D Raj, S Khudanpur Computer Speech & Language 72, 101316, 2022	6	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors