Rohit Prabhavalkar
Staff Research Scientist, Google
Verified email at google.com
Title
Cited by
Year
State-of-the-art speech recognition with sequence-to-sequence models
CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ...
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018
Cited by: 1458, Year: 2018
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 737, Year: 2019
Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer
K Rao, H Sak, R Prabhavalkar
IEEE Automatic Speech Recognition and Understanding (ASRU), 2017
Cited by: 410, Year: 2017
A Comparison of Sequence-to-Sequence Models for Speech Recognition
R Prabhavalkar, K Rao, TN Sainath, B Li, L Johnson, N Jaitly
Interspeech, 939-943, 2017
Cited by: 392, Year: 2017
An analysis of incorporating an external language model into a sequence-to-sequence model
A Kannan, Y Wu, P Nguyen, TN Sainath, Z Chen, R Prabhavalkar
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018
Cited by: 289, Year: 2018
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
C Donahue, B Li, R Prabhavalkar
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018
Cited by: 279, Year: 2018
Google USM: Scaling automatic speech recognition beyond 100 languages
Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ...
arXiv preprint arXiv:2303.01037, 2023
Cited by: 236, Year: 2023
Personalized speech recognition on mobile devices
I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 229, Year: 2016
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency
TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ...
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 228, Year: 2020
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
Cited by: 209, Year: 2019
Deep context: end-to-end contextual speech recognition
G Pundak, TN Sainath, R Prabhavalkar, A Kannan, D Zhao
2018 IEEE spoken language technology workshop (SLT), 418-425, 2018
Cited by: 202, Year: 2018
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models
R Prabhavalkar, TN Sainath, Y Wu, P Nguyen, Z Chen, CC Chiu, ...
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018
Cited by: 191, Year: 2018
Two-pass end-to-end speech recognition
TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ...
arXiv preprint arXiv:1908.10992, 2019
Cited by: 168, Year: 2019
From audio to semantics: Approaches to end-to-end spoken language understanding
P Haghani, A Narayanan, M Bacchiani, G Chuang, N Gaur, P Moreno, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 720-726, 2018
Cited by: 168, Year: 2018
Recognizing long-form speech using streaming end-to-end models
A Narayanan, R Prabhavalkar, CC Chiu, D Rybach, TN Sainath, ...
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
Cited by: 135, Year: 2019
On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition
R Prabhavalkar, O Alsharif, A Bruguier, I McGraw
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2016
Cited by: 120, Year: 2016
End-to-end speech recognition: A survey
R Prabhavalkar, T Hori, TN Sainath, R Schlüter, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Cited by: 118, Year: 2023
Automatic Gain Control and Multi-Style Training For Robust Small-Footprint Keyword Spotting With Deep Neural Networks
R Prabhavalkar, R Alvarez, C Parada, P Nakkiran, TN Sainath
International Conference on Acoustics, Speech and Signal Processing, 2015
Cited by: 109, Year: 2015
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models
Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw
IEEE Automatic Speech Recognition and Understanding (ASRU), 2017
Cited by: 103, Year: 2017
Compressing deep neural networks using a rank-constrained topology
P Nakkiran, R Alvarez, R Prabhavalkar, C Parada
INTERSPEECH, 2015
Cited by: 96, Year: 2015