Mohan Li
Toshiba Europe Ltd
Verified email at toshiba.eu
Title
Cited by
Year
End-to-end speech recognition with adaptive computation steps
M Li, M Liu, H Masanori
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 39 · 2019
Head-synchronous decoding for transformer-based streaming ASR
M Li, C Zorilă, R Doddipatla
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 22 · 2021
Transformer-based online speech recognition with decoder-end adaptive computation steps
M Li, C Zorilă, R Doddipatla
2021 IEEE Spoken Language Technology Workshop (SLT), 1-7, 2021
Cited by 22 · 2021
Non-autoregressive end-to-end approaches for joint automatic speech recognition and spoken language understanding
M Li, R Doddipatla
2022 IEEE Spoken Language Technology Workshop (SLT), 390-397, 2023
Cited by 10 · 2023
Transformer-based streaming ASR with cumulative attention
M Li, S Zhang, C Zorilă, R Doddipatla
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 10 · 2022
An investigation into the multi-channel time domain speaker extraction network
C Zorilă, M Li, R Doddipatla
2021 IEEE Spoken Language Technology Workshop (SLT), 793-800, 2021
Cited by 6 · 2021
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results.
M Li, Y Cao, W Zhou, M Liu
Interspeech, 1641-1645, 2019
Cited by 6 · 2019
Dialoc: An iterative approach to embodied dialog localization
C Zhang, M Li, I Budvytis, S Liwicki
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 3 · 2024
Multiple-hypothesis RNN-T Loss for unsupervised fine-tuning and self-training of neural transducer
CT Do, M Li, R Doddipatla
arXiv preprint arXiv:2207.14736, 2022
Cited by 3 · 2022
Toshiba’s speech recognition system for the CHiME 2020 challenge
C Zorila, M Li, D Hayakawa, M Liu, N Ding, R Doddipatla
Proc. of The 6th Intl. Workshop on Speech Processing in Everyday …, 2020
Cited by 3 · 2020
Prompting Whisper for QA-driven zero-shot end-to-end spoken language understanding
M Li, S Keizer, R Doddipatla
arXiv preprint arXiv:2406.15209, 2024
Cited by 2 · 2024
Towards a unified end-to-end language understanding system for speech and text inputs
M Li, C Zorilă, CT Do, R Doddipatla
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by 2 · 2023
Cumulative attention based streaming transformer ASR with internal language model joint training and rescoring
M Li, CT Do, R Doddipatla
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 2 · 2023
Domain adaptive self-supervised training of automatic speech recognition
CT Do, R Doddipatla, M Li, T Hain
Proc. INTERSPEECH 2023, 4389-4393, 2023
Cited by 2 · 2023
Improving HS-DACS based streaming Transformer ASR with deep reinforcement learning
M Li, R Doddipatla
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by 2 · 2021
Towards a speaker diarization system for the CHiME 2020 dinner party transcription
C Boeddeker, T Cord-Landwehr, J Heitkaemper, C Zorila, D Hayakawa, ...
Proc. 6th International Workshop on Speech Processing in Everyday …, 2020
Cited by 2 · 2020
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding
M Li, CT Do, S Keizer, Y Farag, S Stoyanchev, R Doddipatla
2024 IEEE Spoken Language Technology Workshop (SLT), 1115-1122, 2024
Cited by 1 · 2024
Speech recognition systems and methods
LI Mohan, T Zorila, RS Doddipatla
US Patent 12,002,450, 2024
2024
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition
M Li, R Doddipatla, C Zorila
Proc. Interspeech 2022, 2088-2092, 2022
2022
An Investigation into the Multi-channel Time Domain Speaker Extraction Network
R Doddipatla, C Zorila, M Li
2021 IEEE Spoken Language Technology Workshop (SLT), 2021
2021