End-to-end speech recognition with adaptive computation steps M Li, M Liu, H Masanori ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 39 | 2019 |
Head-synchronous decoding for transformer-based streaming ASR M Li, C Zorilă, R Doddipatla ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 22 | 2021 |
Transformer-based online speech recognition with decoder-end adaptive computation steps M Li, C Zorilă, R Doddipatla 2021 IEEE spoken language technology workshop (SLT), 1-7, 2021 | 22 | 2021 |
Non-autoregressive end-to-end approaches for joint automatic speech recognition and spoken language understanding M Li, R Doddipatla 2022 IEEE Spoken Language Technology Workshop (SLT), 390-397, 2023 | 10 | 2023 |
Transformer-based streaming ASR with cumulative attention M Li, S Zhang, C Zorilă, R Doddipatla ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
An investigation into the multi-channel time domain speaker extraction network C Zorilă, M Li, R Doddipatla 2021 IEEE Spoken Language Technology Workshop (SLT), 793-800, 2021 | 6 | 2021 |
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results. M Li, Y Cao, W Zhou, M Liu Interspeech, 1641-1645, 2019 | 6 | 2019 |
Dialoc: An iterative approach to embodied dialog localization C Zhang, M Li, I Budvytis, S Liwicki Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |
Multiple-hypothesis RNN-T Loss for unsupervised fine-tuning and self-training of neural transducer CT Do, M Li, R Doddipatla arXiv preprint arXiv:2207.14736, 2022 | 3 | 2022 |
Toshiba’s speech recognition system for the CHiME 2020 challenge C Zorila, M Li, D Hayakawa, M Liu, N Ding, R Doddipatla Proc. of The 6th Intl. Workshop on Speech Processing in Everyday …, 2020 | 3 | 2020 |
Prompting Whisper for QA-driven zero-shot end-to-end spoken language understanding M Li, S Keizer, R Doddipatla arXiv preprint arXiv:2406.15209, 2024 | 2 | 2024 |
Towards a unified end-to-end language understanding system for speech and text inputs M Li, C Zorilă, CT Do, R Doddipatla 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 2 | 2023 |
Cumulative attention based streaming transformer ASR with internal language model joint training and rescoring M Li, CT Do, R Doddipatla ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Domain adaptive self-supervised training of automatic speech recognition CT Do, R Doddipatla, M Li, T Hain Proc. INTERSPEECH 2023, 4389-4393, 2023 | 2 | 2023 |
Improving HS-DACS based streaming Transformer ASR with deep reinforcement learning M Li, R Doddipatla 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 2 | 2021 |
Towards a speaker diarization system for the CHiME 2020 dinner party transcription C Boeddeker, T Cord-Landwehr, J Heitkaemper, C Zorila, D Hayakawa, ... Proc. 6th International Workshop on Speech Processing in Everyday …, 2020 | 2 | 2020 |
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding M Li, CT Do, S Keizer, Y Farag, S Stoyanchev, R Doddipatla 2024 IEEE Spoken Language Technology Workshop (SLT), 1115-1122, 2024 | 1 | 2024 |
Speech recognition systems and methods LI Mohan, T Zorila, RS Doddipatla US Patent 12,002,450, 2024 | | 2024 |
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition M Li, R Doddipatla, C Zorila Proc. Interspeech 2022, 2088-2092, 2022 | | 2022 |
An Investigation into the Multi-channel Time Domain Speaker Extraction Network R Doddipatla, C Zorila, M Li 2021 IEEE Spoken Language Technology Workshop (SLT), 2021 | | 2021 |