About Me

I am a final-year Ph.D. candidate (expected graduation: April 2026) at the Department of Systems Engineering and Engineering Management, the Chinese University of Hong Kong (CUHK), supervised by Prof. LIU Xunying and co-supervised by Prof. CHEN Xie. Before that, I finished my Master and Bachelor degrees in CUHK and SouthEast University (SEU) respectively.

My research interests focus on multimodal speech-language models, self-supervised learning for speech, and large-scale ASR.

🔥 News

2026.01: Released Covo-Audio Technical Report as a core contributor.
2025.12: 1 first-author paper accepted by ICASSP 2026.
2025.09: 1 first-author journal accepted by IEEE/ACM TASLP.
2025.06: 1 first-author paper and 4 co-authored papers accepted by INTERSPEECH 2025. GigaSpeech 2 accepted by ACL 2025.
2025.01: 2 conference papers have been accepted by ICASSP 2025.
2024.06: 2 conference papers have been accepted by INTERSPEECH 2024.
2023.12: 2 conference papers have been accepted by ICASSP 2024.
2023.05: 5 conference papers have been accepted by INTERSPEECH 2023 including 1 first-author paper.
2023.02: 2 conference papers have been accepted by ICASSP 2023 and 1 journal has been accepted by IEEE TASLP.
2022.06: 3 conference papers have been accepted by INTERSPEECH 2022 including 1 first-author paper.
2021.06: 1 journal has been accepted by IEEE TASLP.

📖 Educations

Since 2021.09, Ph.D. student, The Chinese University of Hong Kong (CUHK), China.
2019.09 - 2020.06, Master of Computer Science, the Chinese University of Hong Kong (CUHK), China.
2015.06 - 2019.07, Bachelor of Software Engineering, SouthEast University (SEU), China.

💻 Experience

2025.12 - Now, Research Intern, Hunyuan, Tencent, Shenzhen, China.
2025.06 - 2025.12, Research Intern, Tencent AI Lab, Shenzhen, China.
2024.06 - 2025.06, Research Intern, Noah’s Ark Lab, Huawei, Hong Kong SAR, China.
2023.10 - 2024.02, Remote Research Intern, Speech Lab, Alibaba DAMO Academy, China.
2022.03 - 2023.09, Research Intern, International Digital Economy Academy (IDEA), China.
2020.08 - 2021.09, Research Assistant, The Chinese University of Hong Kong (CUHK), China.

📝 Selected Publications

Exploring SSL Discrete Tokens For Multilingual Automatic Speech Recognition
Mingyu Cui, Mengzhe Geng, Yiwen Shao, Jiawen Kang, Lingwei Meng, Dingdong Wang, Chenxing Li, Meng Yu, Xunying Liu
IEEE ICASSP 2026
Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR
Mingyu Cui, Yifan Yang, Jiajun Deng, Jiawen Kang, Shujie Hu, Tianzi Wang, Zhaoqing Li, Shiliang Zhang, Xie Chen, Xunying Liu
ISCA Interspeech 2025
Exploring Cross-Utterance Speech Contexts for Conformer-Transducer Speech Recognition Systems
Mingyu Cui, Mengzhe Geng, Jiajun Deng, Chengxi Deng, Jiawen Kang, Shujie Hu, Guinan Li, Tianzi Wang, Zhaoqing Li, Xie Chen, Xunying Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu
ISCA Interspeech 2023, Dublin, Ireland (Oral Presentation)
Two-pass Decoding and Cross-adaptation Based System Combination of End-to-End Conformer and Hybrid TDNN ASR Systems
Mingyu Cui, Jiajun Deng, Shujie Hu, Xurong Xie, Tianzi Wang, Shoukang Hu, Mengzhe Geng, Boyang Xue, Xunying Liu, Helen Meng
ISCA Interspeech 2022, Incheon, Korea
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, Jinpeng Li, Bo Yang, et al.
ACL 2025

More paper details please find in Google Scholar

🎖 Teaching Assistance

ENGG 1120 C, Linear Algebra
FTEC 4006, Internet Finance
SEEM 2420, Operations Research I