About Me
I am currently a Ph.D. student at the Department of Systems Engineering and Engineering Management, the Chinese University of Hong Kong (CUHK) supervised by Prof. LIU Xunying and co-supervised by Prof. CHEN Xie. Before that, I finished my Master and Bachelor degrees in CUHK and SouthEast University (SEU) respectively.
My research interests focus on long-context ASR, multimodal LLM, and streaming LLM.
🔥 News
- 2024.06: We released a multilingual ASR corpus GigaSpeech2 comprising 30,000 hours of Thai, Indonesian, and Vietnamese!
- 2024.06: 2 conference papers have been accepted by INTERSPEECH 2024.
- 2023.12: 2 conference papers have been accepted by ICASSP 2024.
- 2023.05: 5 conference papers have been accepted by INTERSPEECH 2023 including 1 first-author paper.
- 2023.02: 2 conference papers have been accepted by ICASSP 2023 and 1 journal has been accepted by IEEE TASLP.
- 2022.06: 3 conference papers have been accepted by INTERSPEECH 2022 including 1 first-author paper.
- 2021.06: 1 journal has been accepted by IEEE TASLP.
📖 Educations
- Since 2021.09, Ph.D. student, The Chinese University of Hong Kong (CUHK), China.
- 2019.09 - 2020.06, Master of Computer Science, the Chinese University of Hong Kong (CUHK), China.
- 2015.06 - 2019.07, Bachelor of Software Engineering, SouthEast University (SEU), China.
💻 Experience
- 2024.06 - Now, Research Intern, Noah’s Ark Lab, Hong Kong SAR, China.
- 2023.10 - 2024.02, Remote Research Intern, Speech Lab, Alibaba DAMO Academy, China.
- 2022.03 - 2023.09, Research Intern, International Digital Economy Academy (IDEA), China.
- 2020.08 - 2021.09, Research Assitant, The Chinese University of Hong Kong (CUHK), China.
📝 Selected Publications
-
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, Jinpeng Li, Bo Yang, et al. Submitted to NIPS 2024
-
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu.
ISCA Interspeech 2023, Dublin, Ireland (Oral Presentation) -
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
Jiajun Deng, Guinan Li, Xurong Xie, Zengrui Jin, Mingyu Cui, Tianzi Wang, Shujie Hu, Mengzhe Geng, Xunying Liu
ISCA Interspeech 2023, Dublin, Ireland (Oral Presentation) -
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One
Lingwei Meng, Jiawen Kang, Mingyu Cui, Yuejiao Wang, Xixin Wu, Helen Meng
IEEE ICASSP2023, Rhodes Island, Greece (Oral Presentation) -
Two-pass decoding and cross-adaptation based system combination of end-to-end conformer and hybrid tdnn asr systems
Mingyu Cui, Jiajun Deng, Shujie Hu, Xurong Xie, Tianzi Wang, Shoukang Hu, Mengzhe Geng, Boyang Xue, Xunying Liu, Helen Meng
ISCA Interspeech 2022, Incheon, KoreaMore paper details please find in Google Scholar
🎖 Teaching Assistance
- ENGG 1120 C, Linear Algebra
- FTEC 4006, Internet Finance
- SEEM 2420, Operations Research I