I am an Associate Researcher and Master's Supervisor at the School of Computer Science and Technology, East China Normal University. I was selected for the Shanghai Youth Science and Technology Rising Star Program (上海市青年科技英才扬帆计划) in 2023. My research focuses on foundational issues in NLP and on AI for medicine and education, including large language models, clinical NLP, and AI-assisted education. I completed my Ph.D. in 2022 at the Research Center for Social Computing and Information Retrieval (SCIR), Harbin Institute of Technology, under the supervision of Professor Wanxiang Che. During my Ph.D., I also worked as an algorithm engineer intern at Tencent Jarvis Lab, under the supervision of Yefeng Zheng. From 2022 to 2024, I was an associate researcher at Shanghai AI Lab, working on large language models for the medical domain.

💻 Work Experience

2024.05 - Present, Associate Researcher, East China Normal University.
2022.10 - 2024.04, Associate Researcher, Shanghai AI Lab.
2020.05 - 2021.05, Algorithm Engineer Intern, Tencent.
2019.04 - 2019.09, Algorithm Engineer Intern, Tencent.

📖 Education

2016.09 - 2022.09, Ph.D. in Computer Science, Harbin Institute of Technology.
2012.09 - 2016.06, Bachelor's in Mathematics, Harbin Institute of Technology.

🎖 Honors and Awards

Shanghai Youth Science and Technology Rising Star Program (上海市青年科技英才扬帆计划), 2023.

💬 Invited Talks

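2025.05: Communication University of Zhejiang, “The Transformation of Education Products in the AIGC Era: Towards Personalization and Intelligence”.
2024.09: China Digital Economy Innovation and Development Conference, “The Impact of Computing Power in the AI Era and Applications of Intelligent Education”.
2024.04: The Hong Kong Polytechnic University, “A Brief Introduction of Medical Dialogue System”.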

🔥 News

2025.05: 🎉🎉 One paper “Mis-prompt: Benchmarking Large Language Models for Proactive Error Handling” was accepted to ACL 2025.
2025.05: 🎉🎉 One paper “LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios” was accepted to ACL 2025.
2025.05: 🎉🎉 One paper “Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation Capability” was accepted to ACL 2025 Findings.

📝 Publications

Mis-prompt: Benchmarking Large Language Models for Proactive Error Handling.
Jiayi Zeng, Yizhe Feng, Mengliang He, Wenhui Lei, Wei Zhang, Zeming Liu, Xiaoming Shi, and Aimin Zhou.
In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025). (CCF A)
LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios.
Xiaodong Wu, Minhao Wang, Yichen Liu, Xiaoming Shi, He Yan, Lu Xiangju, Junmin Zhu, and Wei Zhang.
In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025). (CCF A)
Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation Capability.
Mengliang He, Jiayi Zeng, Yankai Jiang, Wei Zhang, Zeming Liu, Xiaoming Shi, and Aimin Zhou.
In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025) Findings. (CCF A)
Cost-effective Instruction Learning for Pathology Vision and Language Analysis.
Kaitao Chen, Mianxin Liu, Fang Yan, Lei Ma, Xiaoming Shi, Lilong Wang, Xiaosong Wang, Lifeng Zhu, Zhe Wang, Mu Zhou, and Shaoting Zhang.
Nature Computational Science (2025). (Q1)
[Paper]
Medical Dialogue System: A Survey of Categories, Methods, Evaluation and Challenges.
Xiaoming Shi, Zeming Liu, Li Du, Yuxuan Wang, Hongru Wang, Yuhang Guo, Tong Ruan, Jie Xu, and Shaoting Zhang.
In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Findings. (CCF A)
[Paper]
MidMed: Towards Mixed-Type Dialogues for Medical Consultation.
Xiaoming Shi, Zeming Liu, Chuan Wang, Haitao Leng, Kui Xue, Xiaofan Zhang, and Shaoting Zhang.
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023). (CCF A)
[Paper] [Resource]
Understanding Medical Conversations with Scattered Keyword Attention and Weak Supervision from Responses.
Xiaoming Shi, Haifeng Hu, Wanxiang Che, Zhongqian Sun, Ting Liu and Junzhou Huang.
In Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020). (CCF A)
[Paper] [Code]
Understanding Patient Query with Weak Supervision from Doctor Response.
Xiaoming Shi, Sendong Zhao, Wanxiang Che, and Yefeng Zheng.
IEEE Journal of Biomedical and Health Informatics 26, no. 6 (2021): 2770-2777. (Q1)
[Paper] [Code]
Learning Semantic Alignment with Global Modality Reconstruction for Video-Language Pre-training towards Retrieval.
Mingchao Li, Xiaoming Shi, Haitao Leng, Wei Zhou, Haitao Zheng, and Kuncai Zhang. (Co-first author)
In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023). (CCF A)
[Paper]
Combating with Extremely Noisy Samples in Weakly Supervised Slot Filling for Automatic Diagnosis.
Xiaoming Shi and Wanxiang Che.
Frontiers of Computer Science 17, no. 5 (2023): 175333. (Q1)
[Paper]
Online Action Detection with Learning Future Representations by Contrastive Learning.
Haitao Leng, Xiaoming Shi, Wei Zhou, Kuncai Zhang, Qiankun Shi, and Pengcheng Zhu.
In Proceedings of the IEEE International Conference on Multimedia and Expo 2023. (CCF B)
[Paper]
MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models.
Mianxin Liu, Jinru Ding, Jie Xu, Weiguo Hu, Xiaoyang Li, Lifeng Zhu, Zhian Bai, Xiaoming Shi, Benyou Wang, Haitao Song, Pengfei Liu, Xiaofan Zhang, Shanshan Wang, Kang Li, Haofen Wang, Tong Ruan, Xuanjing Huang, Xin Sun, and Shaoting Zhang.
Big Data Mining and Analytics, 2024. (Q1)
[Paper]
Coherency Improved Explainable Recommendation via Large Language Model.
Shijie Liu, Ruixin Ding, Weiha Lu, Jun Wang, Mo Yu, Xiaoming Shi, and Wei Zhang.
In Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025). (CCF A)
[Paper]
STAMPsy: Towards SpatioTemporal-Aware Mixed-Type Dialogues for Psychological Counseling.
Jieyi Wang, Yue Huang, Zeming Liu, Dexuan Xu, Chuan Wang, Xiaoming Shi, Ruiyuan Guan, Hongxing Wang, Weihua Yue, and Yu Huang.
In Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025). (CCF A)
[Paper]
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model.
Yongyu Yan, Kui Xue, Xiaoming Shi, Qi Ye, Jingping Liu, and Tong Ruan.
In Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2023). (CCF B)
[Paper] [Code]
KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus.
Xiaoming Shi, Zeming Liu, Yiming Lei, Chenkai Zhang, Haitao Leng, Chuan Wang, Qingjie Liu, and Yunhong Wang.
NAACL 2025. (CCF B)
[Paper]
Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring.
Honglin Mu, Han He, Yuxin Zhou, Yunlong Feng, Yang Xu, Libo Qin, Xiaoming Shi, Zeming Liu, Xudong Han, Qi Shi, Qingfu Zhu, and Wanxiang Che.
NAACL 2025. (CCF B)
[Paper]
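Safety of Large Language Models: Taxonomy, Evaluation, Attribution, Mitigation, and Outlook (in Chinese).
Silin Li, Tianwei Lan, Yuli Qiu, Yingyu Shan, Xiaoming Shi, Zeming Liu, Jiashu Yao, Li Zeng, Yuhang Guo, and Heyan Huang.
Chinese Journal of Computers (计算机学报), 2024. (CCF A-class Chinese-language journal)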

🧱 Patents

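A semantic recognition method, apparatus, computer device, and storage medium, CN112052318A.
Classification model training and automatic question answering method and apparatus for an automatic question answering system, CN112287089B.
Medical term annotation method, medical term mapping method, apparatus, and device, CN112989767B.
Medical term mapping method, apparatus, computer device, and storage medium, CN113761116A.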

Xiaoming Shi