Xiao Yu (余啸)

The State Key Laboratory of Blockchain and Data Security, Zhejiang University, China

Contact me at: xiao.yu@zju.edu.cn

  Biography

I am currently a Research Fellow at the State Key Laboratory of Blockchain and Data Security, Zhejiang University, Hangzhou, China.

Prior to joining Zhejiang University, I was a Postdoctoral Researcher at Huawei, where I had the privilege of working under the supervision of Prof. Xin Xia.

I received my Ph.D. degree in December 2020 from the School of Computer Science, Wuhan University, China, under the supervision of Prof. Jin Liu.

Additionally, I earned a joint Ph.D. degree in March 2021 from the Department of Computer Science, City University of Hong Kong, supervised by Prof. Qing Li and Prof. Jacky Wai Keung.

  Research Interests

LLMs Data Governance and Evaluation

LLMs Data Governance and Evaluation is conducting research on large language model (LLM) data engineering, addressing hallucination phenomena, and evaluating task-specific capabilities of LLMs and Agents within the domain of software engineering.

Intelligent Software Engineering

Intelligent Software Engineering is harnessing deep learning and LLM technologies for advancing tasks such as automated code generation, code annotation and maintenance, Stack Overflow question title generation, and bug report title generation, aiming to enhance the efficiency and effectiveness of software development and maintenance processes.

Software Security and Reliability

Software Security and Reliability is investigating methodologies for software vulnerability and defect detection, log anomaly identification, security bug report classification, as well as the detection of code smells and technical debt, with the goal of ensuring secure and reliable software systems.

  Selected Publications Google Scholar

* indicates the corresponding authors, # indicates the supervised students

(12) Guancheng Lin#, Xiao Yu*, Jacky Keung, Xing Hu, Xin Xia, Alex X. Liu. Don’t Use a Cannon to Kill a Fly: Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations. In the 35th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2026), 2026.

(11) Fengji Zhang#, Linquan Wu, Huiyu Bai, Guancheng Lin, Xiao Li, Xiao Yu*, Yue Wang, Bei Chen, Jacky Keung. HumanEval-V: Systematic Evaluation of Visual Reasoning in Large Multimodal Models for Code Generation. ACM Transactions on Software Engineering and Methodology, 2026. [ACM]

(10) Xiao Yu, Haoxuan Chen#, Lei Liu#, Xing Hu, Jacky Wai Keung, Xin Xia*. RealisticCodeBench: Towards More Realistic Evaluation of Large Language Models for Code Generation. In Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering (ASE 2025), 2025: 3021-3033. [PDF] [IEEE]

(9) Yizhou Chen, Zeyu Sun, Guoqing Wang, Qingyuan Liang, Xiao Yu, Dan Hao. From Cryptic to Clear - Training on LLM Explanations to Detect Smart Contract Vulnerabilities. ACM Transactions on Software Engineering and Methodology, 2026, 35 (6): 146:1-146:24. [PDF] [ACM]

(8) Xiaoxue Ma, Yishu Li, Jacky Keung, Xiao Yu*, Huiqi Zou, Zhen Yang, Federica Sarro, Earl T Barr. Practitioners’ expectations on log anomaly detection. IEEE Transactions on Software Engineering, 2025, 51(9): 2455-2471. [PDF] [IEEE]

(7) Xiaoxue Ma, Huiqi Zou, Pinjia He, Jacky Keung, Yishu Li, Xiao Yu*, Federica Sarro, On the Influence of Data Resampling for Deep Learning-Based Log Anomaly Detection: Insights and Recommendations. IEEE Transactions on Software Engineering, 2025, 51(1): 243-261. [PDF] [IEEE]

(6) Xiao Yu, Guancheng Lin#, Xing Hu*, Jacky Keung, Xin Xia. Less is More: Unlocking Semi-Supervised Deep Learning for Vulnerability Detection. ACM Transactions on Software Engineering and Methodology, 2025, 34(3): 62:1-62:37. [PDF] [ACM]

(5) Xiao Yu, Lei Liu#, Xing Hu*, Jacky Keung, Jin Liu, Xin Xia. Fight Fire with Fire: How Much Can We Trust ChatGPT on Source Code-Related Tasks? . IEEE Transactions on Software Engineering, 2024, 50(12): 3435-3453. Reported by IEEE Spectrum. [PDF] [IEEE]

(4) Peng Zhang, Yang Wang, Xutong Liu, Zeyu Lu, Yibiao Yang, Yanhui Li, Lin Chen, Ziyuan Wang, Chang-Ai Sun, Xiao Yu, Yuming Zhou. Assessing Effectiveness of Test Suites: What Do We Know and What Should We Do? . ACM Transactions on Software Engineering and Methodology, 2024, 33(4): 86:1-86:32. [PDF] [ACM]

(3) Xiao Yu, Zexian Zhang#, Feifei Niu*, Xing Hu, Xin Xia, John Grundy. What Makes a High-Quality Training Dataset for Large Language Models: A Practitioners’ Perspective. In Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE 2024), 2024: 656-668. [PDF] [ACM]

(2) Xiao Yu, Lei Liu#, Xing Hu*, Jacky Keung, Xin Xia, David Lo. Practitioners’ Expectations on Automated Test Generation. In Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2024), 2024: 1618-1630. [PDF] [ACM]

(1) Zhen Yang#, Jacky Wai Keung, Xiao Yu*, Yan Xiao, Zhi Jin*, Jingyu Zhang. On the significance of category prediction for code-comment synchronization. ACM Transactions on Software Engineering and Methodology, 2023, 32(2): 30:1-30:41. [PDF] [ACM]

  Service

Journal Reviewers  
  • ACM Transactions on Software Engineering and Methodology

  • IEEE Transactions on Software Engineering

  • IEEE Transactions on Dependable and Secure Computing

  • IEEE Transactions on Reliability

  • Empirical Software Engineering

  • Information and Software Technology

  • Journal of Systems and Software

  • Journal of Software: Evolution and Process

  • Software: Practice and Experience

  • IET Software

Program Committee Member  
  • The 42nd IEEE International Conference on Software Maintenance and Evolution (ICSME 2026)

  • The 50th IEEE Annual Computers, Software, and Applications Conference (COMPSAC 2026)

  • The 30th International Conference on Evaluation and Assessment in Software Engineering (EASE 2026)

  • The 31st/32nd/33th Asia-Pacific Software Engineering Conference (APSEC 2024, 2025, 2026)

  • The 15th Asia-Pacific Symposium on Internetware (Internetware 2024)

Proceedings Chair  
  • The 3rd ACM International Conference on AI Foundation Models and Software Engineering (FORGE 2026) in ICSE 2026

Publication Chair  
  • The 32nd International Symposium on Software Reliability Engineering (ISSRE 2021)

Guest Editor  
  • Information and Software Technology ISSRE 2021 special section