I am Xin Yin (殷鑫), a third-year Ph.D. student at Zhejiang University, supervised by Prof. Chao Ni. I obtained my Bachelor’s degree at Central South University.
My research interests include Large Language Models, Software Testing, and Software Security. I have published papers at top international conferences such as FSE/ISSTA/ICSE/ASE/CVPR/EMNLP. I have developed several well-known approaches, including:
- SVulD and MVulD: Vulnerability Detection
- CodeGPTSensor+: LLM-generated Code Detection
- ThinkRepair and ReduceFix: Program Repair
- Rectifier and RepoTransAgent: Code Translation
- AUGER, RATester, and CasModaTest: Unit Test Generation
- SolEval and PrefGen: Smart Contract Generation
- READ: Reasoning Segmentation
In 2025, I will lead or participate in the following research topics:
- Software Testing: Unit Test Generation
- Large Language Models (LLMs): Agent
🔥 News
- 2025.08: 🎉 Two papers were accepted by EMNLP 2025 Main!
- 2025.08: 🎉 One paper was accepted by ASE 2025!
- 2025.07: 🎉 One paper was accepted by ISSRE 2025!
- 2025.04: 🎉 One paper was accepted by TOSEM 2025!
- 2025.02: 🎉 One paper was accepted by CVPR 2025!
- 2024.10: 🎉 One paper was accepted by ICSE 2025!
- 2024.09: 🎉 One paper was accepted by TSE 2024!
- 2024.07: 🎉 One paper was accepted by ISSTA 2024!
- 2023.05: 🎉 One paper was accepted by FSE 2023!
- 2023.03: 🎉 One paper was accepted by ICPC 2023!
📝 Publications
# denotes co-first author or first student author
Representative papers: 7 CCF-A papers, 2 TH-CPL-A papers
Selected Publications
- Pre-training CLIP against Data Poisoning with Optimal Transport-based Matching and Alignment.
Tong Zhang, Kuofeng Gao, Jiawang Bai, Leo Yu Zhang, Xin Yin, Zonghui Wang, Shouling Ji, Wenzhi Chen.
In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP’25 Main). (TH-CPL-A)
- SolEval: Benchmarking Large Language Models for Repository-level Solidity Smart Contract Generation.
Zhiyuan Peng, Xin Yin#, Rui Qian, Peiqin Lin, YongKang Liu, Hao Zhang, Chenhao Ying, Yuan Luo.
In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP’25 Main). (TH-CPL-A)
- PrefGen: A Preference-Driven Methodology for Secure Yet Gas-Efficient Smart Contract Generation.
Zhiyuan Peng, Xin Yin#, Zijie Zhou, Chenhao Ying, Chao Ni, Yuan Luo.
In Proceedings of the 40th IEEE/ACM Automated Software Engineering Conference (ASE’25). (CCF-A)
- Abundant Modalities Offer More Nutrients: Multi-Modal-Based Function-level Vulnerability Detection.
Chao Ni, Xin Yin#, Xinrui Li, Xiaodan Xu, Zhi Yu.
In ACM Transactions on Software Engineering and Methodology (TOSEM’25). (CCF-A)
- Reasoning to Attend: Try to Understand How <SEG> Token Works.
Rui Qian, Xin Yin#, Dejing Dou.
In Proceedings of the 2025 IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR’25). (CCF-A)
- What You See Is What You Get: Attention-based Self-guided Automatic Unit Test Generation.
Xin Yin, Chao Ni, Xiaodan Xu, Xiaohu Yang.
In Proceedings of the 47th IEEE/ACM International Conference on Software Engineering (ICSE’25). (CCF-A)
- Multitask-based Evaluation of Open-Source LLM on Software Vulnerability.
Xin Yin, Chao Ni, Shaohua Wang.
In IEEE Transactions on Software Engineering (TSE’24). (CCF-A)
- ThinkRepair: Self-Directed Automated Program Repair.
Xin Yin, Chao Ni, Shaohua Wang, Zhenhao Li, Limin Zeng, Xiaohu Yang.
In Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA’24). (CCF-A)
- Distinguishing Look-Alike Innocent and Vulnerable Code by Subtle Semantic Representation Learning and Explanation.
Chao Ni, Xin Yin#, Kaiwen Yang, Dehai Zhao, Zhenchang Xing, Xin Xia.
In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE’23). (CCF-A)
Peer-Reviewed Publications
- A Cascaded Pipeline for Self-Directed, Model-Agnostic Unit Test Generation via LLMs.
Chao Ni, Xiaoya Wang, Xin Yin, Liushan Chen, Guojun Ma.
In Proceedings of the IEEE International Symposium on Software Reliability Engineering (ISSRE’25). (CCF-B)
- Automatic Commit Range Identification of Untagged Version.
Yan Zhu, Lingfeng Bao, Chengjie Chen, Lexiao Zhang, Xin Yin, Chao Ni.
In Proceedings of the Asia-Pacific Software Engineering Conference (APSEC’24). (CCF-C)
- FVA: Assessing Function-Level Vulnerability by Integrating Flow-Sensitive Structure and Code Statement Semantic.
Chao Ni, Liyu Shen, Wei Wang, Xiang Chen, Xin Yin, Lexiao Zhang.
In Proceedings of the IEEE/ACM International Conference on Program Comprehension (ICPC’23). (CCF-B)
- Spatio-temporal aware knowledge graph embedding for recommender systems.
Liu Yang, Xin Yin#, Jun Long, Tingxuan Chen, Jie Zhao, Wenti Huang.
In Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA’22). (CCF-C)
Preprints
- Learning to Align Human Code Preferences.
Xin Yin, Chao Ni, Liushan Chen, Xiaohu Yang. arXiv.
- Detecting LLM-generated Code with Subtle Modification by Adversarial Training.
Xin Yin, Xinrui Li, Chao Ni, Xiaodan Xu, Xiaohu Yang. arXiv.
- Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection.
Xin Yin, Chao Ni, Xinrui Li, Liushan Chen, Guojun Ma, Xiaohu Yang. arXiv.
- Improving the Ability of Pre-trained Language Model by Imparting Large Language Model’s Experience.
Xin Yin, Chao Ni, Xinrui Li, Xiaohu Yang. arXiv.
- Rectifier: Code Translation with Corrector via LLMs.
Xin Yin, Chao Ni, Tien N. Nguyen, Shaohua Wang, Xiaohu Yang. arXiv.
- Learning-based Models for Vulnerability Detection: An Extensive Study.
Chao Ni, Xin Yin#, Liyu Shen, Shaohua Wang. arXiv.
- RepoTransAgent: Multi-Agent LLM Framework for Repository-Aware Code Translation.
Ziqi Guan, Xin Yin#, Zhiyuan Peng, Chao Ni. arXiv.
- MulChain: Enabling Advanced Cross-Modal Queries in Hybrid-Storage Blockchains.
Zhiyuan Peng, Xin Yin#, Gang Wang, Chenhao Ying, Chao Ni, Wei Chen, Xikun Jiang, Yibin Xu, Yuan Luo. arXiv.
- Input Reduction Enhanced LLM-based Program Repair.
Boyang Yang, Luyao Ren, Xin Yin, Jiadong Ren, Haoye Tian, Shunfu Jin. arXiv.
- SepPrune: Structured Pruning for Efficient Deep Speech Separation.
Yuqi Li, Kai Li, Xin Yin, Zhifei Yang, Junhao Dong, Zeyu Dong, Chuanguang Yang, Yingli Tian, Yao Lu. arXiv.
- SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models.
Yuqi Li, Yao Lu, Junhao Dong, Zeyu Dong, Chuanguang Yang, Xin Yin, Yihao Chen, Jianping Gou, Yingli Tian, Tingwen Huang. arXiv.
- Pros and Cons! Evaluating ChatGPT on Software Vulnerability.
Xin Yin. arXiv.
🎖 Honors and Awards
- 2025.06, Zhejiang University Grant for Pursuing an Outstanding Doctoral Dissertation (浙江大学争创优秀博士学位论文资助)
💬 Academic Services
- Journal Reviewer: IEEE Transactions on Software Engineering (TSE)
- Conference PC Member: ICSE 2026 (Shadow PC), AAAI 2026 (PC)
📖 Education
- 2022.09 - Present, Ph.D. student, Zhejiang University.
- 2018.09 - 2022.06, Bachelor, Central South University.
💻 Internships
- 2024.06 - 2025.06, State Key Laboratory of Blockchain and Data Security, Hangzhou.
- 2023.07 - 2023.10, Software Engineering Application Technology Lab at Huawei, Hangzhou.