Registered user since Wed 1 Nov 2017
I am the Chief Expert of the Software Engineering Application Technology at Huawei, China. Prior to joining Huawei, I was an ARC DECRA Fellow and a lecturer (equivalent to a U.S. assistant professor) at the Faculty of Information Technology, Monash University, Australia, from 2018 to 2021. I received my Ph.D. degree in June 2014 from the College of Computer Science and Technology, Zhejiang University, China, where I was very fortunate to work under the supervision of Prof. Xiaohu Yang and Prof. Jianling Sun. From July 2012 to January 2014, I was a visiting student with Prof. David Lo at Singapore Management University.
To help developers and testers improve their productivity, my current research focuses on data science for software engineering, i.e., mining and analyzing rich data in software repositories to uncover interesting and actionable information. I have employed and customized structured and unstructured data analytics techniques – including data mining, information retrieval, natural language processing, search-based algorithms, and program analysis – to transform passive software engineering data into automated tools and new insights.
Contributions
2025
2030 Software Engineering
ESEC/FSE
ICSE
- Author in Software Engineering in Practice within the SE In Practice (SEIP)-track
- Industry Challenge Co-Chair in Organizing Committee
- Industry Challenge Co-Chair in Industry Challenge within the Industry Challenge Track-track
- Area Co-Chair for AI for Software Engineering in Research Track within the Research Track-track
2024
APSEC
ASE
- Author of ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code within the Research Papers-track
- Author of REACT: IR-Level Patch Presence Test for Binary within the Research Papers-track
- Author of What Makes a High-Quality Training Dataset for Large Language Models: A Practitioners’ Perspective within the Research Papers-track
- Author of App Review Driven Collaborative Bug Finding within the Journal-first Papers-track
- Author of B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests within the Research Papers-track
ISSTA
- Author of Automating Zero-Shot Patch Porting for Hard Forks within the Technical Papers-track
- Committee Member in Program Committee within the Technical Papers-track
- Author of SelfPiCo: Self-Guided Partial Code Execution with LLMs within the Technical Papers-track
- Author of Practitioners’ Expectations on Automated Test Generation within the Technical Papers-track
Internetware
2030 Software Engineering
ESEC/FSE
- Committee Member in Program Committee within the Ideas, Visions and Reflections-track
- Author of Easy over Hard: A Simple Baseline for Test Failures Causes Prediction within the Industry Papers-track
- Committee Member in Program Committee within the Industry Papers-track
- Committee Member in Program Committee within the Research Papers-track
- Session Chair of Software Maintenance and Comprehension 3 (part of Research Papers)
- Author of Unveil the Mystery of Critical Software Vulnerabilities within the Industry Papers-track
MODAL
ICSE
- Author of Streamlining Java Programming: Uncovering Well-Formed Idioms with IdioMine within the Research Track-track
- Author of PPT4J: Patch Presence Test for Java Binaries within the Research Track-track
- Co-chair in Software Engineering in Society within the Software Engineering in Society-track
- Author of Pre-training by Predicting Program Dependencies for Vulnerability Analysis Tasks within the Research Track-track
- Software Engineering in Society Co-Chair in Organising Committee
- Industry Challenge Track Co-Chair in Organising Committee
- Industry Challenge Track Co-Chair in Industry Challenge within the Industry Challenge Track-track
- Author of Code Search is All You Need? Improving Code Suggestions with Code Search within the Research Track-track
- Author of Exploiting Library Vulnerability via Migration Based Automating Test Generation within the Research Track-track
- Committee Member in Research Track within the Research Track-track
- Author of Towards More Practical Automation of Vulnerability Assessment within the Research Track-track
- Author of PPT4J: Patch Presence Test for Java Binaries within the Artifact Evaluation-track
- Author of PS3: Precise Patch Presence Test based on Semantic Symbolic Signature within the Research Track-track
- Author of MUT: Human-in-the-Loop Unit Test Migration within the Research Track-track
- Committee Member in New Ideas and Emerging Results within the New Ideas and Emerging Results-track
- Author of Learning in the Wild: Towards Leveraging Unlabeled Data for Effectively Tuning Pre-trained Code Models within the Research Track-track
International Conference on Program Comprehension
Mining Software Repositories
Forge
2023
Mining Software Repositories
ESEC/FSE
- Committee Member in Test of Time Award Committee
- Committee Member in Program Committee within the Industry Papers-track
- Author of [Remote] CCT5: A Code-Change-Oriented Pre-Trained Model within the Research Papers-track
- Author of [Remote] Distinguishing Look-Alike Innocent and Vulnerable Code by Subtle Semantic Representation Learning and Explanation within the Research Papers-track
- Author of [Remote] Software Architecture in Practice: Challenges and Opportunities within the Research Papers-track
- Author of [Remote] C³: Code Clone-Based Identification of Duplicated Components within the Industry Papers-track
- Committee Member in Program Committee within the Research Papers-track
ASE
- Author of Identify and Update Test Cases when Production Code Changes: A Transformer-based Approach within the Research Papers-track
- Committee Member in Program Committee within the Industry Challenge (Competition)-track
- Committee Member in Program Committee within the Research Papers-track
- Session Chair of SATE - Software Engineering at the Era of LLMs (part of SATE - Software Engineering at the Era of LLMs)
- Author of Are They All Good? Studying Practitioners' Expectations on the Readability of Log Messages within the Research Papers-track
- Author of Neural SZZ Algorithm within the Research Papers-track
Internetware
ICSE
- Author of Faster or Slower? Performance Mystery of Python Idioms Unveiled with Empirical Evidence within the Technical Track-track
- Chair in Demonstrations within the DEMO - Demonstrations-track
- Committee Member in Software Engineering in Practice within the SEIP - Software Engineering in Practice-track
- Author of CoLeFunDa: Explainable Silent Vulnerability Fix Identification within the Technical Track-track
- Author of deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search within the Journal-First Papers-track
- Author of CCRep: Learning Code Change Representations via Pre-Trained Code Model and Query Back within the Technical Track-track
- Author of Predictive Comment Updating with Heuristics and AST-Path-Based Neural Learning: A Two-Phase Approach within the Journal-First Papers-track
- Formal Research Demonstrations Co-Chairs in Organising Committee
- Author of Correlating Automated and Human Evaluation of Code Documentation Generation Quality within the Journal-First Papers-track
- Committee Member in Software Engineering in Society within the SEIS - Software Engineering in Society-track
- Author of Fine-grained Commit-level Vulnerability Type Prediction By CWE Tree Structure within the Technical Track-track
- Author of SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics within the Technical Track-track
- Author of RepresentThemAll: A Universal Learning Representation of Bug Reports within the Technical Track-track
International Conference on Program Comprehension
International Conference on Technical Debt
2022
ESEC/FSE
- Author of Characterizing Search Activities on Stack Overflow within the ESEC/FSE 2021-track
- Author of Are We Building on the Rock? On the Importance of Data Preprocessing for Code Summarization within the Research Papers-track
- Author of Automated Unearthing of Dangerous Issue Reports within the Research Papers-track
- Author of The Best of Both Worlds: Integrating Semantic Features with Expert Features for Defect Prediction and Localization within the Research Papers-track
- Author of What Motivates Software Practitioners to Contribute to Inner Source? within the Research Papers-track
- Committee Member in Program Committee within the Industry Paper-track
- Author of CodeMatcher: A Tool for Large-Scale Code Search Based on Query Semantics Matching within the Demonstrations-track
- Author of How to Better Utilize Code Graphs in Semantic Code Search? within the Research Papers-track
- Author of Making Python Code Idiomatic by Automatic Refactoring Non-idiomatic Python Code with Pythonic Idioms within the Research Papers-track
ASE
- Author of Is this Change the Answer to that Problem? Correlating Descriptions of Bug and Code Changes for Evaluating Patch Correctness within the Artifact Evaluation-track
- Author of Code Structure Guided Transformer for Source Code Summarization within the Journal-first Papers-track
- Committee Member in Program Committee within the Research Papers-track
- Author of Constructing a System Knowledge Graph of User Tasks and Failures from Bug Reports to Support Soap Opera Testing within the Research Papers-track
- Author of Is this Change the Answer to that Problem? Correlating Descriptions of Bug and Code Changes for Evaluating Patch Correctness within the Research Papers-track
ICSE
- Author of CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words within the Journal-First Papers-track
- Author of An Empirical Study of Release Note Production and Usage in Practice within the Journal-First Papers-track
- Committee Member in Program Committee within the NIER - New Ideas and Emerging Results-track
- Author of Practitioners’ Expectations on Automated Code Comment Generation within the Technical Track-track
- Committee Member in Program Committee within the SEIP - Software Engineering in Practice-track
- Author of SZZ for Vulnerability: Automatic Identification of Version Ranges Affected by CVE Vulnerabilities within the Technical Track-track
- Author of Context-aware Retrieval-based Deep Commit Message Generation within the Journal-First Papers-track
- Author of ShellFusion: Answer Generation for Shell Programming Tasks via Knowledge Fusion within the Technical Track-track
- Author of Maintenance-Related Concerns for Post-deployed Ethereum Smart Contract Development: Issues, Techniques, and Future Challenges within the Journal-First Papers-track
- Author of Code Reviewer Recommendation in Tencent: Practice, Challenge, and Direction within the SEIP - Software Engineering in Practice-track
- Author of Why My Code Summarization Approach Does Not Work: Improving Code Summarization with Comment Category Prediction within the Journal-First Papers-track
- Author of An Exploratory Study on the Repeatedly Shared External Links on Stack Overflow within the Journal-First Papers-track
- Author of A Universal Data Augmentation Approach for Fault Localization within the Technical Track-track
- Author of Post2Vec: Learning Distributed Representations of Stack Overflow Posts within the Journal-First Papers-track
Mining Software Repositories
International Conference on Technical Debt
International Conference on Program Comprehension
2021
ASE
- Committee Member in Program Committee within the Research Papers-track
- Author of Automating User Notice Generation for Smart Contract Functions within the Research Papers-track
- Committee Member in Program Committee within the Student Research Competition-track
- Author of EditSum: A Retrieve-and-Edit Framework for Source Code Summarization within the Research Papers-track
- Author of Automating Developer Chat Mining within the Research Papers-track
- Author of Detecting Adversarial Samples with Graph-Guided Testing within the Late Breaking Results-track
- Author of Reducing Bug Triaging Confusion by Learning from Mistakes with a Bug Tossing Knowledge Graph within the Research Papers-track
- Workshops Co-Chair in Workshops Chairs within the Workshops-track
- Workshops Co-Chair in Organizing Committee
- Author of Finding A Needle in a Haystack: Automated Mining of Silent Vulnerability Fixes within the Research Papers-track
ESEC/FSE
- Author of Code2Que: A Tool for Improving Question Titles from Mined Code Snippets in Stack Overflow within the Demonstrations-track
- Committee Member in Program Committee within the Doctoral Symposium-track
- Author of Generating Question Titles for Stack Overflow from Mined Code Snippets within the Journal First-track
- Author of Automating the Removal of Obsolete TODO Comments within the Research Papers-track
- Committee Member in Program Committee within the Ideas, Visions and Reflections-track
- Committee Member in Program Committee within the Research Papers-track
- Author of Checking Smart Contracts with Structural Code Embedding within the Journal First-track
- Author of Embedding App-Library Graph for Neural Third Party Library Recommendation within the Research Papers-track
- Author of KGAMD: An API-Misuse Detector Driven by Fine-Grained API-Constraint Knowledge Graph within the Demonstrations-track
- Author of Just-In-Time Defect Identification and Localization: A Two-Phase Framework within the Journal First-track
- Author of Characterizing Search Activities on Stack Overflow within the Research Papers-track
ICSE
- Author of UIS-Hunter: Detecting UI Design Smells in Android Apps within the DEMO - Demonstrations-track
- Author of Smart Contract Security: a Practitioners’ Perspective within the Technical Track-track
- Author of What Makes a Popular Academic AI Repository? within the Journal-First Papers-track
- Author of Revisiting Supervised and Unsupervised Methods for Effort-Aware Cross-Project Defect Prediction within the Journal-First Papers-track
- Author of Defining Smart Contract Defects on Ethereum within the Journal-First Papers-track
- Author of Chatbot4QR: Interactive Query Refinement for Technical Question Retrieval within the Journal-First Papers-track
- Author of A First Look at Human Values-Violation in App Reviews within the SEIS - Software Engineering in Society-track
- Author of An Exploratory Study on the Introduction and Removal of Different Types of Technical Debt within the Journal-First Papers-track
- Author of Modular Tree Network for Source Code Representation Learning within the Journal-First Papers-track
- Author of Smart Contract Security: a Practitioners' Perspective within the AE - Artifact Evaluation-track
- Author of A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms within the Technical Track-track
- Author of psc2code: Denoising Code Extraction from Programming Screencasts within the Journal-First Papers-track
- Author of An Empirical Study of the Landscape of Open Source Projects in Baidu, Alibaba, and Tencent within the SEIP - Software Engineering in Practice-track
- Author of Wireframe-based UI Design Search through Image Autoencoder within the Journal-First Papers-track
- Author of Don't Do That! Hunting Down Visual Design Smells in Complex UIs against Design Guidelines within the Technical Track-track
- Committee Member in Program Committee within the Technical Track-track
- Author of Automatic Solution Summarization for Crash Bugs within the Technical Track-track
- Author of Unveiling the Mystery of API Evolution in Deep Learning Frameworks -- A Case Study of Tensorflow 2 within the SEIP - Software Engineering in Practice-track
- Author of Technical Q&A Site Answer Recommendation via Question Boosting within the Journal-First Papers-track
International Conference on Program Comprehension
2020
ESEC/FSE
- Panelist of New Future of Work within the Plenary events-track
- Committee Member in Program Committee within the Research Papers-track
- Committee Member in Program Committee within the Visions and Reflections-track
- Author of Enhancing Developer Interactions with Programming Screencasts through Accurate Code Extraction within the Tool Demos-track
- Author of DeepCommenter: A Deep Code Comment Generation Tool with Hybrid Lexical and Syntactical Information within the Tool Demos-track
- Author of Effort-Aware Just-in-Time Defect Identification in Practice: A Case Study at Alibaba within the Industry Papers-track
- Author of JITO: A Tool for Just-in-Time Defect Identification and Localization within the Tool Demos-track
ASE
- Session Chair of Maintenance and Evolution (4) (part of Research Papers)
- Author of API-Misuse Detection Driven by Fine-Grained API-Constraint Knowledge Graph within the Research Papers-track
- Journal-first Papers Chair in Organizing Committee
- Author of Retrieve and Refine: Exemplar-based Neural Comment Generation within the Research Papers-track
- Journal-first Papers Chair in Journal-first Papers Chair within the Journal-first Papers-track
- Author of Automating Just-In-Time Comment Updating within the Research Papers-track
- Author of Predicting Code Context Models for Software Development Tasks within the Research Papers-track
- Committee Member in Program Committee within the Research Papers-track
- Session Chair of What Makes a Good ASE Paper? (part of Social/Networking)
ICSE
- Author of Characterizing and Identifying Reverted Commits within the Journal First-track
- Author of What do Programmers Discuss about Deep Learning Frameworks within the Journal First-track
- Author of A Survey on Adaptive Random Testing within the Journal First-track
- Author of Neural Network Based Classification of Self-admitted Technical Debt: From Performance to Explainability and Deployability within the Journal First-track
- Author of Which Variables Should I Log? within the Journal First-track
- Author of The Impact of Mislabeled Changes by SZZ on Just-in-Time Defect Prediction within the Journal First-track
- Author of Is Using Deep Learning Frameworks Free? Characterizing Technical Debt in Deep Learning Frameworks within the Software Engineering in Society-track
- Author of Smart Contract Development: Challenges and Opportunities within the Journal First-track
- Author of What Do Programmers Discuss about Blockchain? A Case Study on the Use of Balanced LDA and the Reference Architecture of a Domain to Capture Online Discussions about Blockchain Platforms across Stack Exchange Communities within the Journal First-track
- Author of Demystify Official API Usage Directives with Crowdsourced API Misuse Scenarios, Erroneous Code Examples and Patches within the Technical Papers-track
- Programme Committee in Program Committee within the Technical Papers-track
- Author of How does Machine Learning Change Software Development Practices? within the Journal First-track
- Author of A Large Scale Study of Long-Time Contributor Prediction for GitHub Projects within the Journal First-track
- Author of Locating Latent Design Information in Developer Discussions: A Study on Pull Requests within the Journal First-track
International Conference on Program Comprehension
- Author of A Self-Attentional Neural Architecture for Code Completion with Multi-Task Learning within the Research-track
- Committee Member in Program Committee within the Research-track
- Author of Improving Code Search with Co-Attentive Representation Learning within the Research-track
- Author of Duplicate Bug Report Detection Using Dual-Channel Convolutional Neural Networks within the Research-track
MOBILESoft
Mining Software Repositories
2019
ASE
- Session Chair of Prediction (part of Papers)
- Author of Discovering, Explaining and Summarizing Controversial Discussions in Community Q&A Sites within the Research Papers-track
- Author of Automating App Review Response Generation within the Research Papers-track
- Author of Automatic, highly accurate app permission recommendation within the Journal First Presentations-track
- Committee Member in Program Committee within the Research Papers-track
- Author of Automatic Generation of Pull Request Descriptions within the Research Papers-track
Celebration of ASE
ICSE
- Author of On Reliability of Patch Correctness Assessment within the Technical Track-track
- Author of Practitioners' Views on Good Software Testing Practices within the Software Engineering in Practice-track
- Author of Chaff from the Wheat: Characterizing and Determining Valid Bug Reports within the Journal-First Papers-track
- Author of Perceptions, Expectations, and Challenges in Defect Prediction within the Journal-First Papers-track
- Author of Automating Intention Mining within the Journal-First Papers-track
- Session Chair of Crowdsourced Knowledge and Feedback (part of Papers)
- Author of ActionNet: Vision-based Workflow Action Recognition From Programming Screencasts within the Technical Track-track
- Committee Member in Program Committee within the Technical Track-track
- Author of How Practitioners Perceive Coding Proficiency within the Technical Track-track
- Author of How Practitioners Perceive Automated Bug Report Management Techniques within the Journal-First Papers-track
Mining Software Repositories
2018
ESEC/FSE
- Author of VT-Revolution: Interactive Programming Tutorials Made Possible within the Demonstrations-track
- Author of Early prediction of merged code changes to prioritize reviewing tasks within the Journal-First-track
- Author of Automating Change-level Self-admitted Technical Debt Determination within the Journal-First-track
- Author of VT-Revolution: Interactive Programming Video Tutorial Authoring and Watching System within the Journal-First-track
ICSE
- Author of SATD Detector: A Text-Mining-Based Self-Admitted Technical Debt Detection Tool within the DEMO - Demonstrations-track
- Author of Measuring Program Comprehension: A Large-Scale Field Study with Professionals within the Journal first papers-track
- Author of Inference of development activities from interaction with uninstrumented applications within the Journal first papers-track
Mining Software Repositories
International Conference on Program Comprehension
- Author of Deep Code Comment Generation within the Technical Research-track
- Committee Member in Program Committee within the Technical Research-track
- Author of Recommending Frequently Encountered Bugs within the Technical Research-track
- Author of What Design is Discussed by Developers? within the Technical Research-track
- Author of What Design is Discussed by Developers? within the Early Research Achievement-track