Open Source Software (OSS) projects start with an initial vocabulary, often determined by the first generation of developers. This vocabulary, embedded in code identifier names and internal code comments, goes through multiple rounds of change, influenced by the interrelated patterns of human (e.g., developers joining and departing) and system (e.g., maintenance activities) interactions. Capturing the dynamics of this change is crucial for understanding and synthesizing code changes over time. However, existing code evolution analysis tools, available in modern version control systems such as GitHub and SourceForge, often overlook the linguistic aspects of code evolution. To bridge this gap, in this paper, we propose to study code evolution in OSS projects through the lens of developers’ language, also known as code lexicon. Our analysis is conducted using 32 OSS projects sampled from a broad range of application domains. Our results show that different maintenance activities impact code lexicon differently. These insights lay out a preliminary foundation for modeling the linguistic history of OSS projects. In the long run, this foundation will be utilized to provide support for basic program comprehension tasks and help researchers gain new insights into the complex interplay between linguistic change and various system and human aspects of OSS development.
Tue 14 JulDisplayed time zone: (UTC) Coordinated Universal Time change
01:30 - 02:30 | Session 4: SummalizationResearch / ERA at ICPC Chair(s): Venera Arnaoudova Washington State University | ||
01:30 15mPaper | Improved Code Summarization via a Graph Neural Network Research Alexander LeClair University Of Notre Dame, Sakib Haque University of Notre Dame, Lingfei Wu IBM Research, Collin McMillan University of Notre Dame Pre-print Media Attached | ||
01:45 15mPaper | BugSum: Deep Context Understanding for Bug Report Summarization Research Haoran Liu National University of Defense Technology, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Shanshan Li National University of Defense Technology, Yong Guo National University of Defense Technology, Deze Wang National University of Defense Technology, Xiaoguang Mao National University of Defense Technology Media Attached | ||
02:00 15mPaper | A Human Study of Comprehension and Code Summarization Research Sean Stapleton University of Michigan, Yashmeet Gambhir University of Michigan, Alexander LeClair University Of Notre Dame, Zachary Eberhart , Westley Weimer University of Michigan, USA, Kevin Leach University of Michigan, Yu Huang University of Michigan Pre-print Media Attached | ||
02:15 15mPaper | Linguistic Documentation of Software History ERA Media Attached |