SCALAR: A Part-of-speech Tagger for Identifiers
The paper presents the Source Code Analysis and Lexical Annotation Runtime (SCALAR), a tool specialized for mapping (annotating) source code identifier names to their corresponding part-of-speech tag sequence (grammar pattern). SCALAR’s internal model is trained using scikit-learn’s GradientBoostingClassifier in conjunction with a manually-curated oracle of identifier names and their grammar patterns. This specializes the tagger to recognize the unique structure of the natural language used by developers to create all types of identifiers (e.g., function names, variable names etc.). SCALAR’s output is compared with a previous version of the tagger, as well as a modern off-the-shelf part-of-speech tagger to show how it improves upon other taggers’ output for annotating identifiers. The code is available on Github.
Mon 28 AprDisplayed time zone: Eastern Time (US & Canada) change
| 11:00 - 12:30 | Empirical Findings, Future Visions, Recommendations Replications and Negative Results (RENE) / Early Research Achievements (ERA) / Tool Demonstration / Research Track at 205 Chair(s): Mark Hills Appalachian State University, Coen De Roover Vrije Universiteit Brussel, Gema Rodríguez-Pérez Department of Computer Science, Mathematics, Physics and Statistics, University of British Columbia, Okanagan Campus | ||
| 11:0010m Talk | Terminal Lucidity: Envisioning the Future of the Terminal Research Track Michael MacInnis Carleton University, Olga Baysal Carleton University, Michele Lanza Software Institute - USI, LuganoPre-print | ||
| 11:106m Talk | Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists Early Research Achievements (ERA) Alyssia Chen University of Hawaii at Manoa, Carol Wong University of Hawaii at Manoa, Bonita Sharif University of Nebraska-Lincoln, USA, Anthony Peruma University of Hawai‘i at MānoaPre-print | ||
| 11:1610m Talk | Method Names in Jupyter Notebooks: An Exploratory Study Research Track Carol Wong University of Hawaii at Manoa, Gunnar Larsen University of Hawaii at Manoa, Rocky Huang University of Hawaii at Manoa, Bonita Sharif University of Nebraska-Lincoln, USA, Anthony Peruma University of Hawai‘i at Mānoa | ||
| 11:266m Talk | SCALAR: A Part-of-speech Tagger for Identifiers Tool Demonstration Christian Newman , Brandon Scholten Kent State University, Sophia Testa Kent State University, Joshua Behler Kent State University, Syreen Banabilah Kent State University, Michael L. Collard The University of Akron, Michael J. Decker Bowling Green State University, Mohamed Wiem Mkaouer University of Michigan - Flint, Marcos Zampieri George mason University, Eman Abdullah AlOmar Stevens Institute of Technology, USA, Reem Alsuhaibani Prince Sultan University, Anthony Peruma University of Hawai‘i at Mānoa, Jonathan I. Maletic Kent State University | ||
| 11:326m Talk | How do Papers Make into Machine Learning Frameworks: A Preliminary Study on TensorFlow Early Research Achievements (ERA) Federica Pepe University of Sannio, Claudia Farkas York University, Maleknaz Nayebi York University, Giulio Antoniol Ecole Polytechnique de Montreal, Massimiliano Di Penta University of Sannio, Italy | ||
| 11:384m Talk | Toward Neurosymbolic Program Comprehension Early Research Achievements (ERA) Alejandro Velasco William & Mary, Aya Garryyeva William and Mary, David Nader Palacio William & Mary, Antonio Mastropaolo William and Mary, USA, Denys Poshyvanyk William & MaryPre-print | ||
| 11:4210m Talk | Combining Static Analysis Techniques for Program Comprehension Using Slicito Tool DemonstrationPre-print File Attached | ||
| 11:526m Talk | Mining Code Change Patterns in Ada Projects Replications and Negative Results (RENE) | ||
| 11:586m Talk | Telling Software Evolution Stories With Sonification Early Research Achievements (ERA) | ||
| 12:0410m Talk | Attributed Multiplex Learning for Analogical Third-Party Library Recommendation and Retrieval Research Track Baihui Sang State Key Laboratory for Novel Software Technology, Nanjing University, Liang Wang Nanjing University, Jierui Zhang Nanjing University, Xianping Tao Nanjing University | ||
| 12:146m Talk | LLM2FedLLM - A Tool for Simulating Federated LLMs for Software Engineering Tasks Tool Demonstration Jahnavi Kumar Indian Institute of Technology Tirupati, India, Siddhartha Gandu Indian Institute of Technology Tirupati, Sridhar Chimalakonda Indian Institute of Technology Tirupati | ||
| 12:2010m Live Q&A | Session's Discussion: "Empirical Findings, Future Visions, Recommendations" Research Track | ||




