ARCLIN: Automated API Mention Resolution for Unformatted Texts (ICSE 2022 - Technical Track)

Write a Blog >>

Sun 8 - Fri 27 May 2022

Who

Yintong Huo, Yuxin Su, Hongming Zhang, Michael Lyu

Track

ICSE 2022 Technical Track

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 11 May 2022 05:10 - 05:15 at ICSE room 3 - Tools and Configurations Chair(s): Robert Chatley
Thu 12 May 2022 13:25 - 13:30 at ICSE room 2 - Tools and Environments 6 Chair(s): Domenico Bianculli
Fri 27 May 2022 09:40 - 09:45 at Room 306+307 - Papers 18: Recommender Systems, tools and environments Chair(s): Christian Bird

Abstract

Online technical forums (e.g., StackOverflow) are popular platforms for developers to discuss technical problems such as how to use specific Application Programming Interface (API), how to solve the programming tasks, or how to fix bugs in their codes. These discussions can often provide auxiliary knowledge of how to use the software that is not covered by the official documents. The automatic extraction of such knowledge will support a set of downstream tasks like API searching or indexing. However, unlike official documentation written by experts, discussions in open forums are made by regular developers who write in short and informal texts, including spelling errors or abbreviations. There are three major challenges for the accurate APIs recognition and linking mentioned APIs from unstructured natural language documents to an entry in the API repository: (1) distinguishing API mentions from common words; (2) identifying API mentions without a fully qualified name; and (3) disambiguating API mentions with similar method names but in a different library.

In this paper, to tackle these challenges, we propose an ARCLIN tool, which can effectively distinguish and link APIs without using human annotations. Specifically, we first design an API recognizer to automatically extract API mentions from natural language sentences by a Condition Random Field (CRF) on the top of a Bi-directional Long Short-Term Memory (Bi-LSTM) module, then we apply a context-aware scoring mechanism to compute the mention-entry similarity for each entry in an API repository. Compared to previous approaches with heuristic rules, our proposed tool without manual inspection outperforms by 8% in a high-quality dataset Py-mention, which contains 558 mentions and 2,830 sentences from five popular Python libraries. To our best knowledge, ARCLIN is the first approach to achieving full automation of API mention resolution from unformatted text without manually collected labels.

Link to Preprint

https://arxiv.org/abs/2201.01459

DOI

https://doi.org/10.1145/3510003.3510158

Yintong Huo

The Chinese University of Hong Kong

Yuxin Su

Sun Yat-sen University

Hongming Zhang

The Hong Kong University of Science and Technology

Michael Lyu

The Chinese University of Hong Kong

ARCLIN: Automated API Mention Resolution for Unformatted Texts

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 11 May
Displayed time zone: Eastern Time (US & Canada) change

05:00 - 06:00	Tools and ConfigurationsTechnical Track / SEIP - Software Engineering in Practice at ICSE room 3 Chair(s): Robert Chatley Imperial College London

5m Talk		Challenges in Applying Continuous Experimentation: A Practitioners’ Perspective SEIP - Software Engineering in Practice Kevin Anderson Delft University of Technology, Denise Visser bol.com, Jan-Willem Mannen ING, Yuxiang Jiang Delft University of Technology, Arie van Deursen Delft University of Technology, Netherlands DOI Pre-print
5m Talk		Reflekt: a Library for Compile-Time Reflection in Kotlin SEIP - Software Engineering in Practice Anastasiia Birillo JetBrains Research, Elena Lyulina JetBrains Research, Maria Malysheva JetBrains Research;Saint Petersburg State University, Vladislav Tankov JetBrains, ITMO University, Timofey Bryksin JetBrains Research; HSE University Pre-print Media Attached
5m Talk		ARCLIN: Automated API Mention Resolution for Unformatted Texts Technical Track Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Hongming Zhang The Hong Kong University of Science and Technology, Michael Lyu The Chinese University of Hong Kong DOI Pre-print Media Attached
5m Talk		On the Benefits and Limits of Incremental Build of Software Configurations: An Exploratory Study Technical Track Georges Aaron RANDRIANAINA Université de Rennes 1, IRISA, Xhevahire Tërnava Université de Rennes 1, INRIA/IRISA, Djamel Eddine Khelladi CNRS, France, Mathieu Acher Univ. Rennes 1, Inria, IRISA, Institut Universitaire de France (IUF) Pre-print Media Attached
5m Talk		Causality in Configurable Software Systems Technical Track Clemens Dubslaff TU Dresden, Kallistos Weis Saarland University, Christel Baier TU Dresden, Germany, Sven Apel Saarland University Pre-print Media Attached
5m Talk		A Scalable t-wise Coverage Estimator Technical Track Eduard Baranov Université Catholique de Louvain, Belgium, Sourav Chakraborty Indian Statistical Institute (ISI) , Kolkata, India, Axel Legay Université Catholique de Louvain, Belgium, Kuldeep S. Meel National University of Singapore, N. V. Vinodchandran University of Nebraska-Lincoln DOI Pre-print Media Attached

Thu 12 May
Displayed time zone: Eastern Time (US & Canada) change

13:00 - 14:00	Tools and Environments 6Technical Track / SEIP - Software Engineering in Practice / Journal-First Papers at ICSE room 2 Chair(s): Domenico Bianculli University of Luxembourg

5m Talk		Understanding and improving artifact sharing in software engineering research Journal-First Papers Christopher Steven Timperley Carnegie Mellon University, Lauren Herckis Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA Link to publication DOI Pre-print Media Attached
5m Talk		Towards Build Verifiability for Java-based Systems SEIP - Software Engineering in Practice Jiawen Xiong Huawei China, Yong Shi Huawei Technologies, Boyuan Chen York University, Filipe Cogo Centre for Software Excellence, Huawei, Canada, Zhen Ming (Jack) Jiang York University Pre-print Media Attached
5m Talk		Challenges in Applying Continuous Experimentation: A Practitioners’ Perspective SEIP - Software Engineering in Practice Kevin Anderson Delft University of Technology, Denise Visser bol.com, Jan-Willem Mannen ING, Yuxiang Jiang Delft University of Technology, Arie van Deursen Delft University of Technology, Netherlands DOI Pre-print
5m Talk		When Cyber-Physical Systems Meet AI: A Benchmark, an Evaluation, and a Way Forward SEIP - Software Engineering in Practice Norman Song , Deyun Lyu Kyushu university, Zhenya Zhang Nanyang Technological University, Zhijie Wang University of Alberta, Tianyi Zhang Purdue University, Lei Ma University of Alberta DOI Pre-print Media Attached
5m Talk		Dozer: Migrating Shell Commands to Ansible Modules via Execution Profiling and Synthesis SEIP - Software Engineering in Practice Eric Horton North Carolina State University, Chris Parnin North Carolina State University Pre-print Media Attached
5m Talk		ARCLIN: Automated API Mention Resolution for Unformatted Texts Technical Track Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Hongming Zhang The Hong Kong University of Science and Technology, Michael Lyu The Chinese University of Hong Kong DOI Pre-print Media Attached

Fri 27 May
Displayed time zone: Eastern Time (US & Canada) change

09:00 - 10:30	Papers 18: Recommender Systems, tools and environmentsTechnical Track / Journal-First Papers / NIER - New Ideas and Emerging Results / SEIP - Software Engineering in Practice at Room 306+307 Chair(s): Christian Bird Microsoft Research

09:00 5m Talk		Predicting the Objective and Priority of Issue Reports in Software Repositories Journal-First Papers Maliheh Izadi Sharif University of Technology, Kiana Akbari Sharif University of technology, Abbas Heydarnoori Sharif University of Technology Link to publication DOI Pre-print Media Attached
09:05 5m Talk		Using Deep Learning to Generate Complete Log Statements Technical Track Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached
09:10 5m Talk		Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning NIER - New Ideas and Emerging Results Martin Weyssow DIRO, Université de Montréal, Houari Sahraoui Université de Montréal, Bang Liu DIRO & Mila, Université de Montréal Pre-print Media Attached
09:15 5m Talk		"Project smells" — Experiences in Analysing the Software Quality of ML Projects with mllint SEIP - Software Engineering in Practice Bart van Oort Delft University of Technology, Luís Cruz Deflt University of Technology, Babak Loni ING Bank N.V., Arie van Deursen Delft University of Technology, Netherlands Pre-print Media Attached
09:20 5m Talk		Discovering Repetitive Code Changes in Python ML Systems Technical Track Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA DOI Pre-print Media Attached
09:25 5m Talk		FlakiMe: Laboratory-Controlled Test Flakiness Impact Assessment Technical Track Maxime Cordy University of Luxembourg, Luxembourg, Renaud Rwemalika University of Luxembourg, Adriano Franci University of Luxembourg, Mike Papadakis University of Luxembourg, Luxembourg, Mark Harman University College London Pre-print Media Attached
09:30 5m Talk		Semantic Image Fuzzing of AI Perception Systems Technical Track Trey Woodlief University of Virginia, Sebastian Elbaum University of Virginia, Kevin Sullivan University of Virginia DOI Pre-print Media Attached
09:35 5m Talk		Understanding and improving artifact sharing in software engineering research Journal-First Papers Christopher Steven Timperley Carnegie Mellon University, Lauren Herckis Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA Link to publication DOI Pre-print Media Attached
09:40 5m Talk		ARCLIN: Automated API Mention Resolution for Unformatted Texts Technical Track Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Hongming Zhang The Hong Kong University of Science and Technology, Michael Lyu The Chinese University of Hong Kong DOI Pre-print Media Attached

Information for Participants

Wed 11 May 2022 05:00 - 06:00 at ICSE room 3 - Tools and Configurations Chair(s): Robert Chatley

Info for room ICSE room 3-odd hours:

Click here to go to the room on Midspace

Thu 12 May 2022 13:00 - 14:00 at ICSE room 2 - Tools and Environments 6 Chair(s): Domenico Bianculli

Info for room ICSE room 2-odd hours:

Click here to go to the room on Midspace