Benchmarking Library Recognition in Tweets (ICPC 2022 - Research)

Who

Ting Zhang, Divya Prabha CHANDRASEKARAN, Ferdian Thung, David Lo

Track

ICPC 2022 Research

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 16 May 2022 20:31 - 20:38 at ICPC room - Session 8: Search and Reuse: Libraries & APIs Chair(s): Masud Rahman

Abstract

Software developers often use social media (such as Twitter) to share programming knowledge such as new tools, sample code snippets, and tips on programming. One of the topics they talk about is the software library. The tweets may contain useful information about a library. A good understanding of this information, e.g., on the developer’s views regarding a library can be beneficial to weigh the pros and cons of using the library as well as the general sentiments towards the library. However, it is not trivial to recognize a library sense of a word from its normal senses. For example, a tweet mentioning the word pandas may refer to the Python pandas library or to the animal. In this work, we created the first benchmark dataset and investigated the task to distinguish whether a tweet actually refers to a programming library or something else. Recently, the pre-trained Transformer model (PTM) has achieved great success in the fields of natural language processing and computer vision. Therefore, we extensively evaluated a broad set of modern PTMs, including both general-purpose and domain-specific ones, to solve this programming library recognition task in tweets. Experimental results show that the use of PTM can outperform the best-performing baseline methods by up to 5% - 43% in terms of F1-score on within-, cross-, and mixed-library settings.

Link to Preprint

https://happygirlzt.com/files/icpc22.pdf

Ting Zhang

Singapore Management University

Singapore

Divya Prabha CHANDRASEKARAN

Singapore Management University

Ferdian Thung

Singapore Management University

David Lo

Singapore Management University

Singapore

Media

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 16 May
Displayed time zone: Eastern Time (US & Canada) change

20:10 - 20:50	Session 8: Search and Reuse: Libraries & APIsResearch / Replications and Negative Results (RENE) at ICPC room Chair(s): Masud Rahman Dalhousie University

20:10 7m Talk		On the Effectiveness of Pretrained Models for API Learning Research Mohammad Abdul Hadi University of British Columbia, Imam Nur Bani Yusuf Singapore Management University, Ferdian Thung Singapore Management University, Kien Luong School of Computing and Information Systems, Singapore Management University, Fatemeh Hendijani Fard University of British Columbia, Lingxiao Jiang Singapore Management University, David Lo Singapore Management University Media Attached
20:17 7m Talk		Deep API Learning Revisited Replications and Negative Results (RENE) James Martin McGill University, Jin L.C. Guo McGill University Pre-print Media Attached
20:24 7m Talk		ARSeek: Identifying API Resource using Code and Discussion on Stack Overflow Research Kien Luong School of Computing and Information Systems, Singapore Management University, Mohammad Abdul Hadi University of British Columbia, Ferdian Thung Singapore Management University, Fatemeh Hendijani Fard University of British Columbia, David Lo Singapore Management University Media Attached
20:31 7m Talk		Benchmarking Library Recognition in Tweets Research Ting Zhang Singapore Management University, Divya Prabha CHANDRASEKARAN Singapore Management University, Ferdian Thung Singapore Management University, David Lo Singapore Management University Pre-print Media Attached
20:38 12m Live Q&A		Q&A-Paper Session 8 Research

Information for Participants

Mon 16 May 2022 20:10 - 20:50 at ICPC room - Session 8: Search and Reuse: Libraries & APIs Chair(s): Masud Rahman

Info for room ICPC room:

Click here to go to the room on Midspace