Deep API Learning Revisited
When working with unfamiliar APIs, programmers frequently resort to learning resources and code examples to understand API usage sequences. Programmers often encounter obstacles finding the appropriate information due to either poor quality of API documentation or ineffective query-based searching strategy. To help solve this issue, researchers have proposed various methods to suggest the sequence of APIs given natural language queries representing the information needs from programmers. Among such efforts, Gu et al. adopts a deep learning method, in particular, an RNN Encoder-Decoder architecture, to perform this task and obtained promising results on common APIs in Java. In this work, we aim to reproduce their results and apply the same methods for APIs in Python. Additionally, we compare the performance with a more recent Transformer-based method, i.e. CodeBERT for the same task. Our experiment reveals a clear drop in performance measure when careful data cleaning is performed. Owing to the pretraining from a large number of source code files and effective encoding technique, CodeBERT outperforms the method by Gu et al, to a large extent.
Mon 16 MayDisplayed time zone: Eastern Time (US & Canada) change
20:10 - 20:50 | Session 8: Search and Reuse: Libraries & APIsResearch / Replications and Negative Results (RENE) at ICPC room Chair(s): Masud Rahman Dalhousie University | ||
20:10 7mTalk | On the Effectiveness of Pretrained Models for API Learning Research Mohammad Abdul Hadi University of British Columbia, Imam Nur Bani Yusuf Singapore Management University, Ferdian Thung Singapore Management University, Kien Luong School of Computing and Information Systems, Singapore Management University, Fatemeh Hendijani Fard University of British Columbia, Lingxiao Jiang Singapore Management University, David Lo Singapore Management University Media Attached | ||
20:17 7mTalk | Deep API Learning Revisited Replications and Negative Results (RENE) Pre-print Media Attached | ||
20:24 7mTalk | ARSeek: Identifying API Resource using Code and Discussion on Stack Overflow Research Kien Luong School of Computing and Information Systems, Singapore Management University, Mohammad Abdul Hadi University of British Columbia, Ferdian Thung Singapore Management University, Fatemeh Hendijani Fard University of British Columbia, David Lo Singapore Management University Media Attached | ||
20:31 7mTalk | Benchmarking Library Recognition in Tweets Research Ting Zhang Singapore Management University, Divya Prabha CHANDRASEKARAN Singapore Management University, Ferdian Thung Singapore Management University, David Lo Singapore Management University Pre-print Media Attached | ||
20:38 12mLive Q&A | Q&A-Paper Session 8 Research |