Thu 12 May 2022 22:20 - 22:25 at ICSE room 3-even hours - Reuse and Economics Chair(s): Rick Kazman
Recently, pre-trained programming language models such as CodeBERT have demonstrated substantial gains in code search tasks. Despite their success, they rely on the availability of huge parallel data for fine-tuning code-query mappings. This restricts their practicality in domain-specific languages that have relatively scarce and costly data. In this paper, we propose CDCS, a novel approach for domain-specific code search. CDCS employs a few-shot transfer learning framework where an initial representation model is pre-trained on a large corpus of common programming languages (e.g., Java, Python) and is further adapted to domain-specific languages such as SQL and Solidity. Unlike cross-language CodeBERT which directly fine-tunes models in the target language, CDCS employs a meta-learning algorithm named MAML to learn an initialized model that can be best reused for the domain-specific language. We evaluate the proposed approach on two domain-specific languages, namely, SQL and Solidity, with model transferred from two widely used languages (Python and Java). Our results show that CDCS significantly outperforms baseline models that directly fine-tune models in the domain-specific language.
Slides (slides.pdf) | 2.40MiB |
Wed 11 MayDisplayed time zone: Eastern Time (US & Canada) change
04:00 - 05:00 | Reuse and AgilityJournal-First Papers / Technical Track / SEIP - Software Engineering in Practice at ICSE room 4-even hours Chair(s): Massimiliano Di Penta University of Sannio, Italy | ||
04:00 5mTalk | CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words Journal-First Papers Chao Liu Chongqing University, Xin Xia Huawei Software Engineering Application Technology Lab, David Lo Singapore Management University, Zhiwei Liu Baidu Inc., Ahmed E. Hassan Queen's University, Shanping Li Zhejiang University Link to publication DOI Pre-print Media Attached | ||
04:05 5mTalk | Präzi: From Package-based to Call-based Dependency Networks Journal-First Papers Joseph Hejderup Endor Labs & Delft University of Technology, Moritz Beller Facebook, USA, Konstantinos Triantafyllou University of Athens, Georgios Gousios Endor Labs & Delft University of Technology DOI Pre-print Media Attached | ||
04:10 5mTalk | The Agile Success Model: A Mixed-methods Study of a Large-scale Agile Transformation Journal-First Papers Daniel Russo Department of Computer Science, Aalborg University Link to publication DOI Pre-print | ||
04:15 5mTalk | Factors Affecting On-Time Delivery in Large-Scale Agile Software Development Journal-First Papers Elvan Kula Delft University of Technology, Eric Greuter ING, Arie van Deursen Delft University of Technology, Netherlands, Georgios Gousios Endor Labs & Delft University of Technology DOI Pre-print Media Attached | ||
04:20 5mTalk | Issues in the Adoption of the Scaled Agile Framework SEIP - Software Engineering in Practice Paolo Ciancarini University of Bologna / Innopolis University, Artem Kruglov Innopolis University, Witold Pedrycz University of Alberta, Dilshat Salikhov Innopolis University, Giancarlo Succi | ||
04:25 5mTalk | Cross-Domain Deep Code Search with Few-Shot Learning Technical Track Yitian Chai School of Software, Shanghai Jiao Tong University, Hongyu Zhang University of Newcastle, Beijun Shen School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Xiaodong Gu Shanghai Jiao Tong University, China DOI Pre-print Media Attached File Attached |
Thu 12 MayDisplayed time zone: Eastern Time (US & Canada) change
22:00 - 23:00 | Reuse and EconomicsNIER - New Ideas and Emerging Results / SEIP - Software Engineering in Practice / Technical Track / Journal-First Papers at ICSE room 3-even hours Chair(s): Rick Kazman University of Hawai‘i at Mānoa | ||
22:00 5mTalk | CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words Journal-First Papers Chao Liu Chongqing University, Xin Xia Huawei Software Engineering Application Technology Lab, David Lo Singapore Management University, Zhiwei Liu Baidu Inc., Ahmed E. Hassan Queen's University, Shanping Li Zhejiang University Link to publication DOI Pre-print Media Attached | ||
22:05 5mTalk | The best defense is a good defense: adapting negotiation methods for tackling pressure over software project estimates NIER - New Ideas and Emerging Results Patricia Matsubara UFAM & UFMS, Igor Steinmacher Northern Arizona University, Bruno Gadelha UFAM, Tayana Conte Universidade Federal do Amazonas Pre-print Media Attached | ||
22:10 5mTalk | The Unexplored Terrain of Compiler Warnings SEIP - Software Engineering in Practice Gunnar Kudrjavets University of Groningen, Aditya Kumar Snap, Inc., Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands DOI Pre-print Media Attached | ||
22:15 5mTalk | SnR: Constraint-Based Type Inference for Incomplete Java Code Snippets Technical Track Yiwen Dong University of Waterloo, Tianxiao Gu Tiktok Inc., Yongqiang Tian University of Waterloo, Chengnian Sun University of Waterloo Pre-print Media Attached | ||
22:20 5mTalk | Cross-Domain Deep Code Search with Few-Shot Learning Technical Track Yitian Chai School of Software, Shanghai Jiao Tong University, Hongyu Zhang University of Newcastle, Beijun Shen School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Xiaodong Gu Shanghai Jiao Tong University, China DOI Pre-print Media Attached File Attached | ||
22:25 5mTalk | GitHub Sponsors: Exploring a New Way to Contribute to Open Source Technical Track Naomichi Shimada Nara Institute of Science and Technology, Tao Xiao Nara Institute of Science and Technology, Hideaki Hata Shinshu University, Christoph Treude University of Melbourne, Kenichi Matsumoto Nara Institute of Science and Technology DOI Pre-print Media Attached |