Augmenting and Structuring User Queries to Support Efficient Free-Form Code Search
Source code terms such as method names and variable types are often different from conceptual words mentioned in a search query. This vocabulary mismatch problem can make code search inefficient. In this paper, we present COde voCABUlary (COCABU), an approach to resolving the vocabulary mismatch problem when dealing with free-form code search queries. Our approach leverages common developer questions and the associated expert answers to augment user queries with the relevant, but missing, structural code entities in order to improve the performance of matching relevant code examples within large code repositories. To instantiate this approach, we build GITSEARCH, a code search engine, on top of GitHub and Stack Overflow Q&A data. We evaluate GITSEARCH in several dimensions to demonstrate that (1) its code search results are correct with respect to user-accepted answers; (2) the results are qualitatively better than those of existing Internet-scale code search engines; (3) our engine is competitive against web search engines, such as Google, in helping users solve programming tasks; and (4) GITSEARCH provides code examples that are acceptable or interesting to the community as answers for Stack Overflow questions.
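As a rough illustration of the query-augmentation idea described in the abstract, the sketch below matches a free-form query against a small set of developer questions and borrows the code entities from the best-matching answers. It is not the COCABU or GITSEARCH implementation: the `QA_INDEX` data, the `augment_query` function, the Jaccard-overlap matching, and the `class`/`method` fields are all assumptions made for this example.

```python
import re

# Hypothetical toy index: each entry pairs a question title with the code
# entities assumed to appear in its accepted answer. Illustrative data only.
QA_INDEX = [
    ("how to read a text file line by line in java",
     {"class": ["BufferedReader", "FileReader"], "method": ["readLine"]}),
    ("how to generate a random number in java",
     {"class": ["Random"], "method": ["nextInt"]}),
    ("read file into string java",
     {"class": ["Files", "Paths"], "method": ["readAllBytes"]}),
]


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def augment_query(query, index=QA_INDEX, top_k=2):
    """Add code entities from the top_k most similar questions to the query.

    Returns a structured query: the original text plus lists of class and
    method names that the free-form text did not mention.
    """
    query_tokens = tokenize(query)
    scored = [(jaccard(query_tokens, tokenize(title)), entities)
              for title, entities in index]
    # Stable sort by score only, so ties keep their index order.
    scored.sort(key=lambda pair: pair[0], reverse=True)

    augmented = {"text": query, "class": [], "method": []}
    for score, entities in scored[:top_k]:
        if score == 0.0:
            continue  # skip questions that share no words with the query
        for field in ("class", "method"):
            for name in entities.get(field, []):
                if name not in augmented[field]:
                    augmented[field].append(name)
    return augmented


if __name__ == "__main__":
    print(augment_query("read file line by line"))
```

The structured result could then be matched field by field against an index of code snippets, so that a query written in conceptual words also retrieves code that uses the corresponding class and method names.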
Fri 1 Jun. Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna
| 11:00 - 12:30 | Session: Code Search, Synthesis, Performance (Technical Papers / Journal first papers), at H1 room. Chair(s): Shahar Maoz (Tel Aviv University) |
| 11:00, 20m, Research paper | Deep Code Search (Technical Papers). Xiaodong Gu (The Hong Kong University of Science and Technology), Hongyu Zhang (The University of Newcastle), Sunghun Kim (Hong Kong University of Science and Technology) |
| 11:20, 20m, Talk | Augmenting and Structuring User Queries to Support Efficient Free-Form Code Search (Journal first papers). Raphaël Sirres, Tegawendé F. Bissyandé (University of Luxembourg, Luxembourg), Dongsun Kim (University of Luxembourg), David Lo (Singapore Management University), Jacques Klein (University of Luxembourg, SnT), Kisub Kim (University of Luxembourg, SnT), Yves Le Traon (University of Luxembourg) |
| 11:40, 20m, Talk | FaCoY – A Code-to-Code Search Engine (Technical Papers). Kisub Kim (University of Luxembourg, SnT), Dongsun Kim (University of Luxembourg), Tegawendé F. Bissyandé (University of Luxembourg, Luxembourg), Eunjong Choi (Nara Institute of Science and Technology), Li Li (University of Luxembourg, Luxembourg), Jacques Klein (University of Luxembourg, SnT), Yves Le Traon (University of Luxembourg) |
| 12:00, 20m, Talk | Generalized Data Structure Synthesis (Technical Papers). Calvin Loncaric (University of Washington), Michael D. Ernst (University of Washington, USA), Emina Torlak (University of Washington) |
| 12:20, 10m, Talk | Q&A in groups (Technical Papers) |





