Write a Blog >>
ASE 2020
Mon 21 - Fri 25 September 2020 Melbourne, Australia
Tue 22 Sep 2020 16:50 - 17:00 at Koala - Maintenance and Evolution (3) Chair(s): Yongjie Zheng
Thu 24 Sep 2020 10:40 - 10:45 at Wombat - Tool Demo Showcase (3) Chair(s): Csaba Nagy

In this paper, we present Sosed, a tool for discovering similar software projects. We use fastText to compute the embeddings of sub-tokens into a dense space for 120,000 GitHub repositories in 200 languages. Then, we cluster embeddings to identify groups of semantically similar sub-tokens that reflect topics in source code. We use a dataset of 9 million GitHub projects as a reference search base. To identify similar projects, we compare the distributions of clusters among their sub-tokens. The tool receives an arbitrary project as input, extracts sub-tokens in 16 most popular programming languages, computes cluster distribution, and finds projects with the closest distribution in the search base. We labeled sub-token clusters with short descriptions to enable Sosed to produce interpretable output.

Sosed is available at https://github.com/JetBrains-Research/sosed/. The tool demo is available at https://www.youtube.com/watch?v=LYLkztCGRt8. The multi-language extractor of sub-tokens is available separately at https://github.com/JetBrains-Research/identifiers-extractor/.

Tue 22 Sep

Displayed time zone: (UTC) Coordinated Universal Time change

16:00 - 17:00
Maintenance and Evolution (3)Research Papers / Tool Demonstrations at Koala
Chair(s): Yongjie Zheng California State University San Marcos
16:00
20m
Talk
Subdomain-Based Generality-Aware Debloating
Research Papers
Qi Xin Georgia Institute of Technology, Myeongsoo Kim Georgia Institute of Technology, Qirun Zhang Georgia Institute of Technology, USA, Alessandro Orso Georgia Tech
16:20
20m
Talk
Revisiting the relationship between fault detection, test adequacy criteria, and test set size.
Research Papers
Yiqun Chen University of Washington, Rahul Gopinath CISPA Helmholtz Center for Information Security, Anita Tadakamalla George Mason University, USA, Michael D. Ernst University of Washington, USA, Reid Holmes University of British Columbia, Gordon Fraser University of Passau, Paul Ammann George Mason University, USA, René Just University of Washington, USA
16:40
10m
Talk
WASim: Understanding WebAssembly Applications through Classification
Tool Demonstrations
Alan Romano University at Buffalo, SUNY, Weihang Wang University at Buffalo, SUNY
16:50
10m
Talk
Sosed: a tool for finding similar software projects
Tool Demonstrations
Egor Bogomolov JetBrains Research, Yaroslav Golubev JetBrains Research, Artyom Lobanov JetBrains Research, Vladimir Kovalenko JetBrains Research, JetBrains N.V., Timofey Bryksin JetBrains Research, Saint Petersburg State University

Thu 24 Sep

Displayed time zone: (UTC) Coordinated Universal Time change

10:20 - 11:20
Tool Demo Showcase (3)Tool Demonstrations at Wombat
Chair(s): Csaba Nagy Software Institute - USI, Lugano, Switzerland
10:20
5m
Talk
FILO: FIx-LOcus Localization for Backward Incompatibilities Caused by Android Framework Upgrades
Tool Demonstrations
Marco Mobilio University of Milano Bicocca, Oliviero Riganelli University of Milano-Bicocca, Italy, Daniela Micucci University of Milano-Bicocca, Italy, Leonardo Mariani University of Milano Bicocca
10:25
5m
Talk
EXPRESS: An Energy-Efficient and Secure Framework for Mobile Edge Computing and Blockchain based Smart Systems
Tool Demonstrations
Jia Xu School of Computer Science and Technology, Anhui University, Xiao Liu School of Information Technology, Deakin University, Xuejun Li School of Computer Science and Technology, Anhui University, Lei Zhang Antwork Robotics Co., Ltm., Hangzhou, China, Yun Yang Swinburne University of Technology
10:30
5m
Talk
SmartBugs: A Framework to Analyze Solidity Smart Contracts
Tool Demonstrations
João F. Ferreira INESC-ID and IST, University of Lisbon, Pedro Cruz IST, University of Lisbon, Portugal, Thomas Durieux KTH Royal Institute of Technology, Sweden, Rui Abreu Faculty of Engineering, University of Porto, Portugal
DOI
10:35
5m
Talk
RepoSkillMiner: Identifying software expertise from GitHub repositories using Natural Language Processing
Tool Demonstrations
Efstratios Kourtzanidis University Of Macedonia, Alexander Chatzigeorgiou University of Macedonia, Apostolos Ampatzoglou University of Macedonia
Pre-print Media Attached File Attached
10:40
5m
Talk
Sosed: a tool for finding similar software projects
Tool Demonstrations
Egor Bogomolov JetBrains Research, Yaroslav Golubev JetBrains Research, Artyom Lobanov JetBrains Research, Vladimir Kovalenko JetBrains Research, JetBrains N.V., Timofey Bryksin JetBrains Research, Saint Petersburg State University
10:45
5m
Talk
GUI2WiRe: Rapid Wireframing with a Mined and Large-Scale GUI Repository using Natural Language Requirements
Tool Demonstrations
Kristian Kolthoff Institute for Enterprise Systems (InES), University Of Mannheim, Christian Bartelt Institute for Software and Systems Engineering, TU Clausthal, Simone Paolo Ponzetto Data and Web Science Group, University of Mannheim
10:50
30m
Live Q&A
Q&A or Discussion
Tool Demonstrations