MSR 2022
Mon 23 - Tue 24 May 2022
co-located with ICSE 2022

System call traces are an invaluable source of information about a program’s runtime behavior, and have been shown to be particularly useful for malware detection in Android apps. However, the paucity of publicly available, high-quality datasets hinders the development of the field. In this paper, we introduce TwinDroid, a dataset of over 1000 system call traces from both benign and infected Android apps. A large part of the dataset is composed of traces from pairs of benign and infected apps that are identical apart from the inclusion of malware in the latter. This makes TwinDroid an ideal basis for security research, and an earlier version of TwinDroid has already been used for this purpose. In addition to the dataset of traces, TwinDroid includes a fully automated trace generation pipeline, which allows users to seamlessly generate new traces in a standardized manner. This pipeline will allow the dataset to remain up to date and relevant despite the rapid pace of change that characterizes Android security.

Thu 19 May

Displayed time zone: Eastern Time (US & Canada)

20:00 - 20:50
Session 12: Integration & Large-Scale Mining
Technical Papers / Data and Tool Showcase Track at MSR Main room - even hours
Chair(s): Jin L.C. Guo McGill University, Amjed Tahir Massey University
20:00
4m
Short-paper
Is Open Source Eating the World’s Software? Measuring the Proportion of Open Source in proprietary software using Java Binaries
Technical Papers
Julius Musseau Mergebase, John Speed Meyers Chainguard, George P. Sieniawski IQT Labs, C. Albert Thompson Ford Motor Company, Daniel M. German University of Victoria
20:04
7m
Talk
Mining Code Review Data to Understand Waiting Times Between Acceptance and Merging: An Empirical Analysis
Technical Papers
Gunnar Kudrjavets University of Groningen, Aditya Kumar Snap, Inc., Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands
20:11
7m
Talk
Methods for Stabilizing Models across Large Samples of Projects (with case studies on Predicting Defect and Project Health)
Technical Papers
Suvodeep Majumder North Carolina State University, Tianpei Xia North Carolina State University, Rahul Krishna North Carolina State University, Tim Menzies North Carolina State University
20:18
7m
Talk
Do Small Code Changes Merge Faster? A Multi-Language Empirical Investigation
Technical Papers
Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands
20:25
7m
Talk
FaST: A linear time stack trace alignment heuristic for crash report deduplication
Technical Papers
Irving Muller Rodrigues Polytechnique Montreal, Montreal, Canada, Daniel Aloise Polytechnique Montreal, Eraldo Rezende Fernandes Leuphana University of Lüneburg
20:32
4m
Talk
TwinDroid: A Dataset of Android app System call traces and Trace Generation Pipeline
Data and Tool Showcase Track
Asma Razgallah Université du Québec à Chicoutimi, Canada, Raphael Khoury Université du Québec à Chicoutimi, Canada, Jean-Baptiste Poulet Université du Québec à Chicoutimi, Canada
20:36
14m
Live Q&A
Discussions and Q&A
Technical Papers

