Case Study: Neural Network Malware Detection Verification for Feature and Image Datasets (FormaliSE 2024)

Who

Preston K. Robinette, Diego Manzanas Lopez, Serena Serbinowska, Kevin Leach, Taylor T Johnson

Track

FormaliSE 2024

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 15 Apr 2024 14:00 - 14:30 at Eugénio de Andrade - Machine learning and formal methods Chair(s): Stefania Gnesi

Abstract

Malware, or software designed with harmful intent, is an ever-evolving threat that can have drastic effects on both individuals and institutions. Neural network malware classification systems are key tools for combating these threats but are vulnerable to adversarial machine learning attacks. These attacks perturb input data to cause misclassification, bypassing these protective systems. Existing defenses often rely on enhancing the training process, thereby increasing the model’s robustness to these perturbations, which is quantified using verification. While training improvements are necessary, we propose focusing on the verification process used to evaluate improvements to training. As such, we present a case study that evaluates a novel verification domain that will help to ensure tangible safeguards against adversaries and provide a more reliable means of evaluating the robustness and effectiveness of anti-malware systems. To do so, we describe malware classification and two types of common malware datasets (feature and image datasets), demonstrate the certified robustness accuracy of malware classifiers using the Neural Network Verification (NNV) and Neural Network Enumeration (nnenum) tools, and outline the challenges and future considerations necessary for the improvement and refinement of the verification of malware classification. By evaluating this novel domain as a case study, we hope to increase its visibility, encourage further research and scrutiny, and ultimately enhance the resilience of digital systems against malicious attacks.

Link to Preprint

https://arxiv.org/abs/2404.05703

Preston K. Robinette

Vanderbilt University

Diego Manzanas Lopez

Vanderbilt University

Serena Serbinowska

Vanderbilt University

Kevin Leach

Vanderbilt University

United States

Taylor T Johnson

Vanderbilt University

United States

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 15 Apr
Displayed time zone: Lisbon change

14:00 - 15:30	Machine learning and formal methodsFormaliSE 2024 at Eugénio de Andrade Chair(s): Stefania Gnesi Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo"

14:00 30m Talk		Case Study: Neural Network Malware Detection Verification for Feature and Image Datasets FormaliSE 2024 Preston K. Robinette Vanderbilt University, Diego Manzanas Lopez Vanderbilt University, Serena Serbinowska Vanderbilt University, Kevin Leach Vanderbilt University, Taylor T Johnson Vanderbilt University Pre-print
14:30 30m Talk		Leveraging Large Language Models to Boost Dafny’s Developers Productivity FormaliSE 2024 Álvaro F. Silva Independent Researcher, Alexandra Mendes University of Porto and HASLab, INESC TEC, João F. Ferreira INESC-ID and IST, University of Lisbon
15:00 10m Day closing		Closing FormaliSE 2024