Energy cost and machine learning accuracy impact of k-anonymisation and synthetic data techniques (ICT4S 2023 - Research Papers)

Who

Pepijn de Reus, Ana Oprescu, Koen van Elsen

Track

ICT4S 2023 Research Papers

Time Zone

The program is currently displayed in (GMT+02:00) Brussels, Copenhagen, Madrid, Paris.

Use conference time zone: (GMT+02:00) Brussels, Copenhagen, Madrid, ParisSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 6 Jun 2023 15:07 - 15:30 at Amphitheater - Session #2 Chair(s): Daniel Pargman

Abstract

To address increasing societal concerns regarding privacy and climate, the EU adopted the General Data Protection Regulation (GDPR) and committed to the Green Deal. Considerable research studied the energy efficiency of software and the accuracy of machine learning models trained on anonymised data sets. Recent work began exploring the impact of privacy-enhancing techniques (PET) on \textit{both} the energy consumption and accuracy of the machine learning models, focusing on $k$-anonymity. As synthetic data is becoming an increasingly popular PET, this paper analyses the energy consumption and accuracy of two phases: a) applying privacy-enhancing techniques to the concerned data set, b) training the models on the concerned privacy-enhanced data set. We use two privacy-enhancing techniques: k-anonymisation (using generalisation and suppression) and synthetic data, and three machine-learning models. Each model is trained on each privacy-enhanced data set. Our results show that models trained on k-anonymised data consume less energy than models trained on the original data, with a similar performance regarding accuracy. Models trained on synthetic data have a similar energy consumption and a similar to lower accuracy compared to models trained on the original data.

Link to Preprint

https://arxiv.org/abs/2305.07116

Pepijn de Reus

University of Amsterdam

Netherlands

Ana Oprescu

University of Amsterdam

Netherlands

Koen van Elsen

Universiteit van Amsterdam

Netherlands

Time Zone

The program is currently displayed in (GMT+02:00) Brussels, Copenhagen, Madrid, Paris.

Use conference time zone: (GMT+02:00) Brussels, Copenhagen, Madrid, ParisSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 6 Jun
Displayed time zone: Brussels, Copenhagen, Madrid, Paris change

14:00 - 15:30	Session #2Journal First / Research Papers at Amphitheater Chair(s): Daniel Pargman KTH Royal Institute of Technology

14:00 22m Talk		Just measure IT! – Electricity consumption measurements of electronic devices and estimates of datacenter and network services for one householdResearch Paper Research Papers Jens Malmodin Ericsson Research
14:22 22m Talk		The Impact of Green Feedback on Users' Software UsageJournal First Journal First Adel Noureddine LIUPPA, Université de Pau et des Pays de l'Adour, Noëlle Bru Université de Pau et des Pays de l'Adour, Richard Chbeir Université de Pau et des Pays de l'Adour, Martín Diéguez Lodeiro University of Angers Link to publication
14:45 22m Talk		Evolution of Kotlin Apps in terms of energy consumption: An Exploratory StudyResearch Paper Research Papers Hesham Ahmed Vrije Universiteit Amsterdam, Alina Boshchenko , Niaz Khan Vrije Universiteit Amsterdam, Dmitriy Knyajev Vrije Universiteit Amsterdam, Dinara Garifollina Vrije Universiteit Amsterdam, Gian Luca Scoccia University of L'Aquila, Matias Martinez Universitat Politècnica de Catalunya (UPC), Ivano Malavolta Vrije Universiteit Amsterdam
15:07 22m Talk		Energy cost and machine learning accuracy impact of k-anonymisation and synthetic data techniquesResearch Paper Research Papers Pepijn de Reus University of Amsterdam, Ana Oprescu University of Amsterdam, Koen van Elsen Universiteit van Amsterdam Pre-print