Exploring the Performance of ML Model Size for Classification in Relation to Energy Consumption
The use of large language models (LLMs) is being explored for a multitude of tasks in software engineering (SE), ranging from code generation to bug report assignment. Although LLMs provide impressive results, they require more time and energy than some other machine learning models. For some tasks, simpler models may be more sustainable than LLMs. In this paper, we construct natural language classifiers of different complexity for a use case in the SE domain: commit message classification. We compare the performance of each model with the state-of-the-art with regard to energy consumption for training and inference. We find that simpler models based on Naïve Bayes and LSTM perform similarly to LLMs, while using a fraction of the energy, suggesting that choosing a small model can lead to significant reduction in power usage without compromising performance.
Tue 2 DecDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
11:30 - 13:00 | Artificial Intelligence and Analytics in Software EngineeringIndustry Papers / Research Papers / Short Papers and Posters at Sala degli Affreschi (Fresco Room) Chair(s): Antonio Martini University of Oslo, Norway | ||
11:30 15mTalk | Generating Business Process Models with Open Source Large Language Models using Instruction Tuning Research Papers Gökberk Çelikmasat Boğaziçi University, Atay Özgövde Boğaziçi University, Fatma Başak Aydemir Utrecht University | ||
11:45 15mTalk | Application of Large Language Models in Product Management: A Systematic Literature Review Research Papers Vitor Mori Eindhoven University of Technology (TU/e), Jan Bosch Chalmers University of Technology, Helena Holmström Olsson Malmö University | ||
12:00 15mTalk | Towards Understanding Team Congestion in Large-Scale Software Development Research Papers Javier Gonzalez-Huerta Blekinge Institute of Technology, Ehsan Zabardast Nordea / Blekinge Institute of Technology | ||
12:15 10mTalk | A Small Dataset May Go a Long Way: Process Duration Prediction in Clinical Settings Industry Papers | ||
12:25 7mTalk | Prompts as Software Engineering Artifacts: A Research Agenda and Preliminary Findings Short Papers and Posters Hugo Villamizar fortiss GmbH, Jannik Fischbach Netlight Consulting GmbH and fortiss GmbH, Alexander Korn University of Duisburg-Essen, Andreas Vogelsang paluno – The Ruhr Institute for Software Technology, University of Duisburg-Essen, Daniel Mendez Blekinge Institute of Technology and fortiss | ||
12:32 7mTalk | MAPS-AI – A Tool for AI-Assisted Model-Driven Generation of IT Project Plan and Scope Short Papers and Posters Oksana Nikiforova Riga Technical University, Rihards Bobkovs Riga Technical University, Megija Krista Miļūne Riga Technical University, Kristaps Babris Riga Technical University, Oscar Pastor Universitat Politecnica de Valencia, Jānis Grabis Riga Technical University | ||
12:39 7mTalk | Cost of Artificial Intelligence in Finnish Software Companies: A Survey Short Papers and Posters Antti Klemetti University of Helsinki, Anssi Sorvisto University of Jyväskylä, Mikko Raatikainen University of Helsinki, Jukka K. Nurminen University of Helsinki | ||
12:46 7mTalk | Exploring the Performance of ML Model Size for Classification in Relation to Energy Consumption Short Papers and Posters Andreas Bexell Ericsson, Lo Heander Lund University, Emma Söderberg Lund University, Sigrid Eldh Ericsson AB, Mälardalen University, Carleton University, Per Runeson Lund University | ||
12:53 7mTalk | On the Use of Agentic Coding Manifests: An Empirical Study of Claude Code Short Papers and Posters Worawalan Chatlatanagulchai Kasetsart University, Kundjanasith Thonglek Kasetsart University, Brittany Reid Nara Institute of Science and Technology, Yutaro Kashiwa Nara Institute of Science and Technology, Pattara Leelaprute Kasetsart University, Arnon Rungsawang Kasetsart University, Bundit Manaskasemsak Kasetsart University, Hajimu Iida Nara Institute of Science and Technology | ||