LoCoML: A Framework for Real-World ML Inference Pipelines
The widespread adoption of machine learning (ML) has brought forth diverse models with varying architectures, data requirements, introducing new challenges in integrating these systems into real-world applications. Traditional solutions often struggle to manage the complexities of connecting heterogeneous models, especially when dealing with varied technical specifications. These limitations are amplified in large-scale, collaborative projects where stakeholders contribute models with different technical specifications. To address these challenges, we developed LoCoML, a low-code framework designed to simplify the integration of diverse ML models within the context of the XYZ Project - a large-scale initiative aimed at integrating AI-driven language technologies such as automatic speech recognition, machine translation, text-to-speech, and optical character recognition to support seamless communication across more than 20 languages. Initial evaluations show that LoCoML adds only a small amount of computational load, making it efficient and effective for large-scale ML integration. Our practical insights shows that a low-code approach can be a practical solution for connecting multiple ML models in a collaborative environment.
Sun 27 AprDisplayed time zone: Eastern Time (US & Canada) change
14:00 - 15:30 | Architecting and Testing AI SystemsResearch and Experience Papers at 208 Chair(s): Jan-Philipp Steghöfer XITASO GmbH IT & Software Solutions | ||
14:00 15mTalk | How Do Model Export Formats Impact the Development of ML-Enabled Systems? A Case Study on Model IntegrationDistinguished paper Award Candidate Research and Experience Papers Shreyas Kumar Parida ETH Zurich, Ilias Gerostathopoulos Vrije Universiteit Amsterdam, Justus Bogner Vrije Universiteit Amsterdam Pre-print | ||
14:15 15mTalk | RAGProbe: Breaking RAG Pipelines with Evaluation ScenariosDistinguished paper Award Candidate Research and Experience Papers Shangeetha Sivasothy Applied Artificial Intelligence Institute, Deakin University, Scott Barnett Deakin University, Australia, Stefanus Kurniawan Deakin University, Zafaryab Rasool Applied Artificial Intelligence Institute, Deakin University, Rajesh Vasa Deakin University, Australia | ||
14:30 15mTalk | On-Device or Remote? On the Energy Efficiency of Fetching LLM-Generated Content Research and Experience Papers Vince Nguyen Vrije Universiteit Amsterdam, Hieu Huynh Vrije Universiteit Amsterdam, Vidya Dhopate Vrije Universiteit Amsterdam, Anusha Annengala Vrije Universiteit Amsterdam, Hiba Bouhlal Vrije Universiteit Amsterdam, Gian Luca Scoccia Gran Sasso Science Institute, Matias Martinez Universitat Politècnica de Catalunya (UPC), Vincenzo Stoico Vrije Universiteit Amsterdam, Ivano Malavolta Vrije Universiteit Amsterdam Pre-print Media Attached | ||
14:45 10mTalk | LoCoML: A Framework for Real-World ML Inference Pipelines Research and Experience Papers Kritin Maddireddy IIIT Hyderabad, Santhosh Kotekal Methukula IIIT Hyderabad, Chandrasekar S IIIT Hyderabad, Karthik Vaidhyanathan IIIT Hyderabad | ||
14:55 10mTalk | Towards Continuous Experiment-driven MLOps Research and Experience Papers Keerthiga Rajenthiram Vrije Universiteit Amsterdam, Milad Abdullah Charles University, Ilias Gerostathopoulos Vrije Universiteit Amsterdam, Petr Hnětynka Charles University, Tomas Bures Charles University, Czech Republic, Gerard Pons Universitat Politècnica de Catalunya, Barcelona, Spain, Besim Bilalli Universitat Politècnica de Catalunya, Barcelona, Spain, Anna Queralt Universitat Politècnica de Catalunya, Barcelona, Spain | ||
15:05 25mOther | Discussion Research and Experience Papers |