WebAssembly with Wasi-NN for Edge Machine Learning Inference: Experiences and Lessons Learned
Industry Track Paper
In industrial automation, machine learning (ML) is used to analyze sensor data for predictive maintenance or process optimization. To reduce latency and protect intellectual property, selected inference tasks are preferably run on small edge clusters instead of in the cloud. Ideally, such tasks should be implemented as portable, hardware-agnostic software and executed in a secure environment that can handle untrusted third-party software. A potential solution is to bundle the machine learning model execution in machine-neutral WebAssembly code. The novel wasi-nn API proposal enables efficient execution of ML inference tasks from within a WebAssembly sandbox through a vendor-neutral interface. In this work, we show how these technologies can be applied to address this challenge. We analyze existing wasi-nn implementations and design a generic architecture for “isolated inference at the edge”, with a prototype implemented in Rust. We find that the wasi-nn ecosystem is still immature and that native libraries are often required, which impairs the desired portability. Because wasi-nn implementations rely on native libraries, the performance overhead of execution in WebAssembly is insignificant. Finally, we discuss alternatives, such as creating custom host APIs or compiling machine learning frameworks to WebAssembly.
Wed 17 Sep (displayed time zone: Athens)
14:00 - 15:30 | Session 2 - LLMs in Software Architecture | Research Papers / Industry Program at Phoenix | Chair(s): Zadia Codabux (University of Saskatchewan)
14:00 | 30m | Full-paper | Using Incremental LLM Context for Cost Reduction in LLM-Driven IoT Applications | Research Track Paper | Research Papers
14:30 | 30m | Full-paper | LLMs for Software Architecture Knowledge: A Comparative Analysis among Seven LLMs | Research Track Paper | Research Papers | Mohamed Soliman (Paderborn University); Elia Ashraf (Heinz Nixdorf Institut, Paderborn University); Kamel M. K. Abdelsalam (Ain Shams University); Jan Keim (Karlsruhe Institute of Technology (KIT)); Ashwin Prasad Shivarpatna Venkatesh (Heinz Nixdorf Institut, Paderborn University)
15:00 | 30m | Full-paper | WebAssembly with Wasi-NN for Edge Machine Learning Inference: Experiences and Lessons Learned | Industry Track Paper | Industry Program | Joshua Bachmeier (FZI Research Center for Information Technology); Vladimir Yussupov (ABB Corporate Research); Jörg Henß (FZI Forschungszentrum Informatik); Heiko Koziolek (ABB Corporate Research)