ICSE 2025 (series) /  Tutorials and Technical Briefings / AIware: Balancing Cost and Quality in FMware 
AIware: Balancing Cost and Quality in FMware SE for AI
Sat 3 May 2025 15:30 - 16:00 at FSS2005 - AIware Bootcamp Sat 15:30
This session will provide an in-depth exploration of Foundation Model (FM) routing for FMware, focusing on balancing quality and inference cost. Covered topics include:
- An introduction to FM routing, where requests are routed to FMs of varying sizes and capabilities
- A survey of existing routing methods that rely on data-driven learning to make optimal routing decisions
- The challenges posed by existing approaches, such as reliance on curated data, complex computations, and the evolution of weaker FMs
- The introduction of Real-time Adaptive Routing (RAR), a novel approach that continuously adapts FM routing decisions using guided in-context learning
- How RAR reduces dependence on stronger, more expensive FMs while maintaining high response quality
- The intra-domain generalization benefits of RAR’s guided learning approach in enhancing weaker FMs
Sat 3 MayDisplayed time zone: Eastern Time (US & Canada) change
Sat 3 May
Displayed time zone: Eastern Time (US & Canada) change
| 15:30 - 16:00 | |||
| 15:3030m Talk | AIware: Balancing Cost and Quality in FMware SE for AI Tutorials and Technical Briefings Kirill Vasilevski Huawei Canada | ||