ICSE 2025 (series) / Tutorials and Technical Briefings /
AIware: Balancing Cost and Quality in FMware
SE for AI
This program is tentative and subject to change.
Sat 3 May 2025 15:30 - 16:00 at FSS2005 - AIware Bootcamp Sat 15:30
This session will provide an in-depth exploration of Foundation Model (FM) routing for FMware, focusing on balancing quality and inference cost. Covered topics include:
- An introduction to FM routing, where requests are routed to FMs of varying sizes and capabilities
- A survey of existing routing methods that rely on data-driven learning to make optimal routing decisions
- The challenges posed by existing approaches, such as reliance on curated data, complex computations, and the evolution of weaker FMs
- The introduction of Real-time Adaptive Routing (RAR), a novel approach that continuously adapts FM routing decisions using guided in-context learning
- How RAR reduces dependence on stronger, more expensive FMs while maintaining high response quality
- The intra-domain generalization benefits of RAR’s guided learning approach in enhancing weaker FMs
This program is tentative and subject to change.
Sat 3 MayDisplayed time zone: Eastern Time (US & Canada) change
Sat 3 May
Displayed time zone: Eastern Time (US & Canada) change
15:30 - 16:00 | |||
15:30 30mTalk | AIware: Balancing Cost and Quality in FMware SE for AI Tutorials and Technical Briefings Kirill Vasilevski Huawei Canada |