WOLFFI: A fault injection platform for learning AIOps models
In today’s IT environment with a growing number of costly outages, increasing complexity of the systems, and availability of massive operational data, there is a strengthening demand to effectively leverage Artificial Intelligence and Machine Learning (AI/ML) towards enhanced resiliency.
In this paper, we present an automatic fault injection platform to enable and optimize the generation of data needed for building AI/ML models to support modern IT operations. The merits of our platform include the ease of use, the possibility to orchestrate complex fault scenarios and to optimize the data generation for the modeling task at hand. Specifically, we designed a fault injection service that (i) combines fault injection with data collection in a unified framework, (ii) supports hybrid and multi-cloud environments, and (iii) does not require programming skills for its use. Our current implementation covers the most common fault types both at the application and infrastructure levels. The platform also includes some AI capabilities. In particular, we demonstrate the interventional causal learning capability currently available in our platform. We show how our system is able to learn a model of error propagation in a micro-service application in a cloud environment (when the communication graph among micro-services is unknown and only logs are available) for use in subsequent applications such as fault localization.
Thu 13 OctDisplayed time zone: Eastern Time (US & Canada) change
13:30 - 15:30 | Technical Session 26 - Testing IIIResearch Papers / Industry Showcase at Banquet B Chair(s): Owolabi Legunsen Cornell University | ||
13:30 20mResearch paper | PredART: Towards Automatic Oracle Prediction of Object Placements in Augmented Reality Testing Research Papers Tahmid Rafi University of Texas at San Antonio, Xueling Zhang Rochester Institute of Technology, Xiaoyin Wang University of Texas at San Antonio | ||
13:50 20mResearch paper | Neuroevolution-Based Generation of Tests and Oracles for Games Research Papers Pre-print | ||
14:10 20mIndustry talk | WOLFFI: A fault injection platform for learning AIOps models Industry Showcase Frank Bagehorn IBM Research, Jesus Rios IBM Research, Saurabh Jha IBM Research, Robert Filepp IBM Research, Larisa Shwartz IBM T.J. Watson Research, Naoki Abe IBM, Xi Yang IBM Research | ||
14:30 20mResearch paper | Learning to Construct Better Mutation FaultsVirtualACM SIGSOFT Distinguished Paper Award Research Papers Zhao Tian Tianjin University, Junjie Chen Tianjin University, Qihao Zhu Peking University, Junjie Yang College of Intelligence and Computing, Tianjin University, Lingming Zhang University of Illinois at Urbana-Champaign DOI Pre-print | ||
14:50 20mResearch paper | Differentially Testing Database Transactions for Fun and ProfitVirtual Research Papers Ziyu Cui Institute of Software, Chinese Academy of Sciences, Wensheng Dou Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Qianwang Dai Institute of Software, Chinese Academy of Sciences, Jiansen Song , Wei Wang , Jun Wei Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Dan Ye Institute of Software, Chinese Academy of Sciences |