Modern distributed systems are designed to tolerate node crashes. However, incorrect crash recovery mechanisms and implementations can still introduce crash recovery bugs, and hurt reliability and availability of cloud systems. In this paper, we present Deminer, a novel crash injection technique that automatically injects node crashes/reboots to effectively expose crash recovery bugs in cloud systems. We observe that, node crashes that interrupt the execution of related operations, which store common data to different places (i.e., different storage paths or nodes), are more likely to trigger crash recovery bugs. Based on this observation, Deminer first tracks the critical data usage in a correct run. Then Deminer identifies related operations and predicts error-prone crash points. Finally, Deminer tests the predicted crash points and checks whether the target system can behave correctly. We have evaluated Deminer on three widely-used cloud systems: ZooKeeper, HBase and HDFS. Deminer has detected 6 crash recovery bugs. A video demonstration of Deminer is available at https://youtu.be/7wWqXlpSjiM.
Mon 9 MayDisplayed time zone: Eastern Time (US & Canada) change
20:00 - 21:00 | Dynamic AnalysisDEMO - Demonstrations at ICSE Demo room 1 Chair(s): Shiyi Wei University of Texas at Dallas | ||
20:00 15mDemonstration | Common Data Guided Crash Injection for Cloud Systems DEMO - Demonstrations Yu Gao Institute of Software, Chinese Academy of Sciences, China, Dong Wang Institute of software, Chinese academy of sciences, Qianwang Dai Institute of Software, Chinese Academy of Sciences, Wensheng Dou Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Jun Wei Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences Pre-print Media Attached | ||
20:15 15mDemonstration | Dynaplex: Inferring Asymptotic Runtime Complexity of Recursive Programs DEMO - Demonstrations Didier Ishimwe University of Nebraska-Lincoln, ThanhVu Nguyen George Mason University, KimHao Nguyen University of Nebraska-Lincoln Pre-print Media Attached | ||
20:30 15mDemonstration | DistFax: A Toolkit for Measuring Interprocess Communications and Quality of Distributed Systems DEMO - Demonstrations Xiaoqin Fu Washington State University, Boxiang Lin Washington State University, Haipeng Cai Washington State University, USA DOI Pre-print Media Attached |