DeployFix: Dynamic Repair of Software Deployment Failures via Constraint Solving
Software deployment misconfiguration often happens and has been one of the major causes of deployment failures that give rise to service interruptions. However, there is currently no existing approach to automatically repairing deployment failures. We propose DeployFix, which automatically repairs software deployment failures via constraint solving in the dynamic-changing deployment environments. DeployFix first defines DeployIR as a unified intermediate representation to achieve the translation of heterogeneous specifications from different schedulers with different syntaxes. By reducing the root-cause analysis of deployment failures to the conflict resolution in propositional logic, DeployFix uses off-the-shelf constraint solvers to achieve automatic localization and diagnosis of conflicting constraints, which are the root causes of deployment failures. DeployFix finally resolves the conflicting constraints and generates repaired deployment configurations in terms of practical requirements. We evaluate DeployFix in both simulation and production environments with tens of thousands of nodes at Alibaba, on which tens of thousands of applications are running guided by hundreds of thousands of deployment constraints. Experimental results demonstrate that DeployFix outperforms the state of the art and it correctly repairs the deployment failures in minutes, even in a large production data center.
Wed 30 OctDisplayed time zone: Pacific Time (US & Canada) change
10:30 - 12:00 | Log and trace; failure and faultResearch Papers / Industry Showcase at Carr Chair(s): Yiming Tang Rochester Institute of Technology | ||
10:30 15mTalk | Demonstration-Free: Towards More Practical Log Parsing with Large Language Models Research Papers | ||
10:45 15mTalk | Unlocking the Power of Numbers: Log Compression via Numeric Token Parsing Research Papers | ||
11:00 15mTalk | Towards Synthetic Trace Generation of Modeling Operations using In-Context Learning Approach Research Papers Vittoriano Muttillo University of Teramo, Claudio Di Sipio University of l'Aquila, Riccardo Rubei University of L'Aquila, Luca Berardinelli Johannes Kepler University Linz, MohammadHadi Dehghani Johannes Kepler University Linz | ||
11:15 15mTalk | DeployFix: Dynamic Repair of Software Deployment Failures via Constraint Solving Industry Showcase Haoyu Liao East China Normal University, Jianmei Guo East China Normal University, Bo Huang East China Normal University, Yujie Han East China Normal University, Dingyu Yang Zhejiang University, Kai Shi Alibaba Group, Jonathan Ding Intel, Guoyao Xu Alibaba Group, Guodong Yang Alibaba Group, Liping Zhang Alibaba Group | ||
11:30 15mTalk | FAIL: Analyzing Software Failures from the News Using LLMs Research Papers Dharun Anandayuvaraj Purdue University, Matthew Campbell Purdue University, Arav Tewari Purdue University, James C. Davis Purdue University DOI Pre-print | ||
11:45 15mTalk | Do not neglect what's on your hands: localizing software faults with exception trigger stream Research Papers Xihao Zhang School of Computer Science, Wuhan University, Yi Song School of Computer Science, Wuhan University, Xiaoyuan Xie Wuhan University, Qi Xin Wuhan University, Chenliang Xing School of Computer Science, Wuhan University |