Infusing ML into VM Provisioning in Cloud
In cloud computing, provisioning virtual machines (VMs) fast and reliably is a fundamental yet challenging problem, particularly in cloud environments of changing workloads and shifting demand patterns. A trivial solution of instantiating a VM from scratch per customer request may not satisfy business SLA’s and would degrade customer experience. Hence, provisioning VMs in advance motivated with machine learning (ML) infused into the cloud computing system to predict upcoming VM request demands is an advisable solution. In this paper, we first describe a number of system integration challenges including 1) how to achieve low latency provisioning to quickly adjust to customer demand pattern shifts, 2) how to efficiently scale to serve a large number of VM configurations supported in the cloud environment, and 3) how to reliably consume recommended prediction results for VM provision despite of anticipated operation failures and timeout. We then present the high level solution design with discussions of our developed system to address the aforementioned challenges. Our system has been deployed successfully in Microsoft Azure exhibiting significant improvements for VM provisioning experience with regards to latency and reliability requirements.
Sat 29 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
16:30 - 17:20 | Project Showcase SessionCloudIntelligence 2021 at CloudIntelligence Room Chair(s): Yingnong Dang Microsoft, USA | ||
16:30 12mDemonstration | Building a Secured Data Intelligence Platform CloudIntelligence 2021 Conan Yang Salesforce | ||
16:42 12mDemonstration | Infusing ML into VM Provisioning in Cloud CloudIntelligence 2021 Chuan Luo Microsoft Research, China, Randolph Yao Microsoft, USA, Bo Qiao Microsoft Research, Beijing, China, Qingwei Lin Microsoft Research, Beijing, China, Tri M. Tran Microsoft Azure, Gil Shafriri Microsoft Azure, Yingnong Dang Microsoft, USA, Raphael Ghelman Microsoft Azure, Pulak Goyal Microsoft Azure, Eli Cortez Microsoft Azure, Daud Howlader Microsoft Azure, Sushant Rewaskar Microsoft Azure, Murali Chintalapati Microsoft Azure, Dongmei Zhang Microsoft Research | ||
16:55 12mDemonstration | F3: Fault Forecasting Framework for Cloud Systems CloudIntelligence 2021 Chuan Luo Microsoft Research, China, Pu Zhao Microsoft Research, Beijing, China, Bo Qiao Microsoft Research, Beijing, China, Youjiang Wu Microsoft, USA, Yingnong Dang Microsoft, USA, Murali Chintalapati Microsoft Azure, Susy Yi Microsoft 365, Paul Wang Microsoft 365, Andrew Zhou Microsoft 365, Saravanakumar Rajmohan Microsoft Office, United States, Qingwei Lin Microsoft Research, Beijing, China, Dongmei Zhang Microsoft Research | ||
17:07 12mDemonstration | SEAT: statistically sound infra-side deployment and integration testing CloudIntelligence 2021 Nutcha Temiyasathit Facebook, Tao Yang Facebook, Karan Luthra Facebook, Nick Ruff Facebook, Petar Zuljevic Facebook, Ethan Benowitz Facebook, Boris Baracaldo Facebook, Oytun Eskiyenenturk Facebook, Xin Fu Facebook |
Go directly to this room on Clowdr