TracePicker: Optimization-based Trace Sampling for Microservice-based Systems
Distributed tracing is a pivotal technique for software operators to understand and diagnose issues within microservice-based systems, offering a comprehensive view of user requests propagated through various services. However, the unprecedented volume of traces imposes expensive storage and analytical burdens for online systems. Conventional tracing implementations typically use random sampling with a fixed probability for each trace, posing a risk of losing valuable traces. To circumvent this loss, several tail-based sampling methods have been proposed to sample traces based on their content. Nevertheless, these methods primarily evaluate traces on an individual basis, neglecting the collective attributes of the sample set, including its comprehensiveness, balance, and consistency. To address these issues, we propose TracePicker, an optimization-based online sampler designed to enhance the quality of sampled data while mitigating storage burden. TracePicker employs a streaming anomaly detector to capture and retain anomalous traces that are crucial for troubleshooting. For normal traces, the sampling process is segmented into quota allocation and group sampling, formulating both as optimization problems. By solving these problems using dynamic programming and evolution algorithms, TracePicker selects a high-quality subset of data, minimizing overall information loss. Experimental results demonstrate that TracePicker outperforms existing tail-based sampling methods in terms of both sampling quality and time consumption.
Tue 24 JunDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
10:30 - 12:30 | Architecture, Services, and CloudIndustry Papers / Demonstrations / Research Papers / Ideas, Visions and Reflections at Andromeda Chair(s): Paris Avgeriou University of Groningen, The Netherlands | ||
10:30 10mTalk | Knowledge-Based Multi-Agent Framework for Automated Software Architecture Design Ideas, Visions and Reflections Yiran Zhang , Ruiyin Li Wuhan University, China; University of Groningen, The Netherlands, Peng Liang Wuhan University, China, Weisong Sun Nanyang Technological University, Yang Liu Nanyang Technological University | ||
10:40 10mTalk | Designing for Scalability: Building a Universal Serverless Messaging Architecture with Apache RocketMQ Industry Papers Juntao Ji Alibaba Cloud Computing Co. Ltd., Yubao Fu Alibaba Cloud Computing Co. Ltd., Rongtong Jin Alibaba Cloud Computing Co. Ltd., Qingshan Lin Alibaba Cloud Computing Co. Ltd. | ||
10:50 20mTalk | A Multimodal Intelligent Change Assessment Framework for Microservice Systems Based on Large Language Models Industry Papers Yongqian Sun Nankai University, zhengtinghua Nankai university, Xidao Wen BizSeer, Weihua Kuang Nankai university, Heng Liu CHINA TIANCHEN ENGINEERING CORPORATION LTD., Shenglin Zhang Nankai University, Chao Shen Nankai University, Bo Wu Tencent Technologies, Dan Pei Tsinghua University | ||
11:10 20mTalk | TracePicker: Optimization-based Trace Sampling for Microservice-based Systems Research Papers Shuaiyu Xie School of Computer Science, Wuhan University, China, Jian Wang Wuhan University, Maodong Li School of Computer Science, Wuhan University, China, Peiran Chen School of Computer Science, Wuhan University, China, Jifeng Xuan Wuhan University, Bing Li Wuhan University DOI | ||
11:30 10mTalk | Analyzing Evolution of Microservice-based Systems Demonstrations Tomas Cerny University of Arizona, Gabriel Goulis University of Arizona, Samanta Perry University of Arizona, Malia Edmonds University of Arizona, Amr Elsayed The University of Arizona, Matteo Esposito University of Oulu, Alexander Bakhtin University of Oulu, Valentina Lenarduzzi University of Oulu, Davide Taibi University of Oulu | ||
11:40 20mTalk | SemServGen: Advancing Industrial Domain-Specific Language Engineering through Semantic Service Generation Industry Papers Yong Wang Beihang University, Ning Ge School of Software, Beihang University, Jingyao Li Beihang University, Loulin Wang Beihang University, Guangyu Zhou Huawei, Chengrui Deng Huawei, Li Zhang Beihang University, Chunming Hu Beihang University | ||
12:00 10mTalk | CloudHeatMap: Heatmap-Based Monitoring for Large-Scale Cloud Systems Demonstrations Sarah Sohana Rogers Communications Canada Inc., Toronto, Canada, William Pourmajidi Toronto Metropolitan University, Toronto, Canada, John Steinbacher IBM, Andriy Miranskyy Toronto Metropolitan University (formerly Ryerson University) DOI Pre-print | ||
12:10 20mTalk | Te-PID: An Adaptive Erasure Coding Temperature Management System for Optimized Cloud Storage Industry Papers Pei Xiao Peking University, Lu Wang Microsoft Research, Fangkai Yang Microsoft Research, Guoqing Geng Microsoft, Haoran Li Microsoft, Jeff Zhu Microsoft, Yu Kang Microsoft Research, Yifan Li Microsoft, Terry Chen Microsoft, Yue Chen Microsoft, Saravan Rajmohan Microsoft 365, Qi Zhang Microsoft |
Andromeda is located close to the restaurant and the bar, at the end of the corridor on the side of the bar.
From the registration desk, go towards the restaurant, turn left towards the bar, walk until the end of the corridor.