Logs are diverse in structure and large in volume. While containing important information about systems at runtime, they must be preprocessed before analysis can be performed. First, logs need to be parsed into a useful format and second, often times, logs need to be separated into groups before to reduce noise. We identify two challenges in adopting preprocessing of logs in large scale: the preprocessing steps must be generalizable to handle the diversity and evolving nature of logs, and efficient to keep up with the large volume produced by applications. To tackle these challenges, we first focus our research on studying the use of identifiers used for log groupings. We then propose an alternative approach based on interleaved sequence models. We also investigate log parsing on console logs, a type of logs of which parsing is not well studied. Finally, we propose a log parsing technique based on entropy estimated with a language model.
Tue 29 AprDisplayed time zone: Eastern Time (US & Canada) change
14:00 - 15:00 | Session 3: Maintenance (talks and panel)Doctoral Symposium at 212 Chair(s): Alexander Serebrenik Eindhoven University of Technology | ||
14:00 6mTalk | Concern-based Management of Software Design Complexity Doctoral Symposium Jason Lefever Drexel University | ||
14:06 6mTalk | Mitigating Waste That Tacitly Accrues in Continuous Integration Pipelines Doctoral Symposium Nimmi Rashinika Weeraddana University of Waterloo Pre-print | ||
14:12 6mTalk | Automated Detection and Refactoring of Mock Clones in Java Projects Doctoral Symposium Gengwu Zhao Stevens Institute of Technology | ||
14:18 6mTalk | Practical Preprocessing of Logs at Scale Doctoral Symposium JianChen Zhao University of Waterloo | ||
14:24 6mTalk | Bridging the Gap Between Log Parsing Techniques and Practitioners: Challenges and Solutions Doctoral Symposium Hetong Dai University of Waterloo | ||
14:30 30mPanel | Panel: Maintenance Doctoral Symposium Sridhar Chimalakonda Indian Institute of Technology Tirupati, Wesley Assunção Johannes Kepler University Linz, Hetong Dai University of Waterloo, Jason Lefever Drexel University, Nimmi Weeraddana University of Waterloo, JianChen Zhao University of Waterloo, Gengwu Zhao Stevens Institute of Technology |