Loading [MathJax]/extensions/MathZoom.js
LogGenius: An Unsupervised Log Parsing Framework with Zero-shot Prompt Engineering | IEEE Conference Publication | IEEE Xplore

LogGenius: An Unsupervised Log Parsing Framework with Zero-shot Prompt Engineering


Abstract:

Efficient and accurate parsing of unstructured logs is crucial for anomaly detection, root cause localization, and log compression. Although many existing works have made...Show More

Abstract:

Efficient and accurate parsing of unstructured logs is crucial for anomaly detection, root cause localization, and log compression. Although many existing works have made good progress relying on Large Language Models (LLMs) and prompt engineering techniques, most of them require a certain degree of labeling or few-shot prompts, which limits their applicability in large-scale real-time heterogeneous log environments. To tackle this issue, we develop LogGenius, a novel unsupervised log parsing framework. It initially enriches the diversity of the parsed logs by leveraging generative LLMs with zero-shot prompts. It then employs an unsupervised parsing model on the augmented log data to accomplish log parsing. In order to alleviate the impact of potential hallucination issues caused by generative LLMs, we conduct a meticulous analysis and summarize the biases inherent in LLMs when directly applying them to generate diversified logs. Building upon these insights, we propose an effective log diversity augmentation algorithm to mitigate the aforementioned concerns.We thoroughly evaluate LogGenius based on various open-source system runtime log datasets and a new alarm log dataset from a commercial cloud production environment. The experimental results demonstrate that LogGenius can improve the parsing accuracy by up to about 30%, and the parsing accuracy in unseen logs by up to about 100%, compared to the state-of-the-art unsupervised-based methods.
Date of Conference: 07-13 July 2024
Date Added to IEEE Xplore: 15 October 2024
ISBN Information:

ISSN Information:

Conference Location: Shenzhen, China

Funding Agency:

No metrics found for this document.

I. Introduction

As a kind of textual data, logs have become an indispensable component in various computer software systems. To better record system status information, logs tend to be more unstructured [1]. Automatically parsing these unstructured logs is crucial for tasks like spotting system anomalies [2], [3], identifying root causes of failures [4], [5], mining log patterns, improving compression rates [6], [7], and enhancing security diagnostics [8], [9].

Usage
Select a Year
2025

View as

Total usage sinceOct 2024:211
010203040JanFebMarAprMayJunJulAugSepOctNovDec30370000000000
Year Total:67
Data is updated monthly. Usage includes PDF downloads and HTML views.
Contact IEEE to Subscribe

References

References is not available for this document.