Loading [MathJax]/extensions/MathZoom.js
SparseRCA: Unsupervised Root Cause Analysis in Sparse Microservice Testing Traces | IEEE Conference Publication | IEEE Xplore

SparseRCA: Unsupervised Root Cause Analysis in Sparse Microservice Testing Traces


Abstract:

Microservice architecture has become a predominant paradigm in the software industry. This architecture necessitates robust end-to-end testing to ensure seamless integrat...Show More

Abstract:

Microservice architecture has become a predominant paradigm in the software industry. This architecture necessitates robust end-to-end testing to ensure seamless integration of all components before deployment. Rapidly pinpointing issues when test cases fail is crucial for enhancing software development efficiency. However, in testing environments, the available trace is often sparse, and the system is continuously upgrading, which renders existing microservice-based root cause analysis (RCA) ineffective. To address these challenges, we propose SparseRCA. By assessing the abnormality of the exclusive latency, SparseRCA directly determines the probability of the root cause, solving the challenge of not being able to fully obtain the fault propagation information, such as call relationships in sparse trace scenarios. At the same time, by reconstructing the exclusive latency using the decoupled atomic span units, it solves the problem of latency prediction for new traces caused by frequent upgrades. We evaluate SparseRCA on real-world datasets from a large e-commerce system’s testing environment, where it demonstrates significant improvements over existing models. Our findings underscore the effectiveness of SparseRCA in addressing the challenges of RCA in microservice testing environments.
Date of Conference: 28-31 October 2024
Date Added to IEEE Xplore: 03 December 2024
ISBN Information:

ISSN Information:

Conference Location: Tsukuba, Japan

Funding Agency:


I. Introduction

The microservice architecture, characterized by its design dividing large systems into smaller, self-contained components that communicate with each other, has emerged as a predominant architectural paradigm in the software industry, enhancing scalability and flexibility in large-scale applications [1]–[5].

Contact IEEE to Subscribe

References

References is not available for this document.