Latency-Oriented Elastic Memory Management at Task-Granularity for Stateful Streaming Processing

Abstract:

In a streaming application, an operator is usually instantiated into multiple tasks for parallel processing. Tasks across operators have different memory demands because their processing logic differs (e.g., stateful vs. stateless tasks). The memory demands of tasks from the same operator can also vary and fluctuate due to workload variability. Improper memory provisioning causes some tasks to have relatively high latency, or even unbounded latency that can eventually lead to system instability. We found that the task with the maximum latency within an operator has a significant and even decisive impact on the end-to-end latency. In this paper, we present our task-level memory manager. Based on our quantitative modeling of memory and task-level latency, the manager adaptively allocates an optimal memory size to each task to minimize the end-to-end latency. We integrate our memory management into Apache Flink. The experiments show that, compared to the Flink native setting, our memory management significantly reduces end-to-end latency for various applications at different scales and configurations.

Published in: IEEE INFOCOM 2023 - IEEE Conference on Computer Communications

Date of Conference: 17-20 May 2023

Date Added to IEEE Xplore: 29 August 2023

DOI: 10.1109/INFOCOM53939.2023.10228963

Conference Location: New York City, NY, USA

I. Introduction

Streaming Processing Engines (SPEs) [13], [26], [34], [37] have emerged to provide quick analysis for applications that need real-time responses and whose input data arrives as streams, such as quantitative finance, network monitoring, and alert triggering. These applications typically require consistently short end-to-end (E2E) latency, i.e., the time from when the data is generated to when the result is output. Short E2E latency can bring about a smooth user experience, service-level agreement guarantees, and substantial profits in the financial market. For example, autonomous high-frequency algorithmic trading [36] must adjust promptly to market fluctuations; otherwise, traders may miss trading opportunities or even incur losses.
