I. Introduction
Streaming Processing Engines (SPEs) [13], [26], [34], [37] have emerged to provide quick analysis for applications that need real-time responses and whose input data is in the form of streams, such as quantitative finance, network monitoring, and alert triggering. These applications typically require consistently short end-to-end (E2E) latency, the time from the data generated to the result is output. Short E2E latency can bring about a smooth user experience, service-level agreements guarantee, and huge profits in the financial market. For example, autonomous high-frequency algorithmic trading [36] must make timely adjustments to market fluctuations, or traders may lose trading opportunities or even incur losses.