Abstract:
The container resource autoscaling technique provides scalability to cloud services composed of microservice architecture in a cloud-native computing environment. However...Show MoreMetadata
Abstract:
The container resource autoscaling technique provides scalability to cloud services composed of microservice architecture in a cloud-native computing environment. However, the service efficiency is reduced as the scaling is delayed because dynamic loads occur with various workload patterns. Furthermore, estimating the efficient resource size for the workload is difficult, resulting in resource waste and overload. Therefore, this study proposes high-performance resource management (HiPerRM), which stably and elastically manages container resources to ensure service scalability and efficiency even under rapidly changing dynamic loads. HiPerRM forecasts future workloads using a sample convolutional and interaction network (SCINet) model applied with the reversible instance normalization (RevIN) method. HiPerRM generates a resource request with an elastic size based on the forecasted CPU and memory usage, and then efficiently adjusts the pod's resource request and the number of replicas via HiPerRM's VPA (Hi-VPA) and HiPerRM's HPA (Hi-HPA). As a result of evaluating the performance of HiPerRM, the average resource utilization was improved by approximately 3.96–34.06% compared to conventional autoscaling techniques, even when the resource size was incorrectly estimated for various workloads, and there were relatively fewer overloads.
Published in: IEEE Transactions on Cloud Computing ( Volume: 11, Issue: 4, Oct.-Dec. 2023)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Scalable ,
- Resource Management ,
- Resource Utilization ,
- Dynamic Loading ,
- Memory Usage ,
- Requests For Resources ,
- Average Use ,
- Microservices ,
- CPU Usage ,
- Reaction Mechanism ,
- Forecasting ,
- Deep Learning Models ,
- Mean Absolute Error ,
- Long Short-term Memory ,
- Input Sequence ,
- Low Resource ,
- High Resource ,
- Resource Costs ,
- Resource Usage ,
- Mean Absolute Percentage Error ,
- High Resource Utilization ,
- Usage Metrics ,
- Worker Nodes ,
- Master Node ,
- Bidirectional Long Short-term Memory ,
- Temporal Convolutional Network ,
- Future Resource ,
- Decision Tree Regression ,
- Control Plane ,
- Autoregressive Integrated Moving Average
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Scalable ,
- Resource Management ,
- Resource Utilization ,
- Dynamic Loading ,
- Memory Usage ,
- Requests For Resources ,
- Average Use ,
- Microservices ,
- CPU Usage ,
- Reaction Mechanism ,
- Forecasting ,
- Deep Learning Models ,
- Mean Absolute Error ,
- Long Short-term Memory ,
- Input Sequence ,
- Low Resource ,
- High Resource ,
- Resource Costs ,
- Resource Usage ,
- Mean Absolute Percentage Error ,
- High Resource Utilization ,
- Usage Metrics ,
- Worker Nodes ,
- Master Node ,
- Bidirectional Long Short-term Memory ,
- Temporal Convolutional Network ,
- Future Resource ,
- Decision Tree Regression ,
- Control Plane ,
- Autoregressive Integrated Moving Average
- Author Keywords