Forecasting Workload in Cloud Computing: Towards Uncertainty-Aware Predictions and Domain Generalization

Andrea Rossi
Andrea Visentin
Diego Carraro
Steven Prestwich
Kenneth N. Brown

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Predicting future resource demand in Cloud Computing is essential for optimizing the trade-off between serving customers' requests efficiently and minimizing the provisioning cost. Modelling prediction uncertainty is also desirable to better inform the resource decision-making process, but research in this field is under-investigated. In this paper, we propose univariate and bivariate Bayesian deep learning models that provide predictions of future workload demand and its uncertainty. We run extensive experiments on Google and Alibaba clusters, where we first train our models with datasets from different cloud providers and compare them with LSTM-based baselines. Results show that modelling the uncertainty of predictions has a positive impact on performance, especially on service level metrics, because uncertainty quantification can be tailored to desired target service levels that are critical in cloud applications. Moreover, we investigate whether our models benefit transfer learning capabilities across different domains, i.e.\dataset distributions. Experiments on the same workload datasets reveal that acceptable transfer learning performance can be achieved within the same provider (because distributions are more similar). Also, domain knowledge does not transfer when the source and target domains are very different (e.g.\from different providers), but this performance degradation can be mitigated by increasing the training set size of the source domain.

Version published to 10.21203/rs.3.rs-4934203/v1 on Research Square
Sep 16, 2024

Enhancing HPC Job Run Time Predictions leveraging Machine Learning, Historical Job Data, and Metaheuristic Optimization

This article has 4 authors:
1. Suja Ramachandran
2. M. L. Jayalal
3. M. Vasudevan
4. R. Jehadeesan
This article has no evaluationsLatest version Dec 15, 2025
Intelligent 5G Network Performance Optimization through Gradient Boosting

This article has 2 authors:
1. Mohammed Al-Hubaishi
2. Abdulkader Alabdullah
This article has no evaluationsLatest version Jan 6, 2026
A Survey on Robust Sequential Recommendation: Fundamentals, Challenges, Taxonomy, and Future Directions

This article has 5 authors:
1. Yatong Sun
2. Xiaochun Yang
3. Bin Wang
4. Yan Wang
5. Zhu Sun
This article has no evaluationsLatest version Jan 28, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Enhancing HPC Job Run Time Predictions leveraging Machine Learning, Historical Job Data, and Metaheuristic Optimization

Intelligent 5G Network Performance Optimization through Gradient Boosting

A Survey on Robust Sequential Recommendation: Fundamentals, Challenges, Taxonomy, and Future Directions