Leveraging Single Tasks for Better Generalization of Multitask Gaussian Process on Multivariate Time Series

Abstract

By leveraging the knowledge of separate single tasks, we propose a simple and principled algorithm for the multitask Gaussian process (GP), called stochastic hyperparameter averaging (SHA), to obtain better generalization. Specifically, we focus on multivariate time series learning to improve generalization in both extrapolation and interpolation. The knowledge of a single task is extracted by a GP trained separately on one task-specific dimension of a multivariate time series. This single-task GP (STGP) uses the same kernel as the latent functions of the multitask GP. By averaging the hyperparameters of the separate STGPs to initialize the latent functions of the multitask GP, SHA identifies solutions significantly better than those found by popular training methods, while requiring only a few training steps of the STGPs. SHA is kernel agnostic, remarkably straightforward to implement, and enhances generalization performance. SHA attains a significant boost in test accuracy across diverse multivariate time series tasks, demonstrating strong interpolation and extrapolation, robustness across varying model complexities, and insensitivity to different hyperparameter initializations.
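The core averaging step described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the hyperparameter names (`lengthscale`, `signal_var`, `noise_var`) and their values are placeholders standing in for whatever the STGP kernels actually expose, and a plain arithmetic mean is assumed as the averaging rule.

```python
import numpy as np

# Hypothetical RBF-kernel hyperparameters obtained from a few training
# steps of separate single-task GPs, one dict per task-specific dimension
# of the multivariate time series (placeholder values, not from the paper).
stgp_hypers = [
    {"lengthscale": 0.8, "signal_var": 1.2, "noise_var": 0.05},
    {"lengthscale": 1.1, "signal_var": 0.9, "noise_var": 0.08},
    {"lengthscale": 0.6, "signal_var": 1.5, "noise_var": 0.04},
]

def average_hyperparameters(hypers):
    """Average each kernel hyperparameter across the single-task GPs.

    The result is used to initialize the shared latent-function kernels
    of the multitask GP before its own training begins.
    """
    keys = hypers[0].keys()
    return {k: float(np.mean([h[k] for h in hypers])) for k in keys}

init = average_hyperparameters(stgp_hypers)
# `init` would then seed every latent-function kernel of the multitask GP.
```

The appeal of this initialization, as the abstract notes, is that it is kernel agnostic: whatever hyperparameters the chosen kernel has, the same per-parameter averaging applies.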
