Adaptive Pulsating Workload Scheduling for Cosmic Simulation: A Thermal-Aware Distributed Computing Architecture

xiaochen xiao

Abstract

We propose an adaptive pulsating workload scheduling architecture for cosmic simulations, which dynamically balances computational intensity with thermal and memory bandwidth efficiency in distributed GPU clusters. The core innovation lies in a self-regulating algorithm inspired by pulsating heat pipe dynamics, where workload intensity alternates between high and low phases to maintain optimal operating conditions. The scheduler integrates real-time thermal feedback from infrared sensors and on-die probes, adjusting task parallelism and frequency scaling to prevent overheating while meeting computational deadlines. Moreover, a memory bandwidth optimizer dynamically switches between strided prefetching and cache-aware data reorganization, further improving efficiency. The system employs a transformer-based reinforcement learning agent to predict optimal pulsation cycles, minimizing energy consumption, deadline misses, and thermal violations. Unlike conventional static schedulers, our method achieves significant improvements in both performance and hardware longevity, particularly for magnetohydrodynamic cosmology and dark matter distribution simulations. Experimental results demonstrate that the proposed architecture reduces energy consumption by up to 27% while maintaining 98% deadline adherence under thermal constraints. This work bridges the gap between high-performance computing and sustainable resource utilization, offering a scalable solution for next-generation cosmic simulations.
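
To make the idea of alternating high and low workload phases concrete, the following is a minimal sketch and not the authors' scheduler. It replaces the transformer-based reinforcement learning agent described in the abstract with a simple two-threshold (hysteresis) rule, and every name and number in it (`PulseConfig`, `next_phase`, `schedule`, the temperature thresholds, the parallelism levels) is a hypothetical choice made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PulseConfig:
    # All values below are illustrative assumptions, not figures from the article.
    t_high: float = 80.0        # at or above this temperature (deg C), drop to the low phase
    t_low: float = 65.0         # at or below this temperature (deg C), return to the high phase
    high_parallelism: int = 8   # concurrent tasks during the high-intensity phase
    low_parallelism: int = 2    # concurrent tasks during the low-intensity phase


def next_phase(phase: str, temp_c: float, cfg: PulseConfig) -> str:
    """Return the next phase given the current phase and a temperature reading."""
    if phase == "high" and temp_c >= cfg.t_high:
        return "low"
    if phase == "low" and temp_c <= cfg.t_low:
        return "high"
    # Between the two thresholds the current phase is kept (hysteresis).
    return phase


def schedule(temps, cfg: PulseConfig = PulseConfig(), start: str = "high"):
    """Map a sequence of temperature readings to (phase, parallelism) pairs."""
    phase = start
    plan = []
    for temp_c in temps:
        phase = next_phase(phase, temp_c, cfg)
        level = cfg.high_parallelism if phase == "high" else cfg.low_parallelism
        plan.append((phase, level))
    return plan


if __name__ == "__main__":
    readings = [60, 75, 82, 78, 70, 64, 66]
    for temp_c, (phase, level) in zip(readings, schedule(readings)):
        print(f"{temp_c:5.1f} C -> {phase:4s} phase, {level} parallel tasks")
```

The gap between the two thresholds keeps the controller from flipping phases on every small temperature fluctuation. A learned policy such as the one the abstract describes would presumably choose the cycle timing from predicted load and thermal state instead of fixed thresholds.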

Version published to 10.21203/rs.3.rs-6504227/v1 on Research Square
Apr 23, 2025

Related articles

Adaptive Mesh Refinement with Dynamic Load Balancing for Scalable Cosmological Simulations: A Hybrid Meta-Heuristic Approach

This article has 1 author:
1. xiaochen xiao
This article has no evaluations. Latest version: Apr 23, 2025

Efficient Resource Management in Edge Computing for Autonomous Systems with An Energy-aware Approach

This article has 1 author:
1. Machha Narender
This article has no evaluations. Latest version: May 2, 2025

Physics-Inspired Single-Particle Tracking Accelerated with Parallelism

This article has 2 authors:
1. Lance W.Q. Xu
2. Steve Pressé
This article has no evaluations. Latest version: Jun 3, 2025
