3CBench: A Unified Benchmarking Framework for the Computing Capacity of Heterogeneous AI Clusters

Weixing Zhang
Xizhi Wang
Jun Yan
Jiasun Feng
Yiying Liu
Haiyan Li
Qun Chen
Zhe Tang
Xin Cui
Fei Yang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The rapid evolution of Artificial Intelligence has driven the demand for extensivecomputational resources and the deployment of AI tasks across heterogeneouscomputing platforms. However, existing benchmarking systems face several chal-lenges, including limited compatibility with diverse hardware, insufficient supportfor varied deep learning frameworks and tasks, and a lack of comprehensive eval-uation metrics for the computing capacities. To address these issues, we propose3CBench, a unified benchmarking framework designed for heterogeneous AI clus-ters. Featuring a modular architecture encompassing environment management,task execution, and metrics analysis, 3CBench provides automated workflows andensures seamless compatibility with diverse GPU architectures and deep learningframeworks. It provides a comprehensive evaluation metrics system to rigorouslyassess computational performance and stability across both transformer-basedlarge language models and convolutional neural networks, thereby covering dom-inant deep learning architectures. Extensive experiments demonstrate 3CBench’sscalability on heterogeneous AI clusters, compatibility with various deep learningframeworks and tasks, and the support for a wide range of applications. Addition-ally, 3CBench aids in problem diagnosis during the development process of GPUvendors. These features establish 3CBench as a robust tool for benchmarking,optimization, and system-level evaluation in heterogeneous AI clusters.

Version published to 10.21203/rs.3.rs-7602328/v1 on Research Square
Oct 9, 2025

Characterization of high-resolution AI data center training workloads on single and multiple GPU nodes

This article has 3 authors:
1. Ahmed Abd Elaziz Elsayed
2. Abdullah Azhar Al-Obaidi
3. Hany E.Z. Farag
This article has no evaluationsLatest version Oct 29, 2025
Adaptive Dataflow and Precision Optimization for Deep Learning on Configurable Hardware Architectures

This article has 3 authors:
1. Gulnaz Rati
2. Rafael Mendes
3. Aisha Noor
This article has no evaluationsLatest version Oct 8, 2025
Optimizing Deep Learning Architectures forEnhanced Computational Efficiency

This article has 2 authors:
1. Ying Wang
2. Hui Li
This article has no evaluationsLatest version Sep 22, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Characterization of high-resolution AI data center training workloads on single and multiple GPU nodes

Adaptive Dataflow and Precision Optimization for Deep Learning on Configurable Hardware Architectures

Optimizing Deep Learning Architectures forEnhanced Computational Efficiency