XTorch: A High-Performance C++ Framework for Deep Learning Training

Kamran Saberifard


Abstract

The deep learning ecosystem is predominantly driven by high-level Python frameworks like PyTorch and TensorFlow, which offer exceptional flexibility and ease of use. However, the reliance on a Python front-end can introduce significant performance overhead, particularly in data-intensive training pipelines, often necessitating multi-GPU setups to achieve acceptable training times. This paper introduces XTorch, a high-level C++ deep learning framework built atop LibTorch, designed to bridge the gap between Python’s usability and C++’s raw performance. XTorch provides a familiar API for datasets, transforms, and models while eliminating Python-related bottlenecks. We demonstrate its efficacy by training a Deep Convolutional Generative Adversarial Network (DCGAN) on the CelebA dataset. Our results show that XTorch, running on a single NVIDIA RTX 3090 GPU, completes a 5-epoch training run in 219 seconds. This represents a 37% reduction in training time relative to a standard PyTorch implementation, which required 350 seconds using two RTX 3090 GPUs with DataParallel. This work validates that a native C++ framework can not only match but significantly outperform common multi-GPU Python setups, offering a compelling case for reducing hardware costs and accelerating research and deployment.
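As a quick check of the headline figure, the two timings quoted in the abstract can be compared directly. The sketch below uses only the reported 219 s and 350 s. The GPU-seconds line is an added assumption, not a figure from the paper: it simply multiplies wall-clock time by the number of GPUs as a crude cost measure.

```latex
\begin{align*}
\text{time reduction} &= \frac{350 - 219}{350} = \frac{131}{350} \approx 0.374 \quad (\approx 37\%) \\
\text{wall-clock speedup} &= \frac{350}{219} \approx 1.60\times \\
\text{GPU-seconds ratio (assumed cost measure)} &= \frac{2 \times 350}{1 \times 219} = \frac{700}{219} \approx 3.2\times
\end{align*}
```

Read this way, the 37% figure is the relative reduction in wall-clock time, and the same numbers correspond to roughly a 1.6x speedup.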

Version published to 10.20944/preprints202507.0540.v1
Jul 7, 2025

Related articles

PUNet: A Lightweight Parallel U-Net Architecture Integrating Mamba-CNN for High-Precision Image Segmentation

This article has 5 authors:
1. Zhaoyan Xie
2. Xiaowei Li
3. Hongyao Ma
4. Sihao Wu
5. Dayou Cui
This article has no evaluations
Latest version May 29, 2025

Deep Learning 2.0.1: Mind and Cosmos - Towards Cosmos-Inspired Interpretable Neural Networks

This article has 1 author:
1. Taha Bouhsine
This article has no evaluations
Latest version Jun 25, 2025

ViT-StyleGAN2-ADA for Limited-Data Training

This article has 3 authors:
1. Md Mahabubur Rahman
2. Biwei Chen
3. Hui Zeng
This article has no evaluations
Latest version Jul 3, 2025
