A Comparative Study of Time–Frequency Representations for Bearing and Rotating Fault Diagnosis Using Vision Transformer
Abstract
This study presents a comparative analysis of bearing and rotating component fault classification based on different time–frequency representations, using a ViT-Base model. Four time–frequency transformation techniques (Short-Time Fourier Transform (STFT), Continuous Wavelet Transform (CWT), Hilbert-Huang Transform (HHT), and Wigner-Ville Distribution (WVD)) were applied to convert the signals into 2D images. A pre-trained ViT-Base architecture was then fine-tuned on the resulting images for classification. The model was evaluated on two separate tasks: (i) eight-class rotating component fault classification and (ii) four-class bearing fault classification. Importantly, in each task the samples were collected under varying conditions of the other component (i.e., different rotating component conditions in bearing classification, and vice versa). This design allowed for an independent assessment of the model’s ability to generalize across fault domains. The experimental results demonstrate that the ViT-based approach achieves high classification performance across the time–frequency representations, highlighting its potential for mechanical fault diagnosis in rotating machinery. Notably, the model achieved higher accuracy in bearing fault classification than in rotating component fault classification, suggesting a higher sensitivity to bearing-related anomalies.
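To illustrate the signal-to-image step described above, the following is a minimal sketch of one of the four representations (STFT) implemented with NumPy only. The abstract does not state the window length, hop size, sampling rate, or image size, so the values used here (a 256-sample Hann window, a hop of 64, a 12 kHz synthetic signal, and a 224 × 224 output) are assumptions chosen for illustration, as is the function name `stft_image`. It should not be read as the authors' implementation.

```python
import numpy as np


def stft_image(signal, win_len=256, hop=64, out_size=224):
    """Convert a 1D signal into a square uint8 log-magnitude STFT image.

    All parameter defaults are illustrative assumptions, not values
    taken from the study.
    """
    window = np.hanning(win_len)
    n_frames = 1 + (len(signal) - win_len) // hop

    # Slice the signal into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop : i * hop + win_len] * window for i in range(n_frames)]
    )

    # Magnitude spectrum per frame, arranged as (frequency bins, time frames).
    spec = np.abs(np.fft.rfft(frames, axis=1)).T

    # Log scaling compresses the dynamic range; the offset avoids log(0).
    log_spec = 20.0 * np.log10(spec + 1e-10)

    # Min-max normalise to the range [0, 1].
    lo, hi = log_spec.min(), log_spec.max()
    norm = (log_spec - lo) / (hi - lo + 1e-12)

    # Nearest-neighbour resize to out_size x out_size.
    rows = np.linspace(0, norm.shape[0] - 1, out_size).round().astype(int)
    cols = np.linspace(0, norm.shape[1] - 1, out_size).round().astype(int)
    resized = norm[np.ix_(rows, cols)]

    return (resized * 255).astype(np.uint8)


# Synthetic stand-in for a vibration signal: two tones sampled at 12 kHz.
fs = 12_000
t = np.arange(fs) / fs
signal = np.sin(2 * np.pi * 50 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)

image = stft_image(signal)

# Replicate the single channel three times, since pre-trained ViT
# checkpoints typically expect three-channel input.
rgb = np.repeat(image[..., None], 3, axis=-1)
```

The resulting `rgb` array could then be passed through whatever image preprocessing the chosen pre-trained ViT-Base checkpoint requires. The CWT, HHT, and WVD variants would replace only the transform step, leaving the normalisation and resizing unchanged.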