|
pp. 5141-5162
S&M4237 Research Paper https://doi.org/10.18494/SAM5950 Published: November 26, 2025 Gas Leak Detection in Industrial Air Compressors Using Vision Transformer with Multistage Transfer Learning [PDF] Hsuan-Chao Huang, Yuh-Shihng Chang, and Zheng-Yu Ku (Received September 25, 2025; Accepted October 28, 2025) Keywords: vision transformer, air compressor leak detection, transfer learning, audio feature representation, Mel spectrogram
Recently, machine learning techniques, particularly deep learning models, have been increasingly applied to the analysis, optimization, and prediction of system performance in industrial air compressor leak detection. In this study, we integrated such models for effective gas leak detection based on sensor-derived data, thereby supporting the development of more accurate and efficient factory leak sensing systems to reduce manufacturing costs. We investigated the use of Vision Transformer (ViT) models with transfer learning for gas leak detection in industrial air compressors, based on both audio and visual data. A labeled dataset was created using recordings of compressor sounds under leak and nonleak conditions. Using the intensity stereo localization method with microphone arrays, we estimated the sound intensity and source location. A multistage transfer learning strategy was adopted: ViT models pretrained on ImageNet were adapted with the Environmental Sound Classification (ESC-50) and fine-tuned on real leak data. Among four audio representations, Mel spectrograms achieved the highest accuracy (80%), making them most effective for ViT-based leak detection.
Corresponding author: Yuh-Shihng Chang![]() ![]() This work is licensed under a Creative Commons Attribution 4.0 International License. Cite this article Hsuan-Chao Huang, Yuh-Shihng Chang, and Zheng-Yu Ku, Gas Leak Detection in Industrial Air Compressors Using Vision Transformer with Multistage Transfer Learning, Sens. Mater., Vol. 37, No. 11, 2025, p. 5141-5162. |