pp. 4029-4037
S&M3094 Research Paper of Special Issue https://doi.org/10.18494/SAM4059 Published: November 16, 2022 Swin Transformer UNet for Very High Resolution Image Dehazing [PDF] Yuxin Bian, Enguang Zhang, Jiayan Wang, Rixin Xie, and Shenlu Jiang (Received July 29, 2022; Accepted September 22, 2022) Keywords: image dehazing, VHR image processing, deep learning, transformer, UNet
Rapid image acquisition for a region affected by an earthquake is important to manage the rescue operation. The use of an unmanned aerial vehicle (UAV) to rapidly cruise an affected region and obtain very high resolution (VHR) images is highly advantageous. However, haze is a problem for many UAV aerial images, especially when UAVs cross clouds. In this paper, we present a parallel predicting workflow that cooperates with Swin Transformer UNet (ST-UNet) for this task. ST-UNet utilizes the Swin Transformer instead of a convolutional layer (CNN), which greatly enhances the processing speed without accuracy loss. The predicting workflow employs parallel processing and a reasonable data structure to maximize the computing resources for rapid processing. To demonstrate the advantageousness of the proposed workflow, we employed three public remote sensing datasets for evaluation, and the proposed ST-UNet obtained the highest accuracy and speed. Furthermore, the high dehazing performance of ST-UNet was demonstrated using a real post-earthquake scene.
Corresponding author: Shenlu JiangThis work is licensed under a Creative Commons Attribution 4.0 International License. Cite this article Yuxin Bian, Enguang Zhang, Jiayan Wang, Rixin Xie, and Shenlu Jiang, Swin Transformer UNet for Very High Resolution Image Dehazing, Sens. Mater., Vol. 34, No. 11, 2022, p. 4029-4037. |