基于GAN与超分辨率重建的裂缝图像增强技术研究
Research on Crack Image Enhancement Technology Based on GAN and Super-resolution Reconstruction
-
摘要: 针对基于深度学习的裂缝识别任务中样本数据不足、图像质量较低的问题,开展基于生成对抗网络(Generative Adversarial Network,GAN)与超分辨率(Super-resolution,SR)重建的裂缝图像增强技术研究。首先,通过对Tunnel Crack、CFD公开裂缝数据集进行筛选与清洗,建立基础裂缝数据集;其次,采用镜像变换、亮度调整、随机遮掩等方法扩充样本集,并利用SA-BAGAN-GP模型进行对抗生成,结合自注意力机制有效提升生成图像的结构一致性和裂缝纹理的真实性;再次,针对生成图像细节模糊的问题,引入基于Real-ESRGAN的超分辨率重建算法实现对裂缝边缘与细节的显著增强;最后,利用Labelme进行数据标注,构建包含2400张裂缝图像及对应标签的高质量数据集,为裂缝识别算法的优化提供可靠的数据支撑。Abstract: To address the issues of insufficient sample data and poor image quality in the crack recognition task based on deep learning, a study on crack image enhancement technology using generative adversarial network (GAN) and superresolution (SR) reconstruction is conducted. Firstly, a basic crack database is established using the public crack datasets from tunnel crack and CFD. Secondly, the sample set is expanded by methods such as mirror transformation, brightness adjustment, and random masking, and the model is improved using SA-BAGAN-GP for adversarial generation. The self-attention mechanism is combined to effectively enhance the structural consistency and authenticity of the generated images. Thirdly, to address the problem of blurry details in the generated images, a super-resolution reconstruction algorithm based on Real-ESRGAN is introduced to significantly enhance the crack edges and details. Finally, the data is labeled using labelme to construct a high-quality dataset containing 2400 crack images and corresponding labels, providing reliable data support for the optimization of the crack recognition algorithm.
下载: