Aiming at the complexity of features in remote sensing images and the existence of an elongated and continuous distribution of roads that are easy to obscure, a Road Extraction Model for Remote Sensing Images Combining Attention and Context Fusion (ACFD-LinkNet) was proposed. The network is based on the D-LinkNet network. Firstly, a strip attention module was used in the codec part of the D-LinkNet network to enhance the feature extraction capability of roads at different scales, to better capture the global features of the roads, and to capture the long-distance information of the roads. Secondly, a Context Fusion Module (CFM) was proposed and added to the feature delivery part of the network codec to predict road connections between neighboring pixels, fusing road information between different layers of the context to solve the problem of obstacle obstruction interfering with road connections. Finally, the cross-entropy loss function and Dice loss function of the improved model were set up with multiple loss function hyperparameter weight assignments to solve the dataset positive and negative sample inhomogeneity, and the optimal segmentation accuracy was obtained by adjusting the weight ratios. Experiments on the DeepGlobe and CHN6-CUG datasets resulted in F1 values of 86.76% and 92.12% for the composite metrics, respectively, which is an improvement of 3.96% and 1.13% compared to the D-LinkNet model, in addition to optimal performance compared to semantic segmentation methods such as Unet, Deeplabv3+, A2-FPN, etc.
实验采用精确度()、召回率(Recall)、F1分数(F1-score)和平均交并比(Mean intersection over union,mIoU)4个指标进行模型有效性评估。精确度表示预测为道路的像素中真正的道路像素比例;召回率表示模型正确预测的真实道路像素比例;平均交并比衡量预测和真实道路区域的重叠程度;F1分数是精确度和召回率的调和平均数,能反映模型的总体性能。
ChenR, LiX, HuY, et al. Road extraction from remote sensing images in wildland-urban interface areas[J]. IEEE Geoscience and Remote Sensing Letters, 2020, 19: 1-5.
[2]
ZhaoK, LiuJ, WangQ, et al. Road damage detection from post-disaster high-resolution remote sensing images based on tld framework[J]. IEEE Access, 2022, 10: 43552-43561.
[3]
ZhangX, JiangY, WangL, et al. Complex mountain road extraction in high-resolution remote sensing images via a light roadformer and a new benchmark[J]. Remote Sensing, 2022, 14(19): No.4729.
TanGuo-jin, JiOu, AiYong-ming, et al. Bridge crack image segmentation method based on improved DeepLabv3+ model[J]. Journal of Jilin University (Engineering and Technology Edition), 2024,54(1):173-179.
LiuYang, MaoKe-ming. Retrieval of texture feature information data for small differentiated images based on adaptive feedback mechanism[J]. Journal of Jiangsu University(Natural Science Edition), 2025, 46(1): 73-81.
YangYang, HeTong-yao, ZhanYong-zhao, et al. Depth image enhancement method based on soft clustering[J]. Journal of Jiangsu University(Natural Science Edition), 2024, 45(2): 183-190.
[12]
LongJ, ShelhamerE, DarrellT. Fully convolutional networks for semantic segmentation[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston,USA,2015: 3431-3440.
[13]
WangS, MuX, YangD, et al. Road extraction from remote sensing images using the inner convolution integrated encoder-decoder network and directional conditional random fields[J]. Remote Sensing, 2021, 13(3): No.465.
[14]
RonnebergerO, FischerP, BroxT. U-net: Convolutional networks for biomedical image segmentation[C]∥18th International Conference on Medical Image Computing and Computer-assisted Intervention-MICCAI, Munich, Germany, 2015: 234-241.
[15]
ZhouL, ZhangC, WuM. D-LinkNet: LinkNet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City,USA, 2018: 182-186.
[16]
ChaurasiaA, CulurcielloE. Linknet: Exploiting encoder representations for efficient semantic segmentation[C]∥2017 IEEE Visual Communications and Image Processing, St. Petersburg,USA,2017: 1-4.
[17]
WuK, CaiF. Dual Attention D-LinkNet for road segmentation in remote sensing images[C]∥2022 IEEE 14th International Conference on Advanced Infocomm Technology, Chongqing, China,2022: 304-307.
[18]
KampffmeyerM, DongN, LiangX, et al. ConnNet: a long-range relation-aware pixel-connectivity network for salient segmentation[J]. IEEE Transactions on Image Processing, 2018, 28(5): 2518-2529.
[19]
MajiD, SigedarP, SinghM. Attention Res-UNet with guided decoder for semantic segmentation of brain tumors[J]. Biomedical Signal Processing and Control, 2022, 71: No.103077.
[20]
DaiL, ZhangG, ZhangR. RADANet: road augmented deformable attention network for road extraction from complex high-resolution remote-sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61: 1-13.
[21]
SongQ, MeiK, HuangR. AttaNet: Attention-augmented network for fast and accurate scene parsing[C]∥Proceedings of the AAAI Conference on Artificial Intelligence, Menlo Park,USA,2021, 35(3): 2567-2575.
[22]
OuyangD, HeS, ZhangG, et al. Efficient multi-scale attention module with cross-spatial learning[C]∥ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island,Greece,2023: 1-5.
[23]
ChenL C, ZhuY, PapandreouG, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C]∥Proceedings of the European Conference on Computer Vision, Munich,Germany,2018: 801-818.
[24]
ZhouG, ChenW, GuiQ, et al. Split depth-wise separable graph-convolution network for road extraction in complex environments from high-resolution remote-sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 60: 1-15.
[25]
LiR, WangL, ZhangC, et al. A2-FPN for semantic segmentation of fine-resolution remotely sensed images[J]. International Journal of Remote Sensing, 2022, 43(3): 1131-1155.
[26]
RuanJ, XieM, GaoJ, et al. Ege-unet: an efficient group enhanced unet for skin lesion segmentation[C]∥International Conference on Medical Image Computing and Computer-Assisted Intervention,Vancouver,Canada,2023: 481-490.
[27]
RuanJ, XiangS. Vm-unet: Vision mamba unet for medical image segmentation[J/OL].[2024-03-25].