To address issues such as low recognition of lesion areas in chest X-ray images and the difficulty in accurately capturing the spatial positions of lesions, a multi-scale attention information multiplexing network that helps improve the dassification accuracy of chest X-ray images was proposed in this paper. Firstly, by introducing multiple spatial information multiplexing blocks, the network enhances the positional connections between disease regions on feature maps and across channels; Secondly, through a multi-scale integration attention blocks, the network integrates multi-scale image feature information to automatically capture disease location variations and flexibly focus on key pathological information; Finally, the problem of imbalanced distribution of chest disease samples was alleviated by using an asymmetric shift focus loss function. Multiple experiments on the publicly available datasets ChestX-ray14 and CheXpert have shown that the average area under curve (AUC) value of the proposed network on two datasets reached 0.847 and 0.901 respectively, which is superior the more advanced network models in recent years. This indicates that the network can effectively improve the classification accuracy of chest diseases.
此外,许多研究人员将注意力机制应用到疾病分类领域。该机制通过调整可用的资源配置,引导网络模型聚焦于CXR图像中的病变区域。Wang等[11]提出三重注意网络A3Net,用于通道级、元素级及规模级的注意学习。Zhu等[12]提出逐像素分类和注意力网络(Pixel-wise classification and attention network,PCAN),为疾病分类提供了可解释性支持。Chen等[13]提出基于金字塔卷积和洗牌注意力模块的胸部疾病分类和COVID-19检测新网络PCSANet。
WeiX L, LiW, ZhangM M, et al. Medical hyperspectral image classification based on end-to-end fusion deep neural network[J]. IEEE Transactions on Instrumentation and Measurement, 2019, 68(11): 4481-4492.
LiuGui-xia, TianYu-xin, WangTao, et al. Pancreas segmentation algorithm based on dual input 3D convolutional neural network[J]. Journal of Jilin University (Engineering and Technology Edition), 2023, 53(12): 3565-3572.
WangXue, LiZhan-shan, Ying-daLyu. Medical image segmentation based on multi-scale context-aware and semantic adaptor[J]. Journal of Jilin University (Engineering and Technology Edition), 2022, 52(3): 640-647.
[6]
WangX S, PengY F, LuL, et al. ChestX-ray8: Hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases[C]∥IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, USA, 2017: 3462-3471.
[7]
KrizhevskyA, SutskeverI, HintonE G. ImageNet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017, 60(6): 84-90.
[8]
HeK M, ZhangX Y, RenS Q, et al. Deep residual learning for image recognition[C]∥IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway, NJ: IEEE, USA, 2016: 770-778.
[9]
IrvinJ, RajpurkarP, KoM, et al. CheXpert: A large chest radiograph dataset with uncertainty labels and expert comparison[C]∥Proceedings of the AAAI Conference on Artificial Intelligence Washington, DC: AAAI Press, 2019: 590-597.
[10]
JiangX B, ZhuY, CaiG, et al. MXT: A new variant of pyramid vision transformer for multi-label chest x-ray image classification[J]. Cognitive Computation, 2022, 14(4): 1362-1377.
[11]
WangW H, XieE Z, LiX, et al. Pyramid vision transformer: A versatile backbone for dense prediction without convolutions[C]∥2021 IEEE/CVF International Conference on Computer Vision. Piscataway, NJ: IEEE, 2021: 548-558.
HuJin-bo, NieWei-zhi, SongDan, et al. Chest X-ray imaging disease diagnosis model assisted by deformable transformer[J]. Journal of Zhejiang University (Engineering Science), 2023, 57(10): 1923-1932.
[14]
WangH Y, WangS S, QinZ B, et al. Triple attention learning for classification of 14 thoracic diseases using chest radiography[J]. Medical Image Analysis, 2021, 67: 101846.
[15]
ZhuX F, PangS M, ZhangX X, et al. PCAN: Pixel-wise classification and attention network for thoracic disease classification and weakly supervised localization[J]. Computerized Medical Imaging and Graphics, 2022, 102: 102137.
[16]
ChenK, WangX Q, ZhangS W. Thorax disease classification based on pyramidal convolution shuffle attention neural network[J]. IEEE Access, 2022, 10: 85571-85581.
[17]
LuZ C, DebK, BoddetiV N. MUXConv: Information multiplexing in convolutional neural networks[J]. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, 6: 12041-12050.
RidnikT, BenB E, ZamirN, et al. Asymmetric loss for multi-label classification[C]∥IEEE/CVF International Conference on Computer Vision. Piscataway, NJ: IEEE, 2021: 82-91.
[20]
GuanQ J, HuangY P, LuoY W, et al. Discriminative feature learning for thorax disease classification in chest X-ray images[J]. IEEE Transactions on Image Processing, 2021, 30: 2476-2487.
[21]
LeeY W, HuangS K, ChangR F. CheXGAT: A disease correlation-aware network for thorax disease diagnosis from chest X-ray images[J]. Artificial Intelligence in Medicine, 2022, 132: 102382.
[22]
ChenB Z, ZhangZ, LiY J, et al. Multi-label chest X-ray image classification via semantic similarity graph embedding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(4): 2455-2468.
[23]
PhamH, LeT, TranD, et al. Interpreting chest X-rays via CNNs that exploit hierarchical disease dependencies and uncertainty labels[J]. Neurocomputing, 2021, 437: 186-194.
[24]
SelvarajuR, CogswellM, DasA, et al. Grad-CAM: Visual explanations from deep networks via gradient-based localization[C]∥IEEE International Conference on Computer Vision. Piscataway, NJ: IEEE, 2017: 618-626.