Objective To construct a model with a spatial and channel reconstruction convolutional module for accurate identification and classification of lung sound data. Method We propose a convolutional network architecture combining the spatial-channel reconstruction convolution (SCConv) module. A lung sound feature extraction method combining the dual tunable Q-factor wavelet transform (DTQWT) with the triple Wigner-Ville transform (WVT) was used to improve the model's ability to capture the key features of the lung sounds by adaptively focusing on the important channel and spatial features. The performance of the model for classification of normal, crackles, wheezes, and crackles with wheezes was tested using the ICBHI2017 dataset. Results and Conclusion The accuracy, sensitivity, specificity and F1 score of the proposed method reached 85.68%, 93.55%, 86.79% and 90.51%, respectively, demonstrating its good performance in classification tasks in the ICBHI2017 lung sound database, especially for distinguishing normal from abnormal lung sounds.
XieSN, GirshickR, DollárP, et al. Aggregated residual transformations for deep neural networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI, USA. IEEE, 2017: 5987-95.
SzegedyC, IoffeS, VanhouckeV, et al. Inception-v4, inception-ResNet and the impact of residual connections on learning[J]. Proc AAAI Conf Artif Intell, 2017, 31(1): 4480-8.
MnihV, HeessN, GravesA, et al. Recurrent models of visual attention[EB/OL]. 2014: arXiv: 1406.6247.
[8]
BaJ, MnihV, KavukcuogluK. Multiple object recognition with visual attention[J]. arXiv E Prints, 2014: arXiv:
[9]
GregorK, DanihelkaI, GravesA, et al. DRAW: a recurrent neural network for image generation[EB/OL]. 2015: arXiv: 1502.04623.
[10]
XuL, ChengJH, LiuJ, et al. ARSC-net: adventitious respiratory sound classification network using parallel paths with channel-spatial attention[C]//2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Houston, TX, USA. IEEE, 2021: 1125-30.
[11]
LiJF, WenY, HeLH. SCConv: spatial and channel reconstruction convolution for feature redundancy[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Vancouver, BC, Canada. IEEE, 2023: 6153-62.
ChenH, YuanXC, PeiZY, et al. Triple-classification of respiratory sounds using optimized S-transform and deep residual networks[J]. IEEE Access, 2845, 7: 32845-52.
[14]
BahdanauD, ChoK, BengioY. Neural machine translation by jointly learning to align and translate[J]. arXiv E Prints, 2014: arXiv:
[15]
XuK, BaJ, KirosR, et al. Show, attend and tell: neural image caption generation with visual attention[EB/OL]. 2015: arXiv: 1502.03044.
[16]
HouQB, LuCZ, ChengMM, et al. Conv2Former: a simple transformer-style ConvNet for visual recognition[J]. IEEE Trans Pattern Anal Mach Intell, 2024, doi: 10.1109/TPAMI.2024.3401450 . Online ahead of print.
[17]
GulzarH, LiJY, ManzoorA, et al. Transfer Learning based Diagnosis and Analysis of Lung Sound Aberrations[J]. Int J Bioinform Biosci, 2023, 13(1): 29-40.
[18]
RochaBM, FilosD, MendesL, et al. Α respiratory sound database for the development of automated classification[C]//International Conference on Biomedical and Health Informatics. Singapore: Springer, 2018: 33-37.
[19]
BohadanaA, IzbickiG, KramanSS. Fundamentals of lung auscultation[J]. N Engl J Med, 2014, 370(8): 744-51.
[20]
PerezL, WangJ. The Effectiveness of Data Augmentation in Image Classification using Deep Learning[J]. arXiv E Prinst, 2017: ar Xiv:1712. 04621.
[21]
MaY, XuXZ, YuQ, et al. LungBRN: a smart digital stethoscope for detecting respiratory disease using bi-ResNet deep learning algorithm[C]//2019 IEEE Biomedical Circuits and Systems Conference (BioCAS). Nara, Japan. IEEE, 2019: 1-4.
[22]
WuCW, YeN, JiangJL. Classification and recognition of lung sounds based on improved Bi-ResNet model[J]. IEEE Access, 2024, 12: 73079-94.
[23]
NguyenT, PernkopfF. Lung sound classification using snapshot ensemble of convolutional neural networks[C]//2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). Montreal, QC, Canada. IEEE, 2020: 760-3.
[24]
SongWJ, HanJQ, SongHW. Contrastive embeddind learning method for respiratory sound classification[C]//ICASSP 2021‑2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, ON, Canada. IEEE, 2021: 1275-9.