Objective The aircraft ground maintenance job card is the essential reference for maintenance operations and records. It requires handwritten image identification and digital storage. Due to the limitations of maintenance rules or manuals in civil aviation, a mixture of Chinese and English words often forms complex sentences on the same job card, which creates difficulties in word character extraction and reduces recognition precision. This study applies a new method of multi-branch convolution, the Re-parameterized and Multi-branch Convolution Algorithm (RMCA), to enhance the recognition of complex structures and similar words, improving mean average precision (MAP) and identification efficiency. This study addresses several problems in handwritten word identification. First, the number of layers in the deep convolution network affects the results of characteristic extraction. Second, features extracted from different layers represent varying dimensions in the feature matrix. Third, Chinese words demonstrate varying levels of complexity. Methods The identification precision index is defined as the mean average precision of Top1 and Top5, and the identification efficiency index is expressed as memory access cost (MAC) to evaluate the proposed model. However, calculating MAC during the model training process presents challenges. Therefore, MAC is replaced with the number of processed image pieces per second. The improved RMCA algorithm utilizes the strengths of deep convolution to extract image characteristics related to boundaries and fine details. Deep convolution layers are known for learning features at different abstraction levels, while lower layers capture more localized details. In addition, the dimension of the convolutional kernel influences the receptive field and local features within a certain layer. The core of the handwritten word identification model lies in the added convolutional channels and layers, along with an adaptive identification algorithm designed for identical and similar words in handwritten images. Higher identification precision and efficiency serve as reference indices for evaluating the model. The improved RMCA algorithm applies four branches in the initial layers, which differs from the original re-parameterized structure. The kernel sizes in the four branches are set to 11 and 77, equivalent to a variable dimension kernel of 77. The following of a fully connected layer can cause the loss of boundary or specific layer features, making it challenging to meet identification requirements for words with complex or simple structures. Hence, the improved RMCA algorithm utilizes spatial features. The fully connected layer is replaced with fully convolutional layers, and the spatial features from the fourth layer are passed to the classifier. This design enables the improved model to adapt automatically to various word structure complexities. The improved model comprises four functional components. The innovation of this study lies in several aspects. First, the enhanced re-parameterized structure across multiple stages and branches achieves an effect equivalent to variable convolution. Second, the refined classifier with fully convolutional layers combines features from specific intermediate layers with the output layer, resulting in improved precision for complex and similar words. The feature extraction performance is enhanced. Compared to feature outputs from the fourth and fifth layers of traditional models, the conclusion is confirmed. The simple Chinese word characteristics in the fourth layer are more abundant than in the fifth, whereas complex handwritten Chinese words contain similarly detailed features across both layers. Results and Discussions The training image datasets for the experiment consist of two groups. Group 1 includes the HWDB1.0‒1.1, comprising 3 755 classes of Chinese words, totaling 2.68 million images. The test dataset is the ICDAR‒2013, containing 224 thousand Chinese word images. Group 2 extends Group 1 by incorporating English uppercase and lowercase letters (52 classes) and digits from 0 to 9 (10 classes). The test dataset expands the ICDAR‒2013 with additional images of English letters and digits. The experimental results presented in this study demonstrate improvements in evaluation indices compared to other models, including the pre-improvement model. The experiments are divided into two categories: ablation and comparative. The results of the ablation experiment indicate that the Top1 and Top5 precision indices improved to 97.50% and 99.79%, respectively. Specifically, altering the 77 kernel in the first layer increases precision by 0.3%, while modifying the classifier results in a 0.6% gain. Group 2 achieves a Top1 recognition accuracy of 97.23%. The results of the comparative experiment, based on ten traditional models, showed that the proposed model occupies 69.1 MB of storage, slightly more than the 48.34 MB of the lightweight MobileNetV2. However, the model achieves superior precision (97.50%) and identification speed (1 410 cards per second). Compared to the original Rep‒VGG and ResNet50, the improved model increases precision by 6.90% and 8.43%, respectively. Identification speed improves by 8.8% and 17.2%, respectively. Conclusions These results confirm that the proposed method enhances word recognition precision and efficiency. Experiments involving similar word identification yield consistent findings. In the long term, the improved model proves applicable in the field of aircraft maintenance job cards and other specialized areas requiring handwritten word identification.
ThulasyT N, NohuddinP N, Abd RahimN,et al.Skill set issues in aircraft maintenance from industrial revolution 4.0 context:A document analytics survey[J].Human Systems Ma-nagement,2022,41(4):503‒516. doi:10.3233/hsm-210013
ChenJiashu.Research on coexistence of IMT and high-alti-tude communication platform system in 5G candidate frequency band[D].Beijing:Beijing University of Posts and Te-lecommunications,2019.
[5]
陈家书.5G候选频段内IMT与高空通信平台系统的共存研究[D].北京:北京邮电大学,2019.
[6]
LiuChenglin, JinLianwen, BaiXiang,et al.Frontiers of intelligent document analysis and recognition:Review and pr-ospects[J].Journal of Image and Graphics,2023(8):2223‒2252.
LiYunqing, DuJun, HuPengfei,et al.A method of radical form and hierarchical structure based handwritten Chinese character error correction[J].Journal of Image and Graphics,2023,28(8):2382‒2395.
JinLianwen, ZhongZhuoyao, YangZhao,et al.Applicatio-ns of deep learning for handwritten Chinese character recognition:A review[J].Acta Automatica Sinica,2016,42(8):1125‒1141.
ZhuoTiantian, SangQingbing.Application of attention me-chanism and composite convolution in handwriting recognition[J].Journal of Frontiers of Computer Science & Tec-hnology,2022,16(4):888‒897.
ShiPingping, HuangHongqiong.Lightweight MobileNetV2 offline handwritten Chinese character recognition based on attention mechanism[C]//Proceedings of the International Symposium on Robotics,Artificial Intelligence,and Information Engineering(RAIIE 2022).Hohhot:SPIE,2022:528‒535. doi:10.1117/12.2659091
[15]
ZouJunyi, ZhangJinliang, WangLudi.Handwritten Chinese character recognition by convolutional neural network and similarity ranking[EB/OL].(2019‒08‒30)[2023‒07‒01].
[16]
KimuraF, TakashinaK, TsuruokaS,et al.Modified quadra-tic discriminant functions and the application to Chinese character recognition[J].IEEE Transactions on Pattern An-alysis and Machine Intelligence,1987,9(1):149‒153. doi:10.1109/tpami.1987.4767881
[17]
WeiXiaohua, LuShujing, LuYue.Compact MQDF classifiers using sparse coding for handwritten Chinese character recognition[J].Pattern Recognition,2018,76:679‒690. doi:10.1016/j.patcog.2017.09.044
SuTonghua, YouHongming, LiuShuchen,et al.FPRNet:End-to-end full-page recognition model for handwritten Chin-ese essay[C]//International Conference on Frontiers in Ha-ndwriting Recognition.Cham:Springer,2022:231‒244. doi:10.1007/978-3-031-21648-0_16
[20]
HuangWanrong, HeKai, LiuKun,et al.Handwritten Chine-se character recognition based on attention mechanism[J].Laser & Optoelectronics Progress,2020,57(8):081002.
XieCanyu, LaiSongxuan, LiaoQianying,et al.High performance offline handwritten Chinese text recognition with a new data preprocessing and augmentation pipeline[C]//International Workshop on Document Analysis Systems.Ch-am:Springer,2020:45‒59. doi:10.1007/978-3-030-57058-3_4
[23]
ZhouYuchuan, TanQinhong, XiChuanlong.Offline handwritten Chinese character recognition of SqueezeNet and dynamic network surgery[J].Journal of Chinese Computer Systems,2021,42(3):556‒560.
DingXiaohan, ZhangXiangyu, MaNingning,et al.RepVGG:Making VGG-style ConvNets great again[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Nashville:IEEE,2021:13728‒13737. doi:10.1109/cvpr46437.2021.01352
[26]
TrangK, NguyenA H, TonThatL,et al.Improving RepVGG model with variational data imputation in COVID-19 classification[J].IAES International Journal of Artificial Intelligence(IJ-AI),2022,11(4):1278. doi:10.11591/ijai.v11.i4.pp1278-1286
[27]
ShiriP, BaniasadiA.Convolutional fully-connected capsule network(CFC-CapsNet):A novel and fast capsule network[J].Journal of Signal Processing Systems,2022,94(7):645‒658. doi:10.1007/s11265-021-01731-6
[28]
ShelhamerE, LongJ, DarrellT.Fully convolutional networ-ks for semantic segmentation[C]//Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligen-ce.Boston:IEEE,2016:640‒651. doi:10.1109/tpami.2016.2572683
[29]
LiuChenglin, YinFei, WangDahan,et al.Online and offline handwritten Chinese character recognition:Benchma-rking on new databases[J].Pattern Recognition,2013,46(1):155‒162. doi:10.1016/j.patcog.2012.06.021
[30]
IoffeS, SzegedyC.Batch normalization:Accelerating deep network training by reducing internal covariate shift[EB/OL].(2015‒02‒11)[2023‒07‒01].
[31]
YinFei, WangQiufeng, ZhangXuyao,et al.ICDAR 2013 Chinese handwriting recognition competition[C]//Proceedings of the 2013 12th International Conference on Document Analysis and Recognition.Washington:IEEE,2013:1464‒1470. doi:10.1109/icdar.2013.218
[32]
ShaoYunxue, GaoGuanglai, WangChunheng.A connection reduced network for similar handwritten Chinese character discrimination[C]//Proceedings of the 2016 15th International Conference on Frontiers in Handwriting Recognition(ICFHR).Shenzhen:IEEE,2017:54‒59. doi:10.1109/icfhr.2016.0023
[33]
HuangZetao, ZhangQian.Skew correction of handwritten Chinese character based on ResNet[C]//Proceedings of the 2019 International Conference on High Performance Big Data and Intelligent Systems(HPBD&IS).Shenzhen:IEEE,2019:223‒227. doi:10.1109/hpbdis.2019.8735469
[34]
SimonyanK, ZissermanA.Very deep convolutional netwo-rks for large-scale image recognition[EB/OL].(2014‒09‒ 04)[2023‒07‒01].
[35]
TanMingxing, LeQ V.EfficientNet:Rethinking model scaling for convolutional neural networks[EB/OL].(2019‒05‒28)[2023‒07‒01].
[36]
HeKaiming, ZhangXiangyu, RenShaoqing,et al.Deep residual learning for image recognition[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Las Vegas:IEEE,2016:770‒778. doi:10.1109/cvpr.2016.90
[37]
ChenLi, WangSong, FanWei,et al.Beyond human recognition:A CNN-based framework for handwritten character recognition[C]//Proceedings of the 2015 3rd IAPR Asian Conference on Pattern Recognition(ACPR).Kuala Lump-ur:IEEE,2016:695‒699. doi:10.1109/acpr.2015.7486592
[38]
ZhangYaping, LiangShan, NieShuai,et al.Robust offline handwritten character recognition through exploring writer-independent features under the guidance of printed data[J].Pattern Recognition Letters,2018,106:20‒26. doi:10.1016/j.patrec.2018.02.006