面向图片数据的混凝土材料文本智能识别与分析

邓旭方 ,  刘乐平 ,  陈正虎 ,  钟恒 ,  吕沅庚 ,  封婧仪

水利水电技术(中英文) ›› 2025, Vol. 56 ›› Issue (S1) : 85 -94.

PDF (12478KB)
水利水电技术(中英文) ›› 2025, Vol. 56 ›› Issue (S1) : 85 -94. DOI: 10.13928/j.cnki.wrahe.2025.S1.014
知识驱动的长江大保护智慧EPC管控技术专栏

面向图片数据的混凝土材料文本智能识别与分析

作者信息 +

Intelligent recognition and analysis of concrete material text based on image data

Author information +
文章历史 +
PDF (12777K)

摘要

在混凝土坝建设过程中,产生了大量以非结构化文本表达的材料信息,对工程质量检测与材料进一步研发具有重要意义。受数据管理技术限制,存在大量以图片形式存储的材料文本数据,难以直接编辑与利用,无法满足混凝土材料数据智能分析与管理的需求。此外,针对海量的材料文本数据,目前缺乏智能的信息提取机制,难以高效获取文本中的关键信息。因此,提出了基于图片数据的混凝土材料文本智能解译方法,识别图片数据中的文本信息,提高了倾斜材料文本的检测与识别效率。以解译的图片数据为基础,从多角度文本特征关系出发,以MMR算法为框架,结合BERT模型以及TF-IDF算法,考虑文本语义与专业术语的重要性,建立了一套混凝土材料文本智能分析技术,提取混凝土材料文本中的关键信息。以实际混凝土材料文本为基础,该方法提取关键词的准确率为86.67%,优于其他常用的关键词提取模型。研究成果为混凝土材料不可编辑文本数据的处理提供了一种新的方法,有助于提升混凝土材料数据智能化管理水平。

Abstract

During the construction of concrete dams, a large amount of material information expressed in unstructured text is generated, which is of great significance for engineering quality inspection and further research and development of materials. Due to the limitations of data management technology, there is a large amount of material text data stored in the form of images, which is difficult to directly edit and utilize, and cannot meet the needs of intelligent analysis and management of concrete dam material data. In addition, there is currently a lack of intelligent information extraction mechanisms for massive material text data, making it difficult to efficiently obtain key information from the text. An intelligent interpretation method was proposed for concrete material text based on image data, which identifies text information in image data and improves the detection and recognition efficiency of inclined material text. Based on the interpreted image data, starting from the multi perspective text feature relationship, using MMR algorithm as the framework, combined with BERT model and TF-IDF algorithm, considering the importance of text semantics and professional terminology, a set of intelligent analysis technology for concrete material text was established to extract key information from concrete material text. Based on actual concrete material text, the accuracy of extracting keywords using this method is 86.67%, which is superior to other commonly used keyword extraction models. Research findings provide a new method for processing non editable text data of concrete materials, which helps to improve the intelligent management level of concrete dam material data.

关键词

混凝土坝 / 材料数据 / 文本检测 / 智能识别 / 关键信息

Key words

concrete dam / material data / text detection / intelligent recognition / key information

引用本文

引用格式 ▾
邓旭方,刘乐平,陈正虎,钟恒,吕沅庚,封婧仪. 面向图片数据的混凝土材料文本智能识别与分析[J]. 水利水电技术(中英文), 2025, 56(S1): 85-94 DOI:10.13928/j.cnki.wrahe.2025.S1.014

登录浏览全文

4963

注册一个新账户 忘记密码

参考文献

基金资助

中国长江电力股份有限公司科研项目(Z212302036)

AI Summary AI Mindmap
PDF (12478KB)

0

访问

0

被引

详细

导航
相关文章

AI思维导图

/