PDF (8739K)
摘要
工程规范是工程建设过程中常用的重要标准文件之一。面对这些非结构化工程规范文本,高效、准确地从中抽取相关知识,并将这些知识以可视化形式呈现,对于提高知识的利用效率、提升管理人员对工程规范文本的理解效率有着重要的作用。针对典型的工程规范文本,提出一种基于深度学习的工程规范知识抽取方法,融合ALBERT(A Lite Bidirectional Encoder Representation from Transformers)、BiLSTM(Bi-directional Long Shot-Term Memory)和CRF(Conditional Random Fields),建立工程规范实体识别模型,增强文本语义特征,获得工程规范中的实体;融合Attention机制和BiLSTM提取工程规范中的关系,根据所提取出的知识构建工程规范知识图谱。以《给水排水管道工程施工及验收规范》为典型实例对该方法进行了验证,结果表明,工程规范实体识别的F1值为78.18%,优于传统模型;关系抽取的F1值为98.35%。利用所抽取知识建立了工程规范知识图谱,通过基于知识图谱的全局信息展示、特定信息检索,提升工程规范的利用效率,辅助工程现场施工。
Abstract
Engineering specifications are one of the important standard documents commonly used in the construction process. Faced with these unstructured engineering specification texts, efficiently and accurately extracting relevant knowledge and presenting this knowledge in a visual format plays a significant role in improving knowledge utilization efficiency and enhancing management personnel′s understanding of engineering specification texts. A deep learning-based method was proposed for extracting knowledge from typical engineering specification texts, integrating ALBERT(A Lite Bidirectional Encoder Representation from Transformers), BiLSTM(Bi-directional Long Short-Term Memory), and CRF(Conditional Random Fields) to establish an entity recognition model for engineering specifications. The model enhances the semantic features of the text to identify entities within the engineering specifications. Additionally, it employs the Attention mechanism and BiLSTM to extract relationships from the engineering specifications and constructs an engineering specification knowledge graph based on the extracted knowledge. Using the “Construction and Acceptance Specifications for Water Supply and Drainage Pipeline Projects” as a typical example, the method was validated, yielding an F1 score of 78.18% for entity recognition, which is superior to traditional models, and an F1 score of 98.35% for relationship extraction. Leveraging this knowledge, an engineering specification knowledge graph was established. Through a knowledge graph-based global information display, specific information retrieval, the efficiency of utilizing engineering specification knowledge was improved, assisting with on-site construction.
关键词
工程规范
/
知识抽取
/
ALBERT预训练模型
/
BiLSTM
/
CRF
/
注意力机制
Key words
engineering specification
/
knowledge extraction
/
ALBERT pre-training model
/
BiLSTM
/
CRF
/
attention
Author summay
[Author(id=1248676020572791657, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=deng_xufang@ctg.com.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1248676020631511915, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676020572791657, language=EN, stringName=Xufang DENG, firstName=Xufang, middleName=null, lastName=DENG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1. China Yangtze Power Co., Ltd., Wuhan 430014, Hubei, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1248676020673454957, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676020572791657, language=CN, stringName=邓旭方, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.中国长江电力股份有限公司, 湖北 武汉 430014, bio={"content":"邓旭方(1987—), 男, 工程师, 水工金属结构主任师, 学士, 主要从事水工建筑物运维和大坝安全管理领域的技术研究工作。Email: deng_xufang@ctg.com.cn
"}, bioImg=null, bioContent=邓旭方(1987—), 男, 工程师, 水工金属结构主任师, 学士, 主要从事水工建筑物运维和大坝安全管理领域的技术研究工作。Email: deng_xufang@ctg.com.cn
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1248676020421796701, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, xref=null, ext=[AuthorCompanyExt(id=1248676020438573919, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020421796701, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1. China Yangtze Power Co., Ltd., Wuhan 430014, Hubei, China), AuthorCompanyExt(id=1248676020451156833, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020421796701, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.中国长江电力股份有限公司, 湖北 武汉 430014)])]), Author(id=1248676020719592304, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1248676020786701171, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676020719592304, language=EN, stringName=Fei CHENG, firstName=Fei, middleName=null, lastName=CHENG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1248676020828644213, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676020719592304, language=CN, stringName=成飞, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1248676020497294179, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, xref=null, ext=[AuthorCompanyExt(id=1248676020509877092, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China), AuthorCompanyExt(id=1248676020526654310, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350)])]), Author(id=1248676020874781560, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=lyg@tju.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1248676020929307515, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676020874781560, language=EN, stringName=Yuangeng LYU, firstName=Yuangeng, middleName=null, lastName=LYU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1248676020983833469, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676020874781560, language=CN, stringName=吕沅庚, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1248676020497294179, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, xref=null, ext=[AuthorCompanyExt(id=1248676020509877092, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China), AuthorCompanyExt(id=1248676020526654310, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350)])]), Author(id=1248676021025776511, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1248676021105468290, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676021025776511, language=EN, stringName=Lun DENG, firstName=Lun, middleName=null, lastName=DENG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1. China Yangtze Power Co., Ltd., Wuhan 430014, Hubei, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1248676021151605635, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676021025776511, language=CN, stringName=邓伦, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1.中国长江电力股份有限公司, 湖北 武汉 430014, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1248676020421796701, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, xref=null, ext=[AuthorCompanyExt(id=1248676020438573919, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020421796701, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1. China Yangtze Power Co., Ltd., Wuhan 430014, Hubei, China), AuthorCompanyExt(id=1248676020451156833, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020421796701, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1.中国长江电力股份有限公司, 湖北 武汉 430014)])]), Author(id=1248676021197742981, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, orderNo=4, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1248676021260657543, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676021197742981, language=EN, stringName=Leping LIU, firstName=Leping, middleName=null, lastName=LIU, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1248676021315183496, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676021197742981, language=CN, stringName=刘乐平, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1248676020497294179, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, xref=null, ext=[AuthorCompanyExt(id=1248676020509877092, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China), AuthorCompanyExt(id=1248676020526654310, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350)])]), Author(id=1248676021365515146, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, orderNo=5, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1248676021424235404, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676021365515146, language=EN, stringName=Jingyi FENG, firstName=Jingyi, middleName=null, lastName=FENG, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1248676021470372749, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, authorId=1248676021365515146, language=CN, stringName=封婧仪, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=2, address=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1248676020497294179, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, xref=null, ext=[AuthorCompanyExt(id=1248676020509877092, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2. State Key Laboratory of Hydraulic Engineering Intelligent Construction and Operation, Tianjin University, Tianjin 300350, China), AuthorCompanyExt(id=1248676020526654310, tenantId=1045748351789510663, journalId=1221126710357164034, articleId=1248596674512277607, companyId=1248676020497294179, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2.天津大学 水利工程智能建设与运维全国重点实验室, 天津 300350)])])]
邓旭方,成飞,吕沅庚,邓伦,刘乐平,封婧仪.
基于混合深度学习算法的工程规范知识抽取[J].
水利水电技术(中英文), 2025, 56(S1): 76-84 DOI:10.13928/j.cnki.wrahe.2025.S1.013
基金资助
中国长江电力股份有限公司科研项目(Z212302036)