Abstract
The construction scheme is the core component of construction organization design. Applying natural language processing to extract construction schemes from unstructured text and review them can improve review efficiency and preparation quality, and helps identify potential safety and quality risks in the schemes so that warnings can be issued during construction. Extracting construction schemes from unstructured text requires first clarifying the content composition of the different types of construction scheme and then classifying the relevant paragraphs by content. To address this paragraph classification problem, this study builds on an in-depth analysis of construction scheme categories and their content composition framework, takes paragraphs from construction organization designs of urban pipe network projects as samples, and proposes a paragraph text classification model that combines Albert and TextRCNN. The model uses the Albert pretrained language model for word embedding and feeds the resulting word vectors into a TextRCNN classifier to perform the classification. It outperforms Albert-TextCNN, the best of the other three models compared, with accuracy higher by 0.79%. The experiments show that TextRCNN combined with Albert can effectively classify construction organization design paragraphs by content, providing a basis for subsequent construction scheme extraction.
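The data flow described in the abstract (token vectors from a pretrained encoder passed to a TextRCNN classifier) can be illustrated with a minimal forward-pass sketch. It is only an illustration under stated assumptions: random vectors stand in for Albert output, the weights are random and untrained, biases are omitted, and all names and dimensions (`rcnn_classify`, `init_params`, 8-dimensional embeddings, 3 classes) are hypothetical. The sketch follows the recurrent-context formulation of RCNN (left and right context, concatenation, tanh layer, max-pooling over tokens, softmax); the abstract does not say which variant the authors used, and many implementations use a bidirectional LSTM instead.

```python
import math
import random


def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng):
    """Small random weights; a trained model would learn these."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def context_pass(embeddings, W_ctx, W_emb):
    """Recurrent context: c_i = tanh(W_ctx c_{i-1} + W_emb e_{i-1}), c_0 = 0."""
    c = [0.0] * len(W_ctx)
    contexts = [c]
    for e_prev in embeddings[:-1]:
        c = [math.tanh(a + b)
             for a, b in zip(matvec(W_ctx, c), matvec(W_emb, e_prev))]
        contexts.append(c)
    return contexts


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(x - m) for x in z]
    total = sum(exps)
    return [x / total for x in exps]


def rcnn_classify(embeddings, params):
    """Return class probabilities for one paragraph of token vectors."""
    # Left context reads tokens before position i; right context reads tokens after it.
    left = context_pass(embeddings, params["W_l"], params["W_sl"])
    right = context_pass(embeddings[::-1], params["W_r"], params["W_sr"])[::-1]
    hidden = []
    for c_l, e, c_r in zip(left, embeddings, right):
        x = c_l + e + c_r  # concatenation: [left context; embedding; right context]
        hidden.append([math.tanh(h) for h in matvec(params["W_h"], x)])
    pooled = [max(col) for col in zip(*hidden)]  # max over tokens, per feature
    return softmax(matvec(params["W_out"], pooled))


def init_params(emb_dim, ctx_dim, hid_dim, n_classes, seed=0):
    """Untrained parameters with hypothetical sizes."""
    rng = random.Random(seed)
    return {
        "W_l": rand_matrix(ctx_dim, ctx_dim, rng),
        "W_sl": rand_matrix(ctx_dim, emb_dim, rng),
        "W_r": rand_matrix(ctx_dim, ctx_dim, rng),
        "W_sr": rand_matrix(ctx_dim, emb_dim, rng),
        "W_h": rand_matrix(hid_dim, 2 * ctx_dim + emb_dim, rng),
        "W_out": rand_matrix(n_classes, hid_dim, rng),
    }


# Stand-in for encoder output: 6 tokens, 8-dimensional vectors (assumed sizes).
rng = random.Random(42)
token_vectors = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(6)]
params = init_params(emb_dim=8, ctx_dim=4, hid_dim=5, n_classes=3)
probs = rcnn_classify(token_vectors, params)
print(probs)  # three probabilities that sum to 1
```

In a real pipeline the `token_vectors` would come from the pretrained encoder and the parameters would be trained on labeled paragraphs; the predicted category is the index of the largest probability.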
Key words
construction scheme / paragraph text / Albert / TextRCNN
Author summary
Runlong DU (1), Kexin TAN (2), Tian GAO (1), Zhi HAN (2), Yunfeng XU (1)
1. Yangtze Three Gorges Technology and Economy Development Co., Ltd., Beijing 101100, China
2. State Key Laboratory of Hydraulic Engineering Simulation and Safety, Tianjin University, Tianjin 300350, China
Runlong DU (b. 1990), male, senior engineer, bachelor's degree, works mainly on digitalization research. E-mail: du_runlong@ctg.com.cn
Kexin TAN. E-mail: tankexin@tju.com
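DU Runlong, TAN Kexin, GAO Tian, HAN Zhi, XU Yunfeng. Research on construction scheme categories and implementation of a text classification model[J]. Water Resources and Hydropower Engineering (水利水电技术(中英文)), 2025, 56(S1): 95-101. DOI:10.13928/j.cnki.wrahe.2025.S1.015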
Funding
Scientific Research Project of China Three Gorges Corporation (202103551)