基于迁移学习与知识蒸馏的管网工程领域语音识别

封婧仪 ,  郭先强 ,  张存根 ,  吕沅庚 ,  刘乐平

水利水电技术(中英文) ›› 2025, Vol. 56 ›› Issue (S2) : 6 -9.

PDF (3325KB)
水利水电技术(中英文) ›› 2025, Vol. 56 ›› Issue (S2) : 6 -9. DOI: 10.13928/j.cnki.wrahe.2025.S2.002
知识驱动的长江大保护智慧EPC管控技术专栏

基于迁移学习与知识蒸馏的管网工程领域语音识别

作者信息 +

Speech recognition in pipeline engineering domain based on transfer learning and knowledge distillation

Author information +
文章历史 +
PDF (3404K)

摘要

市政管道工程是城市建设的重点领域,传统信息录入手段记录专家施工经验效率低。语音识别技术可提升效率,但通用模型在该领域的准确率低。为此,提出一种基于迁移学习与知识蒸馏的管网工程领域语音识别模型。该模型采用端到端方式,通过迁移学习将开放域模型参数迁移到领域数据,再用知识蒸馏压缩模型,提升识别精度和效率。试验表明,迁移学习使字错误率降低6.2%,知识蒸馏使模型参数减少83.2 MB,推理速度提高。

Abstract

Municipal pipeline engineering is a key area in urban construction. Traditional method of recording expert construction guidance are inefficient. Speech recognition technology can improve efficiency but often has low accuracy in specialized domains. A speech recognition model was proposed for the pipeline engineering domain based on transfer learning and knowledge distillation. The model uses an end-to-end approach, adapts parameters from an open-domain model to the target domain via transfer learning, and then compresses the model using knowledge distillation.[Results]show that transfer learning reduces the word error rate by 6.2%, and knowledge distillation reduces model parameters by 83.2 MB while improving inference speed.

关键词

管网工程 / 专家语音识别 / 迁移学习 / 知识蒸馏 / 轻量化

Key words

pipeline engineering / expert speech recognition / transfer learning / knowledge distillation / lightweight design

引用本文

引用格式 ▾
封婧仪,郭先强,张存根,吕沅庚,刘乐平. 基于迁移学习与知识蒸馏的管网工程领域语音识别[J]. 水利水电技术(中英文), 2025, 56(S2): 6-9 DOI:10.13928/j.cnki.wrahe.2025.S2.002

登录浏览全文

4963

注册一个新账户 忘记密码

参考文献

基金资助

中国长江三峡集团有限公司科研项目(202103551)

AI Summary AI Mindmap
PDF (3325KB)

0

访问

0

被引

详细

导航
相关文章

AI思维导图

/