命名实体识别方法及在电力领域的应用
Methods of named entity recognition and applications in electric power domain
领域命名实体识别方法在大语言模型技术赋能的背景下,推动了电力领域命名实体识别的发展。在此背景下,本文综述电力领域命名实体识别方法的发展历程,介绍早期基于规则和词典的方法再到统计机器学习的方法。从分布式嵌入层、文本编码层和标签解码层总结基于深度学习方法的模型。本文还探讨大语言模型在命名实体识别任务中的应用及其影响,并且探索当前电力领域命名实体识别存在的问题。
The development of named entity recognition (NER) in the power sector has been propelled by domain-specific NER methods under the empowerment of large language model technology. In this context, the evolutionary journey of NER methods within the power domain is reviewed in this paper, where early approaches based on rules and dictionaries are introduced, followed by statistical machine learning methods. Deep learning-based models are summarized from the perspectives of the distributed embedding layer, the text encoding layer, and the label decoding layer. The application of large language models to NER tasks and their impact are also examined. Furthermore, the existing challenges currently faced by power domain NER are explored. Finally, an outlook on future research directions is presented.
| [1] |
王颖洁, 张程烨, 白凤波, |
| [2] |
|
| [3] |
李猛, 李艳玲, 林民 . 命名实体识别的迁移学习研究综述[J]. 计算机科学与探索, 2021, 15(2): 206-218. |
| [4] |
|
| [5] |
李新鹏, 徐建航, 郭子明, |
| [6] |
|
| [7] |
王慧芳, 曹靖, 罗麟 . 电力文本数据挖掘现状及挑战[J]. 浙江电力, 2019, 38(3): 1-7. |
| [8] |
|
| [9] |
冀振燕, 孔德焱, 刘伟, |
| [10] |
|
| [11] |
袁金斗, 潘明明, 张腾, |
| [12] |
|
| [13] |
徐鹏, 龚伟, 宋俊典 . 基于MRC的设备故障命名实体识别方法[J]. 计算机应用与软件, 2024, 41(5): 171-176. |
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
刘梓权, 王慧芳 . 基于知识图谱技术的电力设备缺陷记录检索方法[J]. 电力系统自动化, 2018, 42(14): 158-164. |
| [18] |
|
| [19] |
孙玉芹, 肖静婷, 王海超 . 基于多模型融合的电力运检命名实体识别[J]. 科学技术与工程, 2023, 23(36): 15545-15552. |
| [20] |
|
| [21] |
潘晖, 赵岩, 李麟, |
| [22] |
|
| [23] |
孔静静, 于琦, 李敬华, |
| [24] |
|
| [25] |
|
| [26] |
李嘉皓, 熊威, 龚康, |
| [27] |
|
| [28] |
纪鑫, 武同心, 余婷, |
| [29] |
|
| [30] |
田雪涵, 董坤, 赵剑锋, |
| [31] |
|
| [32] |
吴智妍, 金卫, 岳路, |
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
杜修明, 秦佳峰, 郭诗瑶, |
| [37] |
|
| [38] |
蒋逸雯, 李黎, 李智威, |
| [39] |
|
| [40] |
李强, 庄莉, 赵峰, |
| [41] |
|
| [42] |
张宇波, 王有元, 梁玄鸿, |
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
杨秋勇, 彭泽武, 苏华权, |
| [49] |
|
| [50] |
张大波, 郭怀新, 储著伟, |
| [51] |
|
| [52] |
肖勇, 郑楷洪, 王鑫, |
| [53] |
|
| [54] |
江叶峰, 孙少华, 仇晨光, |
| [55] |
|
| [56] |
徐会芳, 张中浩, 谈元鹏, |
| [57] |
|
| [58] |
毛宏亮, 艾孜尔古丽, 陈德刚 . 基于多头注意力的电网调度领域命名实体识别[J]. 计算机技术与发展, 2023, 33(2): 181-186. |
| [59] |
|
| [60] |
|
| [61] |
吴超, 王汉军 . 基于GRU的电力调度领域命名实体识别方法[J]. 计算机系统应用, 2020, 29(8): 185-191. |
| [62] |
|
| [63] |
宋厚岩, 王汉军 . 基于GRU和PCNN的电力知识抽取[J]. 计算机系统应用, 2021, 30(9): 200-205. |
| [64] |
|
| [65] |
|
| [66] |
张汝佳, 代璐, 王邦, |
| [67] |
|
| [68] |
|
| [69] |
|
| [70] |
|
| [71] |
俞阳, 何玮, 康雨萌 . 一种面向自然语言问题的命名实体识别模型[J]. 电子设计工程, 2023, 31(14): 29-32. |
| [72] |
|
| [73] |
顾亦然, 霍建霖, 杨海根, |
| [74] |
|
| [75] |
刘斐, 文中, 吴艺 . 基于BERT—BILSTM—CRF模型的电力行业事故文本智能分析[J]. 中国安全生产科学技术, 2023, 19(1): 209-215. |
| [76] |
|
| [77] |
黄锋, 崔志美, 黄志都, |
| [78] |
|
| [79] |
龚泽威一, 肖妮, 曹占国, |
| [80] |
|
| [81] |
|
| [82] |
孙宏云, 李喜旺 . 面向配电网数据的命名实体识别[J]. 计算机系统应用, 2023, 32(2): 387-393. |
| [83] |
|
| [84] |
张智源, 孙水华, 徐诗傲, |
| [85] |
|
| [86] |
蒋晨, 王渊, 胡俊华, |
| [87] |
|
| [88] |
徐翀, 王其清 . 面向知识获取的电力科技领域语言模型研究[J]. 电力信息与通信技术, 2023, 21(4): 31-36. |
| [89] |
|
| [90] |
|
| [91] |
杨政, 蔡迪, 李慧斌 . 基于层次化表示的电力文本命名实体识别和匹配算法[J]. 计算机与现代化, 2022(5): 75-81. |
| [92] |
|
| [93] |
黄源航, 强梦烨, 李涛, |
| [94] |
|
| [95] |
张锐, 刘剑青, 张伯远, |
| [96] |
|
| [97] |
王佳琪, 俞灵, 夏文岳, |
| [98] |
|
| [99] |
|
| [100] |
皮俊波, 齐世雄, 孙文多, |
| [101] |
|
| [102] |
|
| [103] |
|
| [104] |
陈伟, 杨燕 . 基于指针网络的抽取生成式摘要生成模型[J]. 计算机应用, 2021, 41(12): 3527-3533. |
| [105] |
|
| [106] |
何俊, 刘鹏, 聂勇, |
| [107] |
|
| [108] |
|
| [109] |
冯曙明, 胡天牧, 杨永成, |
| [110] |
|
| [111] |
|
| [112] |
|
| [113] |
|
| [114] |
|
| [115] |
|
| [116] |
|
| [117] |
|
| [118] |
|
| [119] |
|
| [120] |
|
| [121] |
|
| [122] |
|
| [123] |
|
广东省重点实验室基金资助项目(2023B1212060076)
深圳市科技计划基金资助项目(KJZD20230923114405012)
南方电网科技基金资助项目(031900KC23040016(GDKJXM20230399))
南方电网科技基金资助项目(031900KC23040017(GDKJXM20230401))
/
| 〈 |
|
〉 |