基于图注意力神经网络的实体消歧方法
牛泽群 , 李晓戈 , 强成宇 , 韩伟 , 姚怡 , 刘洋
山东大学学报(理学版) ›› 2024, Vol. 59 ›› Issue (03) : 71 -80.
基于图注意力神经网络的实体消歧方法
Entity disambiguation method based on graph attention networks
针对链接对象为存在半结构化数据的知识库,提出了一种基于图注意力神经网络的短文本实体指称消歧方法。通过信息抽取与融入关键词,将含有半结构化数据的知识库构建为全局知识图谱;同时基于Bert预训练模型对短文本中的实体指称项进行嵌入融合;使用图注意力神经网络对全局知识图谱中候选实体节点进行加权聚合表征,并计算实体指称项与各候选实体之间的相似度得分,实现实体消歧。在CCKS2019数据集上的实验结果表明,基于图注意力神网络的实体消歧模型有效提高了实体消歧效果。
We propose an entity disambiguation method based on graph attention networks for semi-structured knowledge base data. First, a global knowledge graph is constructed from the semi-structured knowledge base, and the entity reference items are embedded by Bert pre-trained model meanwhile. Next, graph attention networks which leverages masked self-attention layers is applyed on candidate entity nodes of global knowledge graph to fetch a vector of node level. Furtherly, we compute similarity scores rank between the entity reference items and the candidate entity to complete the task of entity disambiguation. The experimental results on CCKS2019 dataset achieves state-of-the-art.
| [1] |
李天然, 刘明童, 张玉洁, |
| [2] |
|
| [3] |
刘峤, 李杨, 段宏, |
| [4] |
|
| [5] |
段宗涛, 李菲, 陈柘. 实体消歧综述[J]. 控制与决策, 2021, 36(5): 1025-1039. |
| [6] |
|
| [7] |
BOIŃSKIT, SZYMANŃSKIJ, DUDEKB, |
| [8] |
SINGHK, LYTRAI, RADHAKRISHNASA, |
| [9] |
|
| [10] |
张丹阳, 李楠, 陈翀. 实体链接技术研究述评[J]. 情报工程, 2020, 6(6): 45-55. |
| [11] |
|
| [12] |
HUANGD, WANGJ. An approach on Chinese microblog entity linking combining baidu encyclopedia and word2vec[J]. Procedia Computer Science, 2017, 111: 37-45. |
| [13] |
|
| [14] |
武川, 陆伟. 基于上下文特征的短文本实体链接研究[J]. 情报科学, 2016(2): 144-147. |
| [15] |
|
| [16] |
谭咏梅, 王睿, 李茂林. 基于上下文信息和排序学习的实体链接方法[J]. 北京邮电大学学报, 2015(5): 33-36. |
| [17] |
|
| [18] |
昝红英, 吴泳钢, 贾玉祥, |
| [19] |
|
| [20] |
周鹏程, 武川, 陆伟. 基于多知识库的短文本实体链接方法研究: 以Wikipedia和Freebase为例[J]. 现代图书情报技术, 2016(6): 1-11. |
| [21] |
|
| [22] |
|
| [23] |
FRANCIS—LANDAUM, DURRETTG, KLEIND. Capturing semantic similarity for entity linking with convolutional neural networks[J/OL]. arXiv, 2016. https://arxiv.org/pdf/1604.00734.pdf. |
| [24] |
HUANGH, |
| [25] |
张涛, 刘康, 赵军. 一种基于图模型的维基概念相似度计算方法及其在实体链接系统中的应用[J]. 中文信息学报, 2015, 29(2): 58-67. |
| [26] |
|
| [27] |
周金, 朱永华, 张铁男, |
| [28] |
|
| [29] |
郭宇航, 秦兵, 刘挺, |
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
祁志卫, 王笳辉, 岳昆, |
| [34] |
|
| [35] |
WANGD, CUIP, ZHUW. Structural deep network embedding[C]// Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2016: 1225-1234. |
| [36] |
CAOS, LUW, XUQ. Deep neural networks for learning graph representations[C]// Proceedings of the AAAI Conference on Artificial Intelligence.[S.l.]: AAAI, 2016, 30(1). |
| [37] |
HAMILTONWL, YINGR, LESKOVECJ. Inductive representation learning on large graphs[J/OL]. arXiv, 2017. https://arxiv.org/pdf/1706.02216.pdf. |
| [38] |
KIPFTN, WELLINGM. Semi—supervised classification with graph convolutional networks[J/OL]. arXiv, 2016. https://arxiv.org/abs/1609.02907. |
| [39] |
|
| [40] |
张玉帅, 赵欢, 李博. 基于BERT和BiLSTM的语义槽填充[J]. 计算机科学, 2021, 48(1): 247-252. |
| [41] |
|
| [42] |
DEVLINJ, CHANGMW, LEEK, |
| [43] |
VASWANIA, SHAZEERN, PARMARN, |
| [44] |
金燕, 黄杰. 基于信息熵与词长信息改进的TFIDF算法[J]. 浙江工业大学学报, 2021, 49(2): 203-209. |
| [45] |
|
| [46] |
张晟旗, 王元龙, 李茹, |
| [47] |
|
| [48] |
姚源林, 王树伟, 徐睿峰, |
| [49] |
|
| [50] |
LIS, ZHAOZ, HUR, |
| [51] |
DEMSZKYD, MOVSHOVITZ—ATTIASD, KOJ, |
| [52] |
KIMY. Convolutional neural networks for sentence classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: ACL, 2014: 1746-1751. |
| [53] |
SCHUSTERM, PALIWALKK. Bidirectional recurrent neural networks[J]. IEEE Transactions on Signal Processing, 1997, 45(11): 2673-2681. |
| [54] |
VAN DER MAATENL, HINTONG. Visualizing data using t—SNE[J]. Journal of Machine Learning Research, 2008, 9(11): 2579-2605. |
| [55] |
SONGY, SHIS, LIJ, |
| [56] |
JOULINA, GRAVEE, BOJANOWSKIP, |
| [57] |
TANGD, WEIF, YANGN, |
国家重点研发计划资助项目(2018YFB1402905)
陕西省重点研发计划资助项目(2020GY-227)
陕西省重点研发计划资助项目(2020ZDLGY09-05)
陕西省技术创新引导专项基金(2022PT-49)
/
| 〈 |
|
〉 |