南京大学学报(自然科学版) ›› 2020, Vol. 56 ›› Issue (4): 541–548.doi: 10.13232/j.cnki.jnju.2020.04.012

• • 上一篇    下一篇

基于翻译模型的异质重边信息网络链路预测研究

郑建兴1,2(),李沁文1,王素格1,2,李德玉1,2   

  1. 1.山西大学计算机与信息技术学院,太原,030006
    2.计算智能与中文信息处理教育部重点实验室(山西大学),太原,030006
  • 收稿日期:2020-06-28 出版日期:2020-07-30 发布日期:2020-08-06
  • 通讯作者: 郑建兴 E-mail:jxzheng@sxu.edu.cn
  • 基金资助:
    国家自然科学基金(61632011);山西省重点研发计划(国际科技合作)(201803D421024);山西省自然科学基金(201901D211174);山西省高等学校科技创新项目(2020L0001);山西省软科学研究一般项目(2018041015?3)

Research of link prediction based on translation model in heterogeneous multi⁃edge information network

Jianxing Zheng1,2(),Qinwen Li1,Suge Wang1,2,Deyu Li1,2   

  1. 1.School of Computer and Information Technology,Shanxi University,Taiyuan,030006,China
    2.Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education (Shanxi University),Taiyuan,030006,China
  • Received:2020-06-28 Online:2020-07-30 Published:2020-08-06
  • Contact: Jianxing Zheng E-mail:jxzheng@sxu.edu.cn

摘要:

在异质信息网络中,异质节点对象之间具有多元关系,形成异质重边信息网络.知识图谱表示旨在将实体和关系在低维的向量空间进行嵌入,可以用来学习异质重边信息网络中实体间的多元关系.首先通过注意力机制对异质重边信息网络中的多元关系进行融合表示,进而将异质节点的类型信息进行多元关系融合空间的映射,在多元关系融合空间上提出基于翻译的异质重边嵌入模型,用以学习异质节点之间的链路关系.最后,在MovieLens100k电影数据集上进行了异质节点多元关系的链路预测实验.实验结果表明,在异质重边信息网络中,基于改进的翻译模型在实体间链路预测性能方面要优于传统的知识表示方法,可以有效地提升链路预测的精度.

关键词: 异质重边信息网路, 链路预测, 翻译模型, 表示学习

Abstract:

In heterogeneous information network,heterogeneous nodes have multiple relations which can form heterogeneous multi?edge information network. Knowledge graph?based representation aims to embed object entities and relations into a low?dimensional vector space which can be used to learn the multiple relations between entities in heterogeneous multi?edge information network. In this paper,we first leverage fused representation of multiple relations for heterogeneous multi?edge information network in terms of attention mechanism. Then,projected matrices are adopted to map the types of heterogeneous nodes into fused spaces of multiple relations. More,in the fusion representation space of multiple relations,translation?based heterogeneous multi?edge embedding model is proposed to learn the link relations among heterogeneous nodes. Finally,link prediction experiments of heterogeneous multi?edge relations are carried out on MovieLens100k dataset. The experimental results demonstrate that the novel translation model is superior to traditional knowledge representation methods at the aspect of link prediction performance,which can effectively improve the accuracy of link prediction.

Key words: heterogeneous multi?edge information network, link prediction, translation model, representation learning

中图分类号: 

  • TP391

图1

IMDB数据集的异质重边信息网络实例"

图2

基于翻译的异质重边嵌入模型框架"

表1

异质重边数据集简介"

DatasetsVerticesEdgesTrainTestValid
MovieLens100k110369545564664726

表2

不同方法在单一、多元关系下的粗糙结果"

MethodMRRMRHits@5Hits@3Hits@1
TransE0.052659.13700.04370.01360.0045
TransH0.061355.50300.07380.02710.0030
TransR0.064260.69010.07440.03720.0041
TransD0.071149.85240.07230.03310.0075
TransE+Mule0.72253.95930.92920.85240.5633
TransH+Mule0.71327.36300.90570.85300.5627
TransR+Mule0.70077.39910.90060.82980.5459
TransD+Mule0.707110.39120.90290.82540.5431
TransHME0.75117.44730.93220.84340.6250

表3

不同方法在单一、多元关系下的过滤结果"

MethodMRRMRHits@5Hits@3Hits@1
TransE0.159446.73190.21990.15210.0693
TransH0.161744.21390.23490.15360.0693
TransR0.151149.37330.21490.15010.0550
TransD0.179139.19730.23040.16110.0889
TransE+Mule0.78373.62350.94580.89610.6581
TransH+Mule0.77076.91870.92520.89460.6416
TransR+Mule0.76826.89760.92470.86750.6521
TransD+Mule0.76639.85540.92300.86980.6409
TransHME0.81907.05120.95030.89460.7304

表4

TransHME在不同γ下的过滤结果"

γMRRMRHits@5Hits@3Hits@1
40.63978.12050.89460.82230.4398
50.73736.09790.91570.83890.6054
60.79274.76050.92620.87950.6928
70.81907.05120.95030.89460.7304
80.80855.77710.93220.88860.7189

表5

TransHME在不同λ下的过滤结果"

λMRRMRHits@5Hits@3Hits@1
0.010.80284.02260.93670.89310.7108
0.10.81907.05120.95030.89460.7304
1.00.80635.72130.92920.88890.7139

表6

TransHME在不同嵌入维度下的过滤结果"

DMRRMRHits@5Hits@3Hits@1
320.76107.40060.89310.85540.6611
640.78349.980710.91460.86500.6873
1000.81907.05120.95030.89460.7304
1 李翠平,蓝梦微,邹本友等. 大数据与推荐系统. 大数据,2015,1(3):23-35.
Li C P,Lan M W,Zou B Y,et al. Big data and recommendation system. Big Data Research,2015,1(3):23-35.
2 齐金山,梁循,李志宇等. 大规模复杂信息网络表示学习:概念、方法与挑战. 计算机学报,2018,41(10):2394-2420.
Qi J S,Liang X,Li Z Y,et al. Representation learning of large?scale complex information network:concepts,methods and challenges. Chinese Journal of Computers,2018,41(10):2394-2420.
3 李宇琦,陈维政,闫宏飞等. 基于网络表示学习的个性化商品推荐. 计算机学报,2019,42(8):1767-1778.
Li Y Q,Chen W Z,Yan H F,et al. Learning graph?based embedding for personalized product re?commendation. Chinese Journal of Computers,2019,42(8):1767-1778.
4 Zheng J X,Wang S G,Li D Y,et al. Personalized recommendation based on hierarchical interest overlapping community. Information Sciences,2019,479:55-75.
5 Shadbolt N,Hall W,Bernerslee T. The semantic web revisited. IEEE Intelligent Systems,2006,21(3):96-101.
6 王昊奋,漆桂林,陈华钧. 知识图谱:方法、实践与应用. 北京:电子工业出版社,2019:15-30.
7 Bordes A,Usunier N,Garcia?Durán A,et al. Translating embeddings for modeling multi?relational data∥Proceedings of the 26th International Conference on Neural Information Processing Systems. Lake Tahoe,NV,USA:MIT Press,2013:2787-2795.
8 Ji G L,He S Z,Xu L H,et al. Knowledge graph embedding via dynamic mapping matrix∥Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Beijing,China:Association for Computational Linguistics,2015:687-696.
9 Wang Z,Zhang J W,Feng J L,et al. Knowledge graph embedding by translating on hyperplanes∥Proceedings of 28th AAAI Conference on Arti?cial Intelligence. Québec City,Canada:AAAI,2014:1112-1119.
10 Lin Y K,Liu Z Y,Sun M S,et al. Learning entity and relation embeddings for knowledge graph completion∥Proceedings of 29th AAAI Conference on Arti?cial Intelligence. Austin,TX,USA:AAAI,2015:2181-2187.
11 Vaswani A,Shazeer N,Parmar N,et al. Attention is all you need∥Proceedings of the 31st International Conference on Neural Information Processing System. Long Beach,CA,USA:NIPS,2017:1-12.
12 Goyal P,Ferrara E. Graph embedding techniques,applications and performance:a survey. Knowledge?Based Systems,2018,151:78-94.
13 Roweis S T,Saul L K. Nonlinear dimensionality reduction by locally linear embedding. Science,2000,290(5500):2323-2326.
14 Luo D J,Huang H,Nie F P,et al. Cauchy graph embedding∥Proceedings of the 28th International Conference on International Conference on Machine Learning. Bellevue,WA,USA:ACM,2011:553-560.
15 Perozzi B,Al?Rfou R,Skiena S. Deepwalk:online learning of social representations∥Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York,NY,USA:ACM,2014:701-710.
16 Tu C C,Zhang W C,Liu Z Y,et al. Max?margin deepwalk:discriminative learning of network representation∥Proceedings of the 25th International Joint Conference on Artificial Intelligence. New York,NY,USA:Morgan Kaufmann,2016:3889-3895.
17 Grover A,Leskovec J. Node2vec:scalable feature learning for networks∥Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco,CA,USA:ACM,2016:855-864.
18 Tang J,Qu M,Wang M Z,et al. Line:large?scale information network embedding∥Proceedings of the 24th International Conference on World Wide Web. Florence,Italy:ACM,2015:1067-1077.
19 Wang D X,Cui P,Zhu W W. Structural deep network embedding∥Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco,CA,USA:ACM,2016:1225-1234.
20 Dong Y X,Chawla N V,Swami A. Metapath2vec:scalable representation learning for heterogeneous networks∥Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Nova Scotia,Canada:ACM,2017:135-144.
21 Wang X,Ji H Y,Shi C,et al. Heterogeneous graph attention network∥The World Wide Web Conference. San Francisco,CA,USA:ACM,2019:2022-2032.
22 Xiao H,Huang M L,Zhu X Y. TransG:A generative model for knowledge graph embedding∥Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Berlin,Germany:ACL,2016:2316-2325.
23 He S Z,Liu K,Ji G L,et al. Learning to represent knowledge graphs with gaussian embedding∥Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. Melbourne,Australia:ACM,2015:623-632.
24 Tu C C,Zhang Z Y,Liu Z Y,et al. Transnet:translation?based network representation learning for social relation extraction∥Proceedings of the 26th International Joint Conference on Artificial Intelligence. Melbourne,Australia:Morgan Kaufmann,2017:2864-2870.
25 Wang X,Wang D X,Xu C N,et al. Explainable reasoning over knowledge graphs for recommendation∥Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Honolulu,HI,USA:AAAI,2019:5329-5336.
26 Wang H W,Zhang F Z,Wang J L,et al. RippleNet:Propagating user preferences on the knowledge graph for recommender systems∥Proceedings of the 27th ACM International Conference on Information and Knowledge Management. Turin,Italy:ACM,2018:417-426.
27 Chen C,Zhang M,Ma W Z,et al. Ef?cient heterogeneous collaborative filtering without negative sampling for recommendation∥Proceedings of the 34th AAAI Conference on Artificial Intelligence. New York,NY,USA:AAAI,2020:19-26
28 Sun Y Z,Han J W. Mining heterogeneous information networks:principles and methodologies. San Rafael:Morgan & Claypool Publishers,2012,1-159.
[1] 钱付兰, 黄鑫, 赵姝, 张燕平. 基于路径相互关注的网络嵌入算法[J]. 南京大学学报(自然科学版), 2019, 55(4): 573-580.
[2] 齐小刚,张 权*. 认知无线电网络中一种基于主用户活动性预测的拓扑控制方法[J]. 南京大学学报(自然科学版), 2018, 54(4): 848-.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 段新春,施 斌,孙梦雅,魏广庆,顾 凯,冯晨曦. FBG蒸发式湿度计研制及其响应特性研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1075 -1084 .
[2] 许 林,张 巍*,梁小龙,肖 瑞,曹剑秋. 岩土介质孔隙结构参数灰色关联度分析[J]. 南京大学学报(自然科学版), 2018, 54(6): 1105 -1113 .
[3] 阚 威, 李 云. 基于LSTM的脑电情绪识别模型[J]. 南京大学学报(自然科学版), 2019, 55(1): 110 -116 .
[4] 顾健伟, 曾 诚, 邹恩岑, 陈 扬, 沈 艺, 陆 悠, 奚雪峰. 基于双向注意力流和自注意力结合的机器阅读理解[J]. 南京大学学报(自然科学版), 2019, 55(1): 125 -132 .
[5] 关立强, 祝伟光, 李义丰. Lamb波时间反转椭圆定位和层析成像混合技术研究[J]. 南京大学学报(自然科学版), 2019, 55(2): 191 -201 .
[6] 狄 岚, 何锐波, 梁久祯. 基于可能性聚类和卷积神经网络的道路交通标识识别算法[J]. 南京大学学报(自然科学版), 2019, 55(2): 238 -250 .
[7] 李嘉明, 邹 勇, 郑 浩, 魏钟波, 杨柳燕, 缪爱军. 养殖塘生态系统重金属污染状况与风险评价[J]. 南京大学学报(自然科学版), 2019, 55(2): 272 -281 .
[8] 曹 群,陈蓓蓓,宫辉力,周超凡,罗 勇,高明亮,王 旭,史 珉,赵笑笑,左俊杰. 基于SBAS和IPTA技术的京津冀地区地面沉降监测[J]. 南京大学学报(自然科学版), 2019, 55(3): 381 -391 .
[9] 吕海敏,沈水龙,严学新,史玉金,许烨霜. 上海地面沉降对轨道交通安全运营风险评估[J]. 南京大学学报(自然科学版), 2019, 55(3): 392 -400 .
[10] 严学新,杨天亮,林金鑫,黄鑫磊,王建秀. 超深基坑减压降水引发地面沉降的估算及其影响因素分析[J]. 南京大学学报(自然科学版), 2019, 55(3): 401 -408 .