南京大学学报(自然科学版) ›› 2019, Vol. 55 ›› Issue (4): 573–580.doi: 10.13232/j.cnki.jnju.2019.04.007

所属专题: 测试专题

• • 上一篇    下一篇

基于路径相互关注的网络嵌入算法

钱付兰(),黄鑫,赵姝,张燕平   

  1. 安徽大学计算机科学与技术学院,合肥,230601
  • 收稿日期:2019-05-28 出版日期:2019-07-30 发布日期:2019-07-23
  • 通讯作者: 钱付兰 E-mail:qianfulan@hotmail.com
  • 基金资助:
    国家重点研究与发展项目(2017YFB1401903);国家自然科学基金(61673020, 61702003,61876001);安徽省自然科学基金(1808085MF175)

Path⁃based mutual attention algorithm for network embedding

Fulan Qian(),Xin Huang,Shu Zhao,Yanping Zhang   

  1. School of Computer Science and Technology, Anhui University, Hefei, 230601, China
  • Received:2019-05-28 Online:2019-07-30 Published:2019-07-23
  • Contact: Fulan Qian E-mail:qianfulan@hotmail.com

摘要:

网络嵌入,或者称为网络表示学习,旨在将网络中的节点映射到表示空间中,生成低维稠密的向量,从而在保留网络结构信息的前提下对网络中的节点进行表示,而后通过已有的机器学习方法解决诸如链接预测、节点分类、社团发现和网络可视化等下游任务.随机游走算法可以很好地探索网络中节点的局部结构,然而之前的基于随机游走的表示学习算法只能为节点产生一种角色嵌入,没有考虑到和不同邻居进行交互时节点扮演的不同角色嵌入.因此,提出一种基于路径相互关注的网络嵌入算法,使用节点随机游走产生的上下文信息,通过注意力机制为每个节点生成上下文相互关注的节点嵌入.在真实数据集上的实验结果表明,与三个经典的网络嵌入算法相比,该算法具有更好的表现.

关键词: 网络表示学习, 随机游走, 相互关注, 注意力机制

Abstract:

Network embedding,or network representation learning,aims to map nodes in the network into the representation space and generate low?dimensional dense vectors to represent the nodes in the network while preserving the network structure information,then solve downstream tasks such as link prediction,node classification,community discovery and network visualization through existing machine learning methods. The random walk algorithm can well explore the structure of nodes in the network. However,the previous representation learning algorithm based on random walk can only generate one kind of embedding for one node,without considering that the nodes play different roles when interacting with different neighbors. Therefore,this paper proposes a network embedding algorithm based on mutual attention of paths. Through the context information generated by random walks of nodes,each node generates a node embedding in which contexts are of mutual attention. And our algorithm has better performance than the three classic network embedding algorithms.

Key words: network representation learning, random walk, mutual attention, attention mechanism

中图分类号: 

  • TP391

图1

基于路径的相互关注机制"

图2

PMANE算法框架"

表1

三个真实网络"

数据集CoraHepThNetScience
Vertices227710381461
Edges521419902742
Typedirecteddirectedundirected
Labels7--

表2

Cora数据集下AUC值"

训练比例15%25%35%45%55%65%75%85%95%
Deepwalk56.063.070.275.580.185.285.387.890.3
LINE55.058.666.473.077.682.885.688.489.3
Node2vec55.962.466.175.078.781.685.987.388.2
PMANE62.266.470.378.281.986.788.090.090.5

表3

HepTh数据集下AUC值"

训练比例15%25%35%45%55%65%75%85%95%
Deepwalk55.266.070.075.781.383.387.688.988.0
LINE53.760.466.573.978.583.887.587.787.6
Node2vec57.163.669.976.284.387.388.489.289.2
PMANE59.667.074.381.585.289.489.591.192.9

表4

NetScience数据集下AUC值"

训练比例15%25%35%45%55%65%75%85%95%
Deepwalk80.683.590.194.595.397.498.298.497.6
LINE77.881.588.492.094.296.897.898.498.7
Node2vec80.585.589.893.895.097.397.498.097.2
PMANE81.885.691.894.696.297.598.398.698.8

图3

不同比例下PMANE对基线算法的提升百分率"

1 HamiltonW L,YingR,LeskovecJ. Representa?tion learning on graphs:methods and applications. 2017,arXiv:1709.05584.
2 涂存超,杨成,刘知远等. 网络表示学习综述. 中国科学:信息科学,2017,47(8):980-996.
TuC C,YangC,LiuZ Y,et al. Network representation learning:an overview. Scientia Sinica(Informationis),2017,47(8):980-996.
3 ChenH C,PerozziB,Al?RfouR,et al. A tutorial on network embeddings. 2018,arXiv:1808.02590.
4 ZhangD K,YinJ,ZhuX Q,et al. Network representation learning: a survey. IEEE Transactions on Big Data,2018,doi:10.1109/TBDATA.2018.2850013.
doi: 10.1109/TBDATA.2018.2850013
5 CaiH Y,ZhengV W,ChangK. A comprehensive survey of graph embedding:problems,techniques,and applications. IEEE Transactions on Know?ledge and Data Engineering,2018,30(9):1616-1637.
6 TuC C,LiuH,LiuZ Y,et al. Cane:context?aware network embedding for relation modeling∥Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Vancouver,Canada:Association for Computa?tional Linguistics,2017:1722-1731.
7 YangC,LiuZ Y,ZhaoD L,et al. Network representation learning with rich text information∥Proceedings of the 24th International Conference on Artificial Intelligence. Buenos Aires,Argentina:AAAI Press,2015:2111-2117.
8 SunX F,GuoJ,DingX,et al. A general framework for content?enhanced network repre?sentation learning. 2016,arXiv:1610.02906.
9 RoweisS T,SaulL K. Nonlinear dimensionality reduction by locally linear embedding. Science,2000,290(5500):2323-2326.
10 BelkinM,NiyogiP. Laplacian eigenmaps and spectral techniques for embedding and clustering∥Proceedings of the 14th International Conference on Neural Information Processing Systems. Vancouver,Canada: MIT Press,2001:585-591.
11 WangD,CuiP,ZhuW. Structural deep network embedding∥Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco,CA,USA:ACM,2016:1225-1234.
12 MikolovT,SutskeverI,ChenK,et al. Distributed representations of words and phrases and their compositionality∥Proceedings of the 26th International Conference on Neural Information Processing Systems. Lake Tahoe,NV,USA:MIT Press,2013:3111-3119.
13 PerozziB,Al?RfouR,SkienaS. Deepwalk:online learning of social representations∥Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York,NY,USA:ACM,2014:701-710.
14 TangJ,QuM,WangM Z,et al. Line:large?scale information network embedding∥Proceedings of the 24th International Conference on World Wide Web. Florence,Italy:ACM,2015:1067-1077.
15 GroverA,LeskovecJ. node2vec:scalable feature learning for networks∥Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco,CA,USA:ACM,2016:855-864.
16 KalchbrennerN,GrefenstetteE,BlunsomP. A convolutional neural network for modelling sentences. 2014,arXiv:1404.2188.
17 HuB T,LuZ D,LiH,et al. Convolutional neural network architectures for matching natural language sentences∥Proceedings of the 27th International Conference on Neural Information Processing Systems. Montreal,Canada:MIT Press,2014:2042-2050.
18 Dos SantosC,TanM,XiangB,et al. Attentive pooling networks. 2016,arXiv:1602.03609.
19 KingmaD P,BaJ. Adam:a method for stochastic optimization. 2014,arXiv:1412.6980.
20 McCallumA K,NigamK,RennieJ,et al. Automating the construction of internet portals with machine learning. Information Retrieval,2000,3(2):127-163.
21 LeskovecJ,KleinbergJ,FaloutsosC. Graphs over time:densification laws,shrinking diameters and possible explanations∥Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining. Chicago,IL,USA:ACM,2005:177-187.
22 NewmanM E J. Finding community structure in networks using the eigenvectors of matrices. Physical Review E,2006,74(3):036104,doi:10.1103/PhysRevE.74.036104.
doi: 10.1103/PhysRevE.74.036104
[1] 朱伟,张帅,辛晓燕,李文飞,王骏,张建,王炜. 结合区域检测和注意力机制的胸片自动定位与识别[J]. 南京大学学报(自然科学版), 2020, 56(4): 591-600.
[2] 徐扬,周文瑄,阮慧彬,孙雨,洪宇. 基于层次化表示的隐式篇章关系识别[J]. 南京大学学报(自然科学版), 2019, 55(6): 1000-1009.
[3] 郑文萍,刘韶倩,穆俊芳. 一种基于相对熵的随机游走相似性度量模型[J]. 南京大学学报(自然科学版), 2019, 55(6): 984-999.
[4] 曹欣怡,李鹤,王蔚. 基于语料库的语音情感识别的性别差异研究[J]. 南京大学学报(自然科学版), 2019, 55(5): 758-764.
[5] 顾健伟, 曾 诚, 邹恩岑, 陈 扬, 沈 艺, 陆 悠, 奚雪峰. 基于双向注意力流和自注意力结合的机器阅读理解[J]. 南京大学学报(自然科学版), 2019, 55(1): 125-132.
[6] 朱 尧, 朱启海, 毛晓蛟, 杨育彬. 基于有监督显著性检测的目标跟踪[J]. 南京大学学报(自然科学版), 2017, 53(4): 747-.
[7] 梁晋1,2,梁吉业1,2*,赵兴旺1,2. 一种面向大规模社会网络的社区发现算法[J]. 南京大学学报(自然科学版), 2016, 52(1): 159-166.
[8] 曹江中1*,陈 佩2,戴青云3,凌永权1. 基于Markov随机游走的谱聚类相似图构造方法[J]. 南京大学学报(自然科学版), 2015, 51(4): 772-780.
[9] 施静静,张鹏,阮雅端,陈启美*. 多媒体信息网络相似度计算方法研究[J]. 南京大学学报(自然科学版), 2015, 51(2): 290-296.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 段新春,施 斌,孙梦雅,魏广庆,顾 凯,冯晨曦. FBG蒸发式湿度计研制及其响应特性研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1075 -1084 .
[2] 汪 勇,刘 瑾*,宋泽卓,白玉霞,王琼亚,祁长青,孙少锐. 高分子稳定剂加固河道边坡表层砂土室内试验研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1095 -1104 .
[3] 卢 毅,于 军,龚绪龙,王宝军,魏广庆,季峻峰. 基于DFOS的连云港第四纪地层地面沉降监测分析[J]. 南京大学学报(自然科学版), 2018, 54(6): 1114 -1123 .
[4] 孙 玫,张 森,聂培尧,聂秀山. 基于朴素贝叶斯的网络查询日志session划分方法研究[J]. 南京大学学报(自然科学版), 2018, 54(6): 1132 -1140 .
[5] 胡 淼,王开军,李海超,陈黎飞. 模糊树节点的随机森林与异常点检测[J]. 南京大学学报(自然科学版), 2018, 54(6): 1141 -1151 .
[6] 周星星,张海平,吉根林. 具有时空特性的区域移动模式挖掘算法[J]. 南京大学学报(自然科学版), 2018, 54(6): 1171 -1182 .
[7] 赵天龙,刘 峥,韩慧健,张彩明. 基于二分图的个性化图像标签推荐算法[J]. 南京大学学报(自然科学版), 2018, 54(6): 1193 -1205 .
[8] 王伯伟, 聂秀山, 马林元, 尹义龙. 基于语义相似度的无监督图像哈希方法[J]. 南京大学学报(自然科学版), 2019, 55(1): 41 -48 .
[9] 张滨,张胜,陈建飞. 毫米波综合孔径辐射计的压缩感知成像方法研究[J]. 南京大学学报(自然科学版), 2019, 55(5): 718 -724 .
[10] 刘颖,刘甲,杨国强,王苏,张志炳,周政,周爱东. 相转移催化H2O2氧化环己烯合成环氧环己烷的工艺研究[J]. 南京大学学报(自然科学版), 2019, 55(5): 850 -858 .