南京大学学报(自然科学版) ›› 2019, Vol. 55 ›› Issue (6): 10201029.doi: 10.13232/j.cnki.jnju.2019.06.014
Bianfang Chai1,Chunli Wei1,Xinyu Cao1,Jianling Wang2()
摘要:
网络结构发现可识别网络多类型聚类模式,但其准确率有待提升.批量主动学习选择质量高的节点集合构造先验,可提升无监督网络结构发现的性能.面向属性网络分类的主动学习BMAL(Batch Mode Active Learning)只考虑链接信息实现网络节点选择,但不能有效选择使模型性能提升至最优的节点集合,且依赖初始人工标注及参数.提出一个新的批量主动学习算法,利用目标函数的子模性迭代选择最优的节点集合.该方法基于未标记节点的不确定性和非冗余影响力选择最优节点集合,不确定性依据节点及其邻居的类隶属度,影响力依据节点的非重叠中心性,两个指标的权重依据熵权法自动确定.人工和真实网络上的实验结果表明,该方法能选择使结构发现性能提升最大的节点集合.
中图分类号:
1 | Newman M E J , Leicht E A . Mixture models and exploratory analysis in networks. Proceedings of the National Academy of Sciences of the United States of America,2007,104(23):9564-9569. |
2 | 柴变芳,贾彩燕,于剑 . 基于概率模型的大规模网络结构发现方法. 软件学报,2014,25(12):2753-2766. |
Chai B F , Jia C Y , Yu J . . Approaches of structure exploratory based on probabilistic models in massive networks. Journal of Software,2014,25(12):2753-2766. | |
3 |
Cheng J J , Leng M W , Li L J ,et al . Active semi?supervised community detection based on must?link and cannot?link constraints. PLOS One,2014,doi:10.1371/journal.pone.0110088 .
doi: 10.1371/journal.pone.0110088 |
4 |
Yang L , Jin D , Wang X ,et al . Active link selection for efficient semi?supervised community detection. Scientific Reports,2015,doi:10.1038/srep09039 .
doi: 10.1038/srep09039 |
5 | Liu Z Y , Huang S J . Active sampling for open?set classification without initial annotation. Proceedings of the AAAI Conference on Artificial Intelligence,2019,33(1):4416-4423. |
6 | Anand D B , Saravanan R , Suji R A . Adaptive batch mode active learning technique using an improved time adaptive support vector machine for classification of remote sensed image applications. Journal of Computational and Theoretical Nanoscience,2017,14(2):1108-1113. |
7 | Shi L X , Zhao Y H , Tang J . Batch mode active learning for networked data. ACM Transactions on Intelligent Systems and Technology,2012,3(2):1-25. |
8 | Nogueira B M , Tomas Y K B , Marcacini R M . Integrating distance metric learning and cluster?level constraints in semi?supervised clustering∥International Joint Conference on Neural Networks. Anchorage,AK,USA:IEEE,2017:4118-4125. |
9 | Goudjil M , Koudil M , Bedda M ,et al . A novel active learning method using SVM for text classification. International Journal of Automation and Computing,2018,15(3):290-298. |
10 | Chen X , Wang T . Combining active learning and semi?supervised learning by using selective label spreading∥2017 IEEE International Conference on Data Mining Workshops (ICDMW). New Orleans,LA,USA:IEEE,2017:850-857. |
11 | Huang S J , Chen J L , Mu X ,et al . Cost?effective active learning from diverse labelers∥26th International Joint Conference on Artificial Intelligence. Melbourne,Australia:IJCAI,2017:1879-1885. |
12 | 陈嶷瑛,柴变芳,李文斌 等 . 基于迭代框架的主动链接选择半监督社区发现算法. 计算机应用,2017,37(11):3085-3089. |
Chen Y Y , Chai B F , Li W B ,et al . Semi?supervised community detection algorithm using active link selection based on iterative framework. Journal of Computer Applications,2017,37(11):3085-3089. | |
13 | Xiong S C , Azimi J , Fern X Z . Active learning of constraints for semi?supervised clustering. IEEE Transactions on Knowledge & Data Engineering,2014,26(1):43-54. |
14 | Leng M W , Huang L , Li L J ,et al . Active semisupervised community detection based on asymmetric similarity measure. International Journal of Modern Physics B,2015,29(13):1550078. |
15 | Moore C , Yan X R , Zhu Y J ,et al . active learning for node classification in assortative and disassortative networks∥ACM SIGKDD Interna?tional Conference on Knowledge Discovery and Data Mining. San Diego,CA,USA:ACM,2011:841-849. |
16 | Ping S Q , Liu D Y , Yang B ,et al . Batch mode active learning for node classification in assortative and disassortative networks. IEEE Access,2018,6:4750-4758. |
17 | Zhu X , Ghahramani Z , Lafferty J . Semi?supervised learning using Gaussian fields and harmonic functions∥Proceeding 20th International Conference on Machine Learning,2003:912-919. |
18 | Newman M E J , Leicht E A . Mixture models and exploratory analysis in networks. Proceedings of the National Academy of Sciences of the United States of America,2007,104(23):9564-9569. |
19 | Jia C Y , Li Y F , Carson M B ,et al . Node attribute?enhanced community detection in complex networks. Scientific Reports,2017,7(1):2626. |
20 | Fagbote E O , Olanipekun E O , Uyi H S . Water quality index of the ground water of bitumen deposit impacted farm settlements using entropy weighted method. International Journal of Environmental Science and Technology,2014,11(1):127-138. |
21 | 赵学华,杨博,陈贺昌 . 一种高效的随机块模型学习算法. 软件学报,2016,27(9):2248-2264. |
Zhao X H , Yang B , Chen H C . Fast learning algorithm for stochastic block model. Journal of Software,2016,27(9):2248-2264. |
[1] | 杨红鑫,杨绪兵,张福全,业巧林. 半监督平面聚类算法设计[J]. 南京大学学报(自然科学版), 2020, 56(1): 9-18. |
[2] | 常瑜1.2** ,梁吉业1, 2,高嘉伟1,2,杨静1·2 . 一种基于Seeds集和成对约束的半监督聚类算法*[J]. 南京大学学报(自然科学版), 2012, 48(4): 405-411. |
[3] | 申 彦**,宋顺林,朱玉全 . 一种基于半监督的大规模数据集聚类算法* [J]. 南京大学学报(自然科学版), 2011, 47(4): 372-382. |
|