南京大学学报(自然科学版) ›› 2019, Vol. 55 ›› Issue (4): 592600.doi: 10.13232/j.cnki.jnju.2019.04.009
所属专题: 测试专题
Tong Wang1,Wei Wei1,2(),Feng Wang1,2
摘要:
聚类集成的目标是通过集成多个聚类结果来提高聚类算法的稳定性、鲁棒性以及精度.近些年,聚类集成受到了越来越多的关注.现有的集成聚类通常平等地对待所有基聚类,而不考虑它们的重要度.虽然学者们已经在这一方面做出了一些努力,例如使用加权策略来改进共协关系矩阵,但无论是给基聚类加权还是对类重要度评价时都忽略了样本对于其所在类贡献的差异.为此,提出了基于样本对加权共协关系矩阵的聚类集成算法,该算法利用k?means算法产生多个基聚类结果,然后对于其中的每个类再利用k?means算法产生多个小类,并计算去掉样本对所在的小类后类的不确定性变化的程度来评价该样本对的重要度,最后通过层次聚类算法得到聚类结果.在六个UCI数据集上的实验结果表明,基于样本对加权共协关系矩阵的聚类集成算法的性能优于三种经典的基于共协关系矩阵的聚类集成算法.
中图分类号:
1 | Van HamF. Using Multilevel call matrices in large software projects∥IEEE Symposium on Information Visualization 2003. Seattle,WA,USA:IEEE,2003:227-232. |
2 | FurnasG F. Generalized fisheye views∥Proceedings of the SIGCHI Conference on |
Human Factors in Computing Systems. New York,NY,USA:ACM,1986:16-23. | |
3 | SchafferD,ZaoZ P,GreenbergS,et al. Navigating hierarchically clustered networks through fisheye and full?zoom methods. ACM Transaction on Computer?Human Interaction,1996,3(2):162-188. |
4 | 贾云得,吕宏静,刘万春. 鱼眼变形立体图像恢复稠密深度图的方法. 计算机学报,2000,23(12):1332-1336. |
Jia Y D,Lu H J,Liu W C.Fish?eye lens camera stereo vision for dense depth map recovery. Chinese Journal of Computers,2000,23(12):1332-1336. | |
5 | Van HeeK,SidorovaN,VoorhoeveM. Soundness and separability of workflow nets in the stepwise refinement approach∥Proceedings of the 24th International Conference on Application and Theory of Petri Nets. Springer Berlin Heidelberg,2003,2679:337-356. |
6 | StrehlA,GhoshJ. Cluster ensembles?a knowledge reuse framework for combining multiple partitions. The Journal of Machine Learning Research,2003,3:583-617. |
7 | Iam?OnN,BoongoenT,GarrettS,et al. A link?based approach to the cluster ensemble problem. IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(12):2396-2409. |
8 | FredA L N,JainA K. Combining multiple clusterings using evidence accumulation. IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(6):835-850. |
9 | NgA Y,JordanM I,WeissY. On spectral clustering:analysis and an algorithm∥Proceedings of the 14th International Conference on Neural Information Processing Systems. Vancouver,Canada:MIT Press,2002:849-856. |
10 | FernX Z,BrodleyC E. Solving cluster ensemble problems by bipartite graph partitioning∥Proceedings of the 21st International Conference on Machine Learning. New York,NY,USA:ACM,2004:36-43. |
11 | KaufmanL,RousseeuwP J. Finding groups in data:an introduction to cluster analysis. Hoboken:John Wiley and Sons Inc,1990. |
12 | YangY,ChenK. Temporal data clustering via weighted clustering ensemble with different representations. IEEE Transactions on Know?ledge and Data Engineering,2010,23(2):307-320. |
13 | NandaA,PujariA K. Weighted co?clustering based clustering ensemble∥3rd National Conference on Computer Vision,Pattern Recognition,Image Processing and Graphics. Hubli,India:IEEE Press,2011:46-49. |
14 | ZhongC M,YueX D,ZhangZ H,et al. A clustering ensemble:Two?level?refined co?association matrix with path?based transformation. Pattern Recognition,2015,48(8):2699-2709. |
15 | HuangD,WangC D,LaiJ H. Locally weighted ensemble clustering. IEEE Transactions on Cybernetics,2018,48(5):1460-1473. |
16 | 黄栋,王昌栋,赖剑煌等. 基于决策加权的聚类集成算法. 智能系统学报,2016,11(3):418-425. |
Huang D,Wang C D,Lai J H,et al.Clustering ensemble by decision weightingCAAI Transactions on Intelligent Systems,2016,11(3):418-425. | |
17 | BerikovV,PestunovI. Ensemble clustering based on weighted co?association matrices:error bound and convergence properties. Pattern Recognition,2017,63:427-436. |
18 | NazariA,DehghanA,NejatianS,et al. A comprehensive study of clustering ensemble weighting based on cluster quality and diversity. Pattern Analysis and Applications,2019,22(1):133-145. |
19 | RandW M. Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association,1971,66(336):846-850. |
20 | NguyenN,CaruanaR. Consensus clusterings∥Proceedings of the 7th IEEE International Conference on Data Mining. Omaha,NE,USA:IEEE,2007:607-612. |
[1] | 王丽娟,丁世飞,丁玲. 基于迁移学习的软子空间聚类算法[J]. 南京大学学报(自然科学版), 2020, 56(4): 515-523. |
[2] | 陈俊芬,赵佳成,韩洁,翟俊海. 基于深度特征表示的Softmax聚类算法[J]. 南京大学学报(自然科学版), 2020, 56(4): 533-540. |
[3] | 王露,王士同. 改进模糊聚类在医疗卫生数据的Takagi⁃Sugeno模糊模型[J]. 南京大学学报(自然科学版), 2020, 56(2): 186-196. |
[4] | 杨红鑫,杨绪兵,张福全,业巧林. 半监督平面聚类算法设计[J]. 南京大学学报(自然科学版), 2020, 56(1): 9-18. |
[5] | 柴变芳,魏春丽,曹欣雨,王建岭. 面向网络结构发现的批量主动学习算法[J]. 南京大学学报(自然科学版), 2019, 55(6): 1020-1029. |
[6] | 杨鑫, 施虹, 王平心, 徐刚. 基于稳定性的三支聚类[J]. 南京大学学报(自然科学版), 2019, 55(4): 546-552. |
[7] | 徐秀芳, 徐 森, 花小朋, 徐 静, 皋 军, 安 晶. 一种基于t-分布随机近邻嵌入的文本聚类方法[J]. 南京大学学报(自然科学版), 2019, 55(2): 264-271. |
[8] | 贾海宁, 王士同. 面向重尾噪声的模糊规则模型[J]. 南京大学学报(自然科学版), 2019, 55(1): 61-72. |
[9] | 陆慎涛, 葛洪伟, 周 竞. 自动确定聚类中心的移动时间势能聚类算法[J]. 南京大学学报(自然科学版), 2019, 55(1): 143-153. |
[10] | 朱庆峰1,2,葛洪伟1,2*. 快速特征映射优化的流形密度峰聚类[J]. 南京大学学报(自然科学版), 2018, 54(4): 838-. |
[11] | 底晓强1,2*,于力伟,刘 旭,Syed Umer. 一种基于演化博弈的低轨卫星切换算法研究[J]. 南京大学学报(自然科学版), 2018, 54(4): 855-. |
[12] | 帅 惠, 袁晓彤, 刘青山. 基于L0约束的稀疏子空间聚类[J]. 南京大学学报(自然科学版), 2018, 54(1): 23-. |
[13] | 刘 宏,申德荣*,寇 月,聂铁铮,于 戈. 基于实体演化的记录链接算法[J]. 南京大学学报(自然科学版), 2017, 53(6): 991-. |
[14] | 严丽宇1,魏 巍1,2*,郭鑫垚1,崔军彪1. 一种基于带核随机子空间的聚类集成算法[J]. 南京大学学报(自然科学版), 2017, 53(6): 1033-. |
[15] | 孙梦梦1,唐旭清1,2*. 基于粒度空间的最小生成树分类算法[J]. 南京大学学报(自然科学版), 2017, 53(5): 963-. |
|