南京大学学报(自然科学版) ›› 2021, Vol. 57 ›› Issue (2): 189196.doi: 10.13232/j.cnki.jnju.2021.02.003
• • 上一篇
Changlong Shao, Tongfeng Sun, Shifei Ding()
摘要:
聚类集成的目的是通过集成多个不同的基聚类来生成一个更好的聚类结果,近年来研究者已经提出多个聚类集成算法,但是目前仍存在的局限性是这些算法大多把每个基聚类和每个簇都视为同等重要,使聚类结果很容易受到低质量基聚类和簇的影响.为解决这个问题,研究者提出一些给基聚类加权的方法,但大多把基聚类看作一个整体而忽视其中每个簇的差异.受到信息熵的启发,提出一种基于信息熵加权的聚类集成算法.算法首先对每个簇的不稳定性进行衡量,然后提出一种基于信息熵的簇评价指标,进而从簇层面进行加权,在对加权矩阵进行划分后得到最终的聚类结果.该算法有两个主要优点:第一,提出了一个有效的簇评价性指标;第二,从比基聚类层面更细化的簇层面进行加权.一系列的实验证明了该算法的有效性和鲁棒性.
中图分类号:
1 | Ding S F,Jia H J,Du M J,et al. A semi?supervised approximate spectral clustering algorithm based on HMRF model. Information Sciences,2018,429:215-228. |
2 | Cong L,Ding S F,Wang L J,et al. Image segmentation algorithm based on superpixel clustering. IET Image Processing,2018,12(11):2030-2035. |
3 | Saini N,Saha S,Bhattacharyya P. Automatic scientific document clustering using self?organized multi?objective differential evolution. Cognitive Computation,2019,11(2):271-293,doi:10.1007/s12559-018-9611-8. |
4 | Thanh N D,Ali M,Son L H. A novel clustering algorithm in a neutrosophic recommender system for medical diagnosis. Cognitive Computation,2017,9(4):526-544. |
5 | Fred A L N,Jain A K. Combining multiple clusterings using evidence accumulation. IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(6):835-850. |
6 | Li E R,Li Q Y,Geng Y A,et al. Ensemble clustering using maximum relative density path∥2018 IEEE International Conference on Big Data and Smart Computing. Shanghai,China:IEEE,2018,doi:10.1109/BigComp.2018.00036. |
7 | Rathore P,Bezdek J C,Erfani S M,et al. Ensemble fuzzy clustering using cumulative aggregation on random projections. IEEE Transactions on Fuzzy Systems,2018,26(3):1510-1524. |
8 | Zhong C M,Hu L Y,Yue X D,et al. Ensemble clustering based on evidence extracted from the co?association matrix. Pattern Recognition,2019,92:93-106. |
9 | Strehl A,Ghosh J. Cluster ensembles:a knowledge reuse framework for combining multiple partitions. Journal of Machine Learning Research,2003,3:583-617. |
10 | Fern X Z,Brodley C E. Solving cluster ensemble problems by bipartite graph partitioning∥Proceedings of the 21st International Conference on Machine Learning. New York,NY,USA:ACM,2004,doi:10.1145/1015330.1015414. |
11 | Shi J B,Malik J. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence,2000,22(8):888-905. |
12 | Huang D,Wang C D,Wu J S,et al. Ultra?scalable spectral clustering and ensemble clustering. IEEE Transactions on Knowledge and Data Engineering,2019,32(6):1212-1226,doi:10.1109/TKDE.2019.2903410. |
13 | Topchy A,Jain A K,Punch W. Clustering ensembles:models of consensus and weak partitions. IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(12):1866-1881. |
14 | Cristofor D,Simovici D. Finding median partitions using information?theoretical?based genetic algorithms. Journal of Universal Computer Science,2002,8(2):153-172. |
15 | Wu J J,Liu H F,Xiong H,et al. K?means?based consensus clustering:a unified view. IEEE Transactions on Knowledge and Data Engineering,2014,27(1):155-169. |
16 | Huang D,Lai J H,Wang C D. Ensemble clustering using factor graph. Pattern Recognition,2016,50:131-142. |
17 | Yu Z W,Li L,Gao Y J,et al. Hybrid clustering solution selection strategy. Pattern Recognition,2014,47(10):3362-3375. |
18 | Huang D,Wang C D,Lai J H. Locally weighted ensemble clustering. IEEE Transactions on Cybernetics,2017,48(5):1460-1473. |
19 | Li Z G,Wu X M,Chang S F. Segmentation using superpixels:a bipartite graph partitioning approach∥2012 IEEE Conference on Computer Vision and Pattern Recognition. Providence,RI,USA:IEEE,2012:789-796. |
20 | Huang D,Wang C D,Lai J H. LWMC:a locally weighted meta?clustering algorithm for ensemble clustering∥International Conference on Neural Information Processing. Springer Berlin Heidelberg,2017:167-176. |
21 | He N N,Huang D. Meta?cluster based consensus clustering with local weighting and random walking∥International Conference on Intelligent Science and Big Data Engineering. Springer Berlin Heidelberg,2019:266-277. |
22 | Iam?On N,Boongoen T,Garrett S M,et al. A link?based approach to the cluster ensemble problem. IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(12):2396-2409. |
23 | Huang D,Lai J H,Wang C D. Combining multiple clusterings via crowd agreement estimation and multi?granularity link analysis. Neurocomputing,2015,170:240-250. |
24 | Santos J M,Embrechts M. On the use of the adjusted rand index as a metric for evaluating supervised classification∥International Conference on Artificial Neural Networks. Springer Berlin Heidelberg,2009:175-184. |
25 | Vinh L T,Lee S,Park Y T,et al. A novel feature selection method based on normalized mutual information. Applied Intelligence,2012,37(1):100-120. |
[1] | 马明寅, 狄岚, 梁久祯. 基于图像校正和模板分割的纺织品瑕疵检测[J]. 南京大学学报(自然科学版), 2021, 57(1): 29-41. |
|