南京大学学报(自然科学版) ›› 2021, Vol. 57 ›› Issue (2): 177188.doi: 10.13232/j.cnki.jnju.2021.02.002
• • 上一篇
摘要:
k?means和谱聚类是两种应用最广泛的聚类技术.k?means是基于矩阵分解的聚类方法,并且是在数据空间上基于误差极小化的聚类方法.谱聚类是基于图的聚类方法,并且是基于两点在数据空间和特征空间的相似性保持的聚类方法.为了利用两者的优势,提出一种基于乘法更新规则的k?means和谱聚类的联合学习方法,该方法将k?means和谱聚类结合成一个统一的聚类模型,该模型可在单次优化中同时优化k?means和谱聚类的目标;此外,还基于乘法更新规则设计了对聚类中心C与聚类指示器Y进行迭代更新的优化算法.重要的是,在理论上证明了所设计算法的正确性和收敛性.在典型的数据集上进行测试,实验结果表明提出的联合学习算法在聚类精度和标准互信息度指标上都有所提高.
中图分类号:
1 | Tasdemir K,Merenyi E. Exploiting data topology in visualization and clustering of self?organizing maps. IEEE Transactions on Neural Networks,2009,20(4):549-562. |
2 | Kumar A,Rai P,Daumé H. Co?regularized multi?view spectral clustering∥Proceedings of the 24th International Conference on Neural Information Processing Systems. Red Hook,NY,USA:Curran Associates Inc.,2011:1413-1421. |
3 | Ben?Hur A,Horn D,Siegelmann H T,et al. Support vector clustering. Journal of Machine Learning Research,2002,2(12):125-137. |
4 | Girolami M. Mercer kernel?based clustering in feature space. IEEE Transactions on Neural Networks,2002,13(3):780-784. |
5 | Zhang R,Nie F P,Guo M H,et al. Joint learning of fuzzy k?means and nonnegative spectral clustering with side information. IEEE Transactions on Image Processing,2019,28(5):2152-2162,doi:10.1109/TIP.2018.2882925. |
6 | Zhao W L,Deng C H,Ngo C W. K?means:a revisit. Neurocomputing,2018,291:195-206. |
7 | Xu J L,Han J W,Nie F P,et al. Re?weighted discriminatively embedded k?means for multi?view clustering. IEEE Transactions on Image Processing,2017,26(6):3016-3027,doi:10.1109/TIP.2017. 2665976. |
8 | Bauckhage C. K?means clustering is matrix factorization. 2015,arXiv,1512.07548. |
9 | Hu Z X,Nie F P,Wang R,et al. Multi?view spectral clustering via integrating nonnegative embedding and spectral embedding. Information Fusion,2020,55:251-259. |
10 | Kailkhura B,Thiagarajan J J,Rastogi C,et al. A spectral approach for the design of experiments:design,analysis and algorithms. Journal of Machine Learning Research,2018,19(34):1-46. |
11 | Arias?Castro E,Lerman G,Zhang T. Spectral clustering based on local PCA. Journal of Machine Learning Research,2017,18(1):1-57. |
12 | Lee D D,Seung H S. Algorithms for non?negative matrix factorization∥Proceedings of the 13th International Conference on Neural Information Processing Systems. Cambridge,MA,USA:MIT Press,2000,13:535-541. |
13 | Duda R O,Hart P E. Pattern classi?cation and scene analysis. New York:Wiley,1973,512. |
14 | Mao J C,Jain A K. A self?organizing network for hyperellipsoidal clustering. IEEE Transactions on Neural Networks,1996,7(1):16-29. |
15 | Likas A,Vlassis N,Verbeek J J. The global k?means clustering algorithm. Pattern Recognition,2003,36(2):451-461. |
16 | Hansen P,Ngai E,Cheung B K. Analysis of global k?means,an incremental heuristic for minimum sum?of?squares clustering. Journal of Classi?cation,2005,22(2):287-310. |
17 | Shi J B,Malik J M. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence,2000,22(8):888-905,doi:10.1109/34.868688. |
18 | Ding C,He X F,Simon H D. Nonnegative lagrangian relaxation of k?means and spectral clustering∥Gama J,Camacho R,Brazdil P B,et al. Machine Learning:ECML2005. |
Springer Berlin Heidelberg,2005,3720:530-538. | |
19 | Ng A Y,Jordan M I,Weiss Y. On spectral clustering:analysis and an algorithm∥Proceedings of the 14th International Conference on Neural Information Processing Systems:Natural and Synthetic. Cambridge,MA,USA:MIT Press,2001,14:849-856. |
20 | Wan M H,Lai Z H,Ming Z,et al. An improve face representation and recognition method based on graph regularized non?negative matrix factorization. Multimedia Tools and Applications,2019,78(15):22109-22126. |
21 | Lee D D,Seung H S. Learning the parts of objects by non?negative matrix factorization. Nature,1999,401(6755):788-791. |
22 | Ding C,He X F,Simon H D. On the equivalence of nonnegative matrix factorization and spectral clustering∥Proceedings of the SIAM International Conference on Data Mining. Philadelphia,PA,USA:Society for Industrial and Applied Mathematics, 2005: 606-610.. |
23 | Ding C H Q,Li T,Jordan M I. Convex and semi?nonnegative matrix factorizations. IEEE Transactions on Pattern Analysis and Machine Intelligence,2010,32(1):45-55. |
24 | Ding C,Li T,Peng W. Nonnegative matrix factorization and probabilistic latent semantic indexing:equivalence,chi?square statistic,and a hybrid method∥Proceedings of the 21st National Conference on Arti?cial Intelligence. Boston,MA,USA:AAAI Press,2006:342-347. |
25 | Lin C J. On the convergence of multiplicative update algorithms for nonnegative matrix factorization. IEEE Transactions on Neural Networks,2007,18(6):1589-1596. |
26 | Xu W,Liu X,Gong Y H. Document clustering based on non?negative matrix factorization∥Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval. New York,NY,USA:ACM,2003:267-273. |
27 | Jolliffe I T. Principal component analysis. Springer Berlin Heidelberg,1986,231-246. |
28 | Deerwester S,Dumais S T,Furnas G W,et al. Indexing by latent semantic analysis. Journal of the American Society for Information Science,1990,41(6):391-407. |
29 | Zha H Y,He X F,Ding C,et al. Spectral relaxation for k?smeans clustering∥Proceedings of the 14th International Conference on Neural Information Processing Systems:Natural and Synthetic. Cambridge,MA,USA:MIT Press,2001:1057-1064. |
30 | von Luxburg U. A tutorial on spectral clustering. Statistics and Computing,2007,17(4):395-416. |
[1] | 段友祥,柳璠,孙歧峰,李洪强. 基于相带划分的孔隙度预测[J]. 南京大学学报(自然科学版), 2019, 55(6): 934-941. |
[2] | 帅 惠, 袁晓彤, 刘青山. 基于L0约束的稀疏子空间聚类[J]. 南京大学学报(自然科学版), 2018, 54(1): 23-. |
[3] | 孟 娜1,梁吉业1,2*,庞天杰1. 一种基于抽样的谱聚类集成算法 [J]. 南京大学学报(自然科学版), 2016, 52(6): 1090-. |
[4] | 曹江中1*,陈 佩2,戴青云3,凌永权1. 基于Markov随机游走的谱聚类相似图构造方法[J]. 南京大学学报(自然科学版), 2015, 51(4): 772-780. |
[5] | 贾洪杰1,2丁世飞1,2. 基于邻域粗糙集约减的谱聚类算法[J]. 南京大学学报(自然科学版), 2013, 49(5): 619-627. |
[6] | 高尚兵1**,周静波2,严云洋1. 一种新的基于超像素的谱聚类图像分割算法*[J]. 南京大学学报(自然科学版), 2013, 49(2): 169-175. |
[7] | 纳跃跃,于剑** . 一种用于谱聚类图像分割的像素相似度计算方法*[J]. 南京大学学报(自然科学版), 2013, 49(2): 159-168. |
[8] | 刘娜1.2,肖智博1,鲁明羽1**. 基于形态学的单词一文档谱聚类方法* [J]. 南京大学学报(自然科学版), 2012, 48(2): 154-163. |
|