南京大学学报(自然科学版) ›› 2018, Vol. 54 ›› Issue (4): 838.
朱庆峰1,2,葛洪伟1,2*
Zhu Qingfeng1,2,Ge Hongwei1,2*
摘要: 经典的密度峰聚类不再适用于复杂的流形聚类,因此提出了快速特征映射优化的流形密度峰聚类,用快速特征映射优化的流形距离取代欧式距离,可以更好地反映不同类的点间相似性. 算法首先通过寻找特征点,构造无向特征图,再通过无向特征图计算任意两个点之间的流形距离,最后按照流形距离的大小完成分配. 在人工数据集和UCI数据集上的实验表明,新算法具有更高的准确率.
[1] Jain A K. Data clustering:50 years beyond K-means. Pattern Recognition Letters,2010,31(8):651-666. [2] Park H S,Jun C H. A simple and fast algorithm for K-medoids clustering. Expert Systems with Applications,2009,36(2):3336-3341. [3] 周亚建,徐 晨,李继国. 基于改进CURE聚类算法的无监督异常检测方法. 通信学报,2010,31(7):18-23,32.(Zhou Y J,Xu C,Li J G. Unsupervised anomaly detection method based on improved CURE clustering algorithm. Journal on Communications,2010,31(7):18-23,32.) [4] Ansari S,Chetlur S,Prabhu S,et al. An overview of clustering analysis techniques used in data miniing. International Journal of Emerging Technology and Advanced Engineering,2013,3(12):284-286. [5] Amini A,Wah T Y,Saybani M R,et al. A study of density-grid based clustering algorithms on data streams ∥ Proceedings of 2011 8th International Conference on Fuzzy Systems and Knowledge Discovery. Shanghai,China:IEEE,2011,3:1652-1656. [6] Yang P,Zhu Q S,Huang B. Spectral clustering with density sensitive similarity function. Knowledge-Based Systems,2011,24(5):621-628. [7] 杨 静,高嘉伟,梁吉业等. 基于数据场的改进DBSCAN聚类算法. 计算机科学与探索,2012,6(10):903-911.(Yang J,Gao J W,Liang J Y,et al. An improved DBSCAN clustering algorithm based on data field. Journal of Frontiers of Computer Science and Technology,2012,6(10):903-911.) [8] Kalita H K,Bhattacharya D K,Kar A. A new algorithm for ordering of points to identify clustering structure based on perimeter of triangle:Optics(BOPT) ∥ Proceedings of 2007 International Conference on Advanced Computing and Communications. Guwahati,India:IEEE,2007:523-528. [9] Rodriguez A,Laio A. Clustering by fast search and find of density peaks. Science,2014,344(6191):1492-1496. [10] 马春来,单 洪,马 涛. 一种基于簇中心点自动选择策略的密度峰值聚类算法. 计算机科学,2016,43(7):255-258,280.(Ma C L,Shan H,Ma T. Improved density peaks based clustering algorithm with strategy choosing cluster center automatically. Computer Science,2016,43(7):255-258,280.) [11] Bie R F,Mehmood R,Ruan S S,et al. Adaptive fuzzy clustering by fast search and find of density peaks. Personal and Ubiquitous Computing,2016,20(5):785-793. [12] 李 涛,葛洪伟,苏树智. 基于密度自适应距离的密度峰聚类. 小型微型计算机系统,2017,38(6):1347-1352.( Li T,Ge H W,Su S Z. Density peaks clustering based on density adaptive distance. Journal of Chinese Computer Systems,2017,38(6):1347-1352.) [13] 谢娟英,高红超,谢维信. K近邻优化的密度峰值快速搜索聚类算法. 中国科学:信息科学,2016,46(2):258-280.(Xie J Y,Gao H C,Xie W X. K-nearest neighbors optimized clustering algorithm by fast search and finding the density peaks of a dataset. Scientia Sinica:Informationis,2016,46(2):258-280.) [14] 杨 燕,靳 蕃,Mohamed K. 聚类有效性评价综述. 计算机应用研究,2008,25(6):1630-1632,1638.(Yang Y,Jin F,Mohamed K. Survey of clustering validity evaluation. Application Research of Computers,2008,25(6):1630-1632,1638.) [15] Vinh N X,Epps J,Bailey J. Information theoretic measures for clusterings comparison:Is a correction for chance necessary? ∥ Proceedings of the 26th Annual International Conference on Machine Learning. Montreal,Canada:ACM,2009:1073-1080. [16] Rehm F,Klawonn F,Kruse R. A novel approach to noise clustering for outlier detection. Soft Computing,2007,11(5):489-494. [17] Ding S F,Zhang J,Jia H J,et al. An adaptive density data stream clustering algorithm. Cognitive Computation,2016,8(1):30-38. |
No related articles found! |
|