南京大学学报(自然科学版) ›› 2019, Vol. 55 ›› Issue (1): 92–101.doi: 10.13232/j.cnki.jnju.2019.01.009

• • 上一篇    下一篇

一种嵌入样本流形结构与标记相关性的多标记降维算法

马宏亮,万建武*,王洪元   

  1. 常州大学信息科学与工程学院,常州,213164
  • 接受日期:2018-12-08 出版日期:2019-02-01 发布日期:2019-01-26
  • 通讯作者: 万建武, E-mail:jianwuwan@gmail.com E-mail:jianwuwan@gmail.com
  • 基金资助:
    国家自然科学基金(61502058,61572085),江苏省高校自然科学基金(15KJB520002)

A multi-label dimensionality reduction algorithm embedded sample manifold structure and label correlation

Ma Hongliang,Wan Jianwu*,Wang Hongyuan   

  1. School of Information Science and Engineering,Changzhou University,Changzhou,213164,China
  • Accepted:2018-12-08 Online:2019-02-01 Published:2019-01-26
  • Contact: Wan Jianwu, E-mail:jianwuwan@gmail.com E-mail:jianwuwan@gmail.com

摘要: 现有的多标记降维算法常通过学习标记相关性构建样本间的相似关系,进而提高学习系统的性能. 然而,在实际应用中,样本的标记信息可能存在噪声,且部分标记信息可能缺失,因此由样本的标记信息学得的标记相关性可能不准确,无法有效挖掘样本间的相似关系. 为了解决该问题,从样本的特征空间与标记空间两个方面构建样本间的相似关系. 在利用标记空间学习标记相关性的同时,通过引入特征空间中的概率超图模型,提出一种嵌入样本流形结构与标记相关性的多标记降维算法. 在十个多标记数据集和六种评价准则上的实验结果证明了所提算法的有效性.

关键词: 多标记降维, 标记相关性, 流形结构, 超 图

Abstract: The existing multi-label dimensionality reduction algorithm often constructs the similarity relationship among samples by learning the correlation between labels to improve the performance of the learning system. However,in practical applications,the label information of samples may be noisy,and some of the label information may be missing. Therefore,label correlation learned from label information of samples may be inaccurate,and the similarity relationship among samples cannot be excavated effectively. To solve this problem,this paper tries to construct similarity between samples from two aspects:feature space and label space. In particular,learning label correlation from label space,meanwhile,by introducing the probabilistic hypergraph model into the feature space,a multi-label dimensionality reduction algorithm embedded sample manifold structure and label correlation is proposed. Experimental results on ten multi-label datasets and six evaluation criterias demonstrate the effectiveness of the proposed algorithm.

Key words: multi-label dimensionality reduction, label correlation, manifold structure, hypergraph

中图分类号: 

  • TP391
[1] Zhu X F,Li X L,Zhang S C. Block-row sparse multiview multilabel learning for image classification. IEEE Transactions on Cybernetics,2016,46(2):450-461.
[2] Huang J,Li G R,Huang Q M,et al. Learning label-specific features and class-dependent labels for multi-label classification. IEEE Transactions on Knowledge and Data Engineering,2016,28(12):3309-3323.
[3] Tahir M A,Bouridane A,Kittler J. Dimen-sionality reduction using stacked kernel discriminant analysis for multi-label classification ∥ Proceedings of the 11th International Workshop Multiple Classifier Systems. Springer Berlin Heidelberg,2013:283-294.
[4] Huang J,Li G R,Huang Q M,et al. Joint feature selection and classification for multilabel learning. IEEE Transactions on Cybernetics,2018,48(3):876-889.
[5] Sun L,Ji S W,Ye J P. Multi-label dimensionality reduction. Boca Raton:CRC Press,2013:20-24.
[6] Huang S J,Zhou Z H. Multi-label learning by exploiting label correlations locally ∥ Proceedings of the 26th AAAI Conference on Artificial Intelligence. Ontario,Canada:ACM,2012:949-955.
[7] Lin Y J,Hu Q H,Liu J H,et al. Multi-label feature selection based on max-dependency and min-redundancy. Neurocomputing,2015,168:92-103.
[8] Sun L,Ji S W,Ye J P. Canonical correlation analysis for multilabel classification:A least-squares formulation,extensions,and analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(1):194-200.
[9] Sun L,Ji S W,Ye J P. Hypergraph spectral learning for multi-label classification ∥ Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Las Vegas,NV,USA:ACM,2008:668-676.
[10] Lo H Y,Wang J C,Wang H M,et al. Cost-sensitive multi-label learning for audio tag annotation and retrieval. IEEE Transactions on Multimedia,2011,13(3):518-529.
[11] Agarwal S,Branson K,Belongie S. Higher order learning with graphs ∥ Proceedings of the 23rd International Conference on Machine Learning. Pittsburgh,PA,USA:ACM,2006:17-24.
[12] Wang Y Q,Li P,Yao C. Hypergraph canonical correlation analysis for multi-label classification. Signal Processing,2014,105:258-267.
[13] Huang Y C,Liu Q S,Zhang S T,et al. Image retrieval via probabilistic hypergraph ranking ∥ Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco,CA,USA:IEEE,2010:3376-3383.
[14] Yu T Z,Zhang W S. Semisupervised multilabel learning with joint dimensionality reduction. IEEE Signal Processing Letters,2016,23(6):795-799.
[15] Liu D,Hua X S,Yang L J,et al. Tag ranking ∥ Proceedings of the 18th International Conference on World Wide Web. Madrid,Span:ACM,2009:351-360.
[16] Xu J H,Liu J L,Yin J,et al. A multi-label feature extraction algorithm via maximizing feature variance and feature-label dependence simultaneously. Knowledge-Based Systems,2016,98:172-184.
[17] Zhang M L,Zhou Z H. A review on multi-label learning algorithms. IEEE Transactions on Knowledge and Data Engineering,2014,26(8):1819-1837.
[18] Zhang Y,Zhou Z H. Multi-label dimensionality reduction via dependence maximization ∥ Proceedings of the 23rd National Conference on Artificial Intelligence. Chicago,IL,USA:ACM,2008:1503-1505.
[19] Elghazel H,Aussem A,Gharroudi O,et al. Ensemble multi-label text categorization based on rotation forest and latent semantic indexing. Expert Systems with Applications,2016,57:1-11.
[20] Wang H,Yan L,Huang H,et al. From protein sequence to protein function via multi-label linear discriminant analysis. IEEE/ACM Transactions on Computational Biology and Bioinformatics,2017,14(3):503-513.
[1] 蔡亚萍,杨 明* . 一种利用局部标记相关性的多标记特征选择算法[J]. 南京大学学报(自然科学版), 2016, 52(4): 693-.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 林 銮,陆武萍,唐朝生,赵红崴,冷 挺,李胜杰. 基于计算机图像处理技术的松散砂性土微观结构定量分析方法[J]. 南京大学学报(自然科学版), 2018, 54(6): 1064 -1074 .
[2] 杨 薇, 王洪元, 张 继, 张中宝. 一种基于Faster-RCNN的车辆实时检测改进算法[J]. 南京大学学报(自然科学版), 2019, 55(2): 231 -237 .
[3] 李嘉明, 邹 勇, 郑 浩, 魏钟波, 杨柳燕, 缪爱军. 养殖塘生态系统重金属污染状况与风险评价[J]. 南京大学学报(自然科学版), 2019, 55(2): 272 -281 .
[4] 刘作国,陈笑蓉. 汉语句法分析中的论元关系模型研究[J]. 南京大学学报(自然科学版), 2019, 55(6): 1010 -1019 .
[5] 王冬丽,申俊峰,邱海成,杜佰松,李建平,聂潇,王业晗. 辽宁五龙金矿黄铁矿标型特征研究及深部找矿预测[J]. 南京大学学报(自然科学版), 2019, 55(6): 898 -915 .
[6] 段友祥,柳璠,孙歧峰,李洪强. 基于相带划分的孔隙度预测[J]. 南京大学学报(自然科学版), 2019, 55(6): 934 -941 .
[7] 王露,王士同. 改进模糊聚类在医疗卫生数据的Takagi⁃Sugeno模糊模型[J]. 南京大学学报(自然科学版), 2020, 56(2): 186 -196 .
[8] 刘鑫,胡军,张清华. 属性组序下基于代价敏感的约简方法[J]. 南京大学学报(自然科学版), 2020, 56(4): 469 -479 .