南京大学学报(自然科学版) ›› 2023, Vol. 59 ›› Issue (4): 690704.doi: 10.13232/j.cnki.jnju.2023.04.015
王一宾1,2, 葛文信1, 程玉胜1,2(), 吴海峰1,2
Yibin Wang1,2, Wenxin Ge1, Yusheng Cheng1,2(), Haifeng Wu1,2
摘要:
传统的多标签学习一般基于完整的标签信息,但随着数据量的增大,很难为每个实例获得完整的标签信息,导致弱标签问题在多标签数据集中广泛存在,严重影响了多标签的分类性能.为了提升相关性能,不少学者在实际分类中考虑特征、标签和实例部分的关联性,却忽略了它们之间的相关性.基于此,提出一种基于多维相关性的弱类属属性学习算法:首先,根据特征和标签之间的相关性,采用余弦相似度计算出标签之间的相关性;其次,根据特征与实例之间的相关性,采用密度峰值聚类获得实例相关性,并从中选择具有监督信息的标签矩阵,与分解希尔伯特矩阵获得的特征相关性结合构建流形正则化;最后,在多个不同缺省率的多标签数据集上进行了大量实验,验证了提出的算法的有效性.
中图分类号:
1 | Wei W, Wu Q, Chen D,et al. Automatic image annotation based on an improved nearest neighbor technique with tag semantic extension model. Procedia Computer Science,2021(183):616-623. |
2 | Qian T, Li F, Zhang M S,et al. Contrastive learning from label distribution:A case study on text classification. Neurocomputing,2022(507):208-220. |
3 | Xia W Q, Zheng L Y, Fang J B,et al. PFmulDL:A novel strategy enabling multi?class and multi?label protein function annotation by integrating diverse deep learning methods. Computers in Biology and Medicine,2022(145):105465. |
4 | Liu S H, Wang B, Liu B,et al. Multicommunity graph convolution networks with decision fusion for personalized recommendation∥Proceedings of the 26th Pacific?Asia Conference on Knowledge Discovery and Data Mining. Springer Berlin Heidelberg,2022:16-28. |
5 | Yu H F, Jain P, Kar P,et al. Large?scale multi?label learning with missing labels∥Proceedings of the 31st International Conference on International Conference on Machine Learning. Beijing,China:JMLR,2014:I?593-I?601. |
6 | Sun Y Y, Zhang Y, Zhou Z H. Multi?label learning with weak label∥Proceedings of the 24th AAAI Conference on Artificial Intelligence. Atlanta,GA,USA:AAAI,2010:593-598. |
7 | Jiang L, Yu G X, Guo M Z,et al. Feature selection with missing labels based on label compression and local feature correlation. Neurocomputing,2020(395):95-106. |
8 | Boutell M R, Luo J B, Shen X P,et al. Learning multi?label scene classification. Pattern Recognition,2004,37(9):1757-1771. |
9 | Zhang M L, Zhou Z H. ML?KNN:A lazy learning approach to multi?label learning. Pattern Recognition,2007,40(7):2038-2048. |
10 | Zhang M L, Wu L. Lift:Multi?label learning with label?specific features. IEEE Transactions on pattern Analysis and Machine Intelligence,2015,37(1):107-120. |
11 | Huang J, Li G R, Huang Q M,et al. Learning label specific features for multi?label classification∥2015 IEEE International Conference on Data Mining. Atlantic City,NJ,USA:IEEE,2015:181-190. |
12 | Zhang J, Li C D, Cao D L,et al. Multi?label learning with label?specific features by resolving label correlations. Knowledge?Based Systems,2018(159):148-157. |
13 | Jia X Y, Zhu S S, Li W W. Joint label?specific features and correlation information for multi?label learning. Journal of Computer Science and Technology,2020,35(2):247-258. |
14 | Wang Y B, Zheng W J, Cheng Y S,et al. Joint label completion and label?specific features for multi?label learning algorithm. Soft Computing,2020,24(9):6553-6569. |
15 | Huang J, Qin F, Zheng X,et al. Improving multi?label classification with missing labels by learning label?specific features. Information Sciences,2019(492):124-146. |
16 | Zhao D W, Li H, Lu Y X,et al. Multi?label weak?label learning via semantic reconstruction and label correlations. Information Sciences,2023(623):379-401. |
17 | Cheng Y S, Zhao D W, Zhan W F,et al. Multi?label learning of non?equilibrium labels completion with mean shift. Neurocomputing,2018(321):92-102. |
18 | Zhu Y, Kwok J T, Zhou Z H. Multi?label learning with global and local label correlation. IEEE Transactions on Knowledge and Data Engineering,2018,30(6):1081-1094. |
19 | Wang Y B, Zheng W J, Cheng Y S,et al. Two?level label recovery?based label embedding for multi?label classification with missing labels. Applied Soft Computing,2021(99):106868. |
20 | Kumar S, Rastogi R. Low rank label subspace transformation for multi?label learning with missing labels. Information Sciences,2022(596):53-72. |
21 | Zhang J, Li S Z, Jiang M,et al. Learning from weakly labeled data based on manifold regularized sparse model. IEEE Transactions on Cybernetics,2022,52(5):3841-3854. |
22 | Zhang Y, Zhou Z H. Multilabel dimensionality reduction via dependence maximization. ACM Transactions on Knowledge Discovery from Data,2010,4(3):14. |
23 | Rodriguez A, Laio A. Clustering by fast search and find of density peaks. Science,2014,344(6191):1492-1496. |
24 | Qian K, Min X Y, Cheng Y S,et al. Weight matrix sharing for multi?label learning. Pattern Recognition,2023(136):109156. |
25 | Beck A, Teboulle M. A fast iterative shrinkage?thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences,2009,2(1):183-202. |
26 | Lin Z C, Ganesh A, Wright J,et al. Fast convex optimization algorithms for exact recovery of a corrupted low?rank matrix. Technical Report. Urbana:Coordinated Science Laboratory,2009:2214. |
27 | 王一宾,郑伟杰,程玉胜,等. 基于PLSA学习概率分布语义信息的多标签分类算法. 南京大学学报(自然科学),2021,57(1):75-89. |
Wang Y B, Zheng W J, Cheng Y S,et al. Multi?label classification algorithm based on PLSA learning probability distribution semantic information. Journal of Nanjing University (Natural Science),2021,57(1):75-89. | |
28 | 程玉胜,徐玉婷,王一宾,等. 基于共享子空间的潜在语义学习. 南京大学学报(自然科学),2022,58(5):816-826. |
Cheng Y S, Xu Y T, Wang Y B,et al. Latent semantic learning based on shared subspace. Journal of Nanjing University (Natural Science),2022,58(5):816-826. |
[1] | 田小瑜, 秦永彬, 黄瑞章, 陈艳平. 基于相关性约束矩阵分解的多标签分类方法[J]. 南京大学学报(自然科学版), 2023, 59(1): 76-84. |
[2] | 吕佳, 刘强, 李帅军. 结合密度峰值和改进自然邻居的自训练算法[J]. 南京大学学报(自然科学版), 2022, 58(5): 805-815. |
[3] | 吕佳, 郭铭. 基于密度峰值聚类和局部稀疏度的过采样算法[J]. 南京大学学报(自然科学版), 2022, 58(3): 483-494. |
[4] | 王一宾, 郑伟杰, 程玉胜, 曹天成. 基于PLSA学习概率分布语义信息的多标签分类算法[J]. 南京大学学报(自然科学版), 2021, 57(1): 75-89. |
[5] | 胡学钢*,许尧,李培培,张玉红. 一种过滤式多标签特征选择算法[J]. 南京大学学报(自然科学版), 2015, 51(4): 723-730. |
|