南京大学学报(自然科学版) ›› 2023, Vol. 59 ›› Issue (4): 690–704.doi: 10.13232/j.cnki.jnju.2023.04.015

• • 上一篇    下一篇

基于多维相关性的弱类属属性学习

王一宾1,2, 葛文信1, 程玉胜1,2(), 吴海峰1,2   

  1. 1.安庆师范大学计算机与信息学院,安庆,246133
    2.智能感知与计算安徽省高校重点实验室,安庆,246133
  • 收稿日期:2023-05-24 出版日期:2023-07-31 发布日期:2023-08-18
  • 通讯作者: 程玉胜 E-mail:chengyshaq@163.com
  • 基金资助:
    并行与分布处理国防科技重点实验室项目(WDZC202252501);安徽省自然科学基金(2108085MF216)

Weak⁃label⁃specific features learning based on multidimensional correlation

Yibin Wang1,2, Wenxin Ge1, Yusheng Cheng1,2(), Haifeng Wu1,2   

  1. 1.School of Computer and Information, Anqing Normal University, Anqing, 246133, China
    2.Key Laboratory of Intelligent Perception and Computing of Anhui Province, Anqing, 246133, China
  • Received:2023-05-24 Online:2023-07-31 Published:2023-08-18
  • Contact: Yusheng Cheng E-mail:chengyshaq@163.com

摘要:

传统的多标签学习一般基于完整的标签信息,但随着数据量的增大,很难为每个实例获得完整的标签信息,导致弱标签问题在多标签数据集中广泛存在,严重影响了多标签的分类性能.为了提升相关性能,不少学者在实际分类中考虑特征、标签和实例部分的关联性,却忽略了它们之间的相关性.基于此,提出一种基于多维相关性的弱类属属性学习算法:首先,根据特征和标签之间的相关性,采用余弦相似度计算出标签之间的相关性;其次,根据特征与实例之间的相关性,采用密度峰值聚类获得实例相关性,并从中选择具有监督信息的标签矩阵,与分解希尔伯特矩阵获得的特征相关性结合构建流形正则化;最后,在多个不同缺省率的多标签数据集上进行了大量实验,验证了提出的算法的有效性.

关键词: 弱类属属性学习, 标签相关性, 特征相关性, 密度峰值聚类, 实例相关性

Abstract:

Traditional multi?label learning is generally based on complete label information. However,as the volume of data increases,it is difficult to obtain complete label information for each instance,which leads to the widespread presence of weak labels in multi?label datasets. In turn,the classification performance is seriously affected. Many scholars considered the partial association of features,labels and instances in practical classification,while ignoring the correlation between them.Therefore,this paper proposes a weak?label?specific features learning based on multidimensional correlation. Firstly,the correlation between features and labels is measured according to the cosine similarity. Secondly,based on the correlation between labels and instances,density peak clustering is used to obtain instance correlation,which are combined with feature correlation obtained from decomposing hilbert matrices to construct streamwise regularization. Finally,experiments are conducted on multiple multi?label datasets with different missing rates,and the results verifiy the effectiveness of our algorithm.

Key words: weak?label?specific feature, label correlation, feature correlation, density peaks clustering, instance correlation

中图分类号: 

  • TP181

图1

特征?标签?实例的关系图"

表1

多标签数据集"

DatasetsInstancesFeaturesLabelsCardinalityDomains
Artsa5000462261.636Text
Birdsb645260201.471Audio
Computersa5000681331.508Text
Entertainmenta5000640211.640Text
Healtha5000612321.612Text
Imagec200029451.236Image
Medicalb9791449451.215Text
Recreationa5000606221.606Text
Referencea5000793331.793Text
Societya5000636271.692Text
Sciencea5000743401.451Text
Sociala50001047391.283Text

表2

六种算法在12个数据集上的缺省实验结果(海明损失)"

DatasetsrGLOCALLRMMLMSWLLSMLJLCLSWSMDC
Arts0.10.0629±0.00160.0538±0.00170.0605±0.00110.0577±0.00090.0582±0.00230.0530±0.0010
0.30.0629±0.00170.0553±0.00130.0608±0.00120.0588±0.00080.0615±0.00150.0536±0.0011
0.50.0629±0.00070.0570±0.00160.0613±0.00150.0608±0.00080.0624±0.00070.0547±0.0016
0.70.0629±0.00130.0607±0.00100.0621±0.00060.0624±0.00050.0629±0.00030.0566±0.0013
Birds0.10.0734±0.01950.0508±0.00290.0674±0.00490.0616±0.00330.0511±0.00180.0485±0.0063
0.30.0557±0.00190.0533±0.00310.0695±0.00250.0653±0.00170.0516±0.00570.0505±0.0079
0.50.0621±0.00290.0533±0.00390.0713±0.00490.0729±0.00330.0554±0.00260.0547±0.0050
0.70.0592±0.00600.0622±0.00450.0726±0.00400.0735±0.00300.0598±0.00590.0558±0.0040
Computers0.10.0499±0.00460.0337±0.00070.0435±0.00120.0392±0.00060.0384±0.00250.0331±0.0010
0.30.0527±0.00070.0346±0.00070.0436±0.00130.0408±0.00180.0444±0.00140.0335±0.0016
0.50.0544±0.00090.0385±0.00070.0449±0.00170.0428±0.00110.0580±0.00220.0344±0.0010
0.70.0558±0.00270.0597±0.00120.0450±0.00030.0450±0.00070.0889±0.00360.0356±0.0011
Entertainment0.10.0589±0.00890.0513±0.00100.0648±0.00210.0574±0.00210.0565±0.00170.0501±0.0013
0.30.0670±0.00090.0535±0.00110.0655±0.00190.0596±0.00080.0634±0.00250.0513±0.0024
0.50.0684±0.00080.0570±0.00080.0665±0.00060.0628±0.00120.0757±0.00290.0524±0.0015
0.70.0699±0.00630.0609±0.00070.0657±0.00150.0666±0.00150.1084±0.00560.0558±0.0013
Health0.10.0393±0.00390.0330±0.00090.0481±0.00130.0407±0.00060.0351±0.00130.0316±0.0015
0.30.0412±0.00090.0341±0.00100.0489±0.00100.0436±0.00070.0400±0.00110.0326±0.0011
0.50.0426±0.00100.0377±0.00060.0490±0.00060.0473±0.00040.0507±0.00180.0339±0.0014
0.70.0485±0.00200.0563±0.00260.0503±0.00100.0509±0.00100.0811±0.00090.0358±0.0009
Image0.10.1947±0.02110.2110±0.01220.2390±0.00780.2427±0.00680.2120±0.00640.2017±0.0199
0.30.2076±0.00500.2110±0.00820.2443±0.01080.2450±0.01010.2153±0.02050.2123±0.0180
0.50.2122±0.00430.2230±0.00970.2433±0.00610.2453±0.01320.2107±0.00590.2140±0.0094
0.70.2514±0.01360.2207±0.01570.2457±0.00510.2467±0.00530.2110±0.00620.2217±0.0151
Medical0.10.0138±0.00220.0114±0.00160.0247±0.00230.0194±0.00060.0108±0.00070.0101±0.0012
0.30.0117±0.00090.0122±0.00110.0253±0.00310.0220±0.00150.0129±0.00050.0154±0.0015
0.50.0132±0.00140.0144±0.00110.0257±0.00260.0249±0.00040.0179±0.00220.0169±0.0019
0.70.0215±0.00280.0262±0.00320.0278±0.00020.0270±0.00100.0442±0.00280.0182±0.0014
Recreation0.10.0664±0.00800.0538±0.00120.0621±0.00100.0588±0.00090.0600±0.00120.0528±0.0009
0.30.0718±0.00150.0560±0.00070.0629±0.00070.0602±0.00160.0667±0.00170.0541±0.0018
0.50.0747±0.00120.0588±0.00070.0630±0.00200.0618±0.00120.0791±0.00150.0550±0.0022
0.70.0747±0.00450.0625±0.00100.0631±0.00150.0639±0.00160.1106±0.00190.0582±0.0011
Reference0.10.0338±0.00450.0251±0.00060.0330±0.00010.0290±0.00020.0294±0.00080.0244±0.0010
0.30.0380±0.00050.0262±0.00040.0336±0.00040.0303±0.00040.0362±0.00110.0251±0.0009
0.50.0387±0.00060.0305±0.00050.0339±0.00090.0322±0.00030.0514±0.00410.0253±0.0008
0.70.0394±0.00310.0531±0.00260.0344±0.00050.0346±0.00040.0868±0.00210.0265±0.0013
Society0.10.0557±0.00700.0515±0.00150.0586±0.00240.0554±0.00140.0555±0.00080.0508±0.0019
0.30.0617±0.00100.0530±0.00090.0609±0.00190.0570±0.00130.0615±0.00120.0515±0.0018
0.50.0632±0.00120.0571±0.00130.0614±0.00160.0595±0.00130.0738±0.00220.0523±0.0018
0.70.0621±0.00340.0816±0.00280.0620±0.00130.0623±0.00130.1046±0.00150.0541±0.0019
Science0.10.0340±0.00450.0312±0.00090.0352±0.00010.0341±0.00080.0334±0.00060.0307±0.0013
0.30.0376±0.00070.0318±0.00120.0359±0.00070.0347±0.00060.0383±0.00100.0317±0.0009
0.50.0386±0.00090.0333±0.00080.0356±0.00050.0353±0.00030.0507±0.00080.0322±0.0006
0.70.0390±0.00270.0350±0.00070.0359±0.00070.0361±0.00040.0829±0.00260.0329±0.0009
Social0.10.0217±0.00390.0201±0.00040.0292±0.00100.0238±0.00030.0213±0.00060.0195±0.0013
0.30.0247±0.00050.0213±0.00050.0301±0.00070.0256±0.00050.0259±0.00100.0202±0.0011
0.50.0255±0.00060.0249±0.00060.0310±0.00090.0282±0.00050.0389±0.00050.0207±0.0011
0.70.0282±0.00280.0521±0.00130.0322±0.00060.0318±0.00030.0740±0.00220.0212±0.0010

表3

六种算法在12个数据集上的缺省实验结果(平均精度)"

DatasetsrGLOCALLRMMLMSWLLSMLJLCLSWSMDC
Arts0.10.6192±0.01180.6267±0.01040.6167±0.00830.6114±0.00640.6122±0.00400.6381±0.0128
0.30.6123±0.01120.6029±0.01140.6020±0.00910.5772±0.00870.6099±0.01160.6127±0.0138
0.50.6038±0.00720.5736±0.00700.5830±0.01260.5227±0.01020.5851±0.00820.5823±0.0089
0.70.5837±0.00710.5387±0.01230.5499±0.01060.4706±0.00830.5476±0.01090.5411±0.0099
Birds0.10.7562±0.01950.7608±0.00290.7494±0.02670.7590±0.01680.7645±0.01090.7734±0.0353
0.30.7088±0.01360.7533±0.01020.7392±0.02870.7342±0.02390.7268±0.03410.7510±0.0176
0.50.6889±0.00820.7187±0.01520.7169±0.02050.7046±0.04470.7088±0.03220.7328±0.0331
0.70.6657±0.04760.6857±0.01440.6856±0.03300.6893±0.03250.6768±0.04850.6990±0.0311
Computers0.10.6877±0.05310.7042±0.00580.6991±0.00730.6922±0.01210.6926±0.01210.7206±0.0158
0.30.6511±0.00770.6835±0.00790.6815±0.01080.6560±0.01350.6563±0.01200.6956±0.0155
0.50.6373±0.00730.6316±0.01030.6683±0.01650.6166±0.00820.6082±0.01060.6781±0.0122
0.70.6152±0.02710.5699±0.00620.6383±0.01430.5311±0.00890.5411±0.01990.6404±0.0165
Entertainment0.10.6823±0.06560.6886±0.00480.6883±0.01010.6792±0.00580.6833±0.01150.7034±0.0120
0.30.6349±0.00950.6680±0.00720.6746±0.01120.6487±0.01050.6451±0.00830.6887±0.0170
0.50.6172±0.01210.6471±0.00770.6547±0.01300.6089±0.00870.6110±0.00380.6623±0.0191
0.70.5766±0.03970.6083±0.00770.6324±0.00720.5452±0.01410.5343±0.01470.6176±0.0106
Health0.10.7663±0.04850.7784±0.01090.7756±0.00930.7724±0.00680.7672±0.00770.7954±0.0132
0.30.7331±0.00730.7625±0.00810.7688±0.01010.7438±0.00690.7353±0.00940.7790±0.0101
0.50.7202±0.00660.7273±0.00930.7508±0.01130.6941±0.00670.7002±0.01200.7641±0.0111
0.70.6814±0.02630.6534±0.00730.7239±0.01190.6112±0.00880.6145±0.01220.7327±0.0066
Image0.10.7628±0.03020.7259±0.03050.7564±0.02320.7583±0.01300.7507±0.03510.7620±0.0350
0.30.7418±0.00790.6929±0.03370.7451±0.01470.7537±0.02590.7478±0.02920.7534±0.0433
0.50.7370±0.01080.7190±0.04020.7200±0.03820.7411±0.02610.7395±0.02370.7419±0.0357
0.70.6489±0.02750.6674±0.01460.6906±0.04020.7278±0.04370.7271±0.02250.7214±0.0329
Medical0.10.8604±0.03840.8870±0.01320.7246±0.02480.8895±0.01110.8906±0.01420.9076±0.0168
0.30.8776±0.00600.8724±0.01020.6784±0.28810.8626±0.01990.8634±0.01100.8639±0.0294
0.50.8608±0.01660.8320±0.02040.6340±0.32330.8221±0.01690.8194±0.02360.8405±0.0331
0.70.7103±0.07720.7099±0.02740.5066±0.21380.7122±0.01480.6959±0.03080.8030±0.0327
Recreation0.10.6236±0.07030.6299±0.00510.6311±0.00660.6209±0.00690.6208±0.00750.6512±0.0158
0.30.5721±0.01110.6104±0.00740.6158±0.01380.5804±0.00820.5872±0.01220.6266±0.0126
0.50.5558±0.01290.5740±0.01070.5917±0.00400.5358±0.00940.5448±0.00590.5935±0.0195
0.70.5113±0.03840.5260±0.00600.5673±0.00770.4749±0.01570.4744±0.00860.5392±0.0256
Reference0.10.7061±0.05450.7092±0.00670.7068±0.00790.6986±0.01090.6936±0.00890.7268±0.0092
0.30.6613±0.00640.6822±0.00390.6938±0.01250.6519±0.00620.6536±0.01210.7017±0.0174
0.50.6510±0.00640.6277±0.00660.6748±0.01080.5973±0.00920.5997±0.01680.6755±0.0148
0.70.6151±0.02820.5516±0.00850.6451±0.01210.5101±0.01030.5059±0.00990.6334±0.0117
Society0.10.6426±0.06760.6363±0.01140.6363±0.01340.6207±0.01070.6221±0.00790.6490±0.0145
0.30.5860±0.00830.6124±0.01000.5635±0.11460.5904±0.00890.5860±0.01290.6273±0.0114
0.50.5746±0.00400.5760±0.00920.5497±0.11600.5527±0.01140.5532±0.01300.6069±0.0145
0.70.5662±0.02950.5138±0.01070.4769±0.12340.4871±0.00940.4819±0.01090.5725±0.0178
Science0.10.5868±0.07760.5943±0.02020.5947±0.00440.5832±0.00900.5803±0.00420.6197±0.0171
0.30.5352±0.00980.5642±0.02020.5354±0.01290.5393±0.00480.5363±0.00410.5837±0.0168
0.50.5138±0.01470.5162±0.01370.4946±0.11960.4733±0.01240.4720±0.00530.5549±0.0126
0.70.4663±0.03460.4652±0.01960.4160±0.16000.3781±0.01220.3871±0.01340.4935±0.0181
Social0.10.7696±0.05200.7708±0.00760.7797±0.01400.7583±0.00630.7615±0.01090.7874±0.0137
0.30.7308±0.00600.7470±0.00550.7633±0.00730.7208±0.01540.7198±0.01110.7647±0.0098
0.50.7223±0.00700.6933±0.00770.7511±0.01260.6708±0.00810.6653±0.00710.7458±0.0148
0.70.6780±0.02600.6140±0.00600.7198±0.00410.5752±0.01200.5751±0.01610.7154±0.0134

表4

六种算法在12个数据集上的缺省实验结果(1?错误率)"

DatasetsrGLOCALLRMMLMSWLLSMLJLCLSWSMDC
Arts0.10.4554±0.01270.4588±0.01130.4596±0.01350.4590±0.00900.4568±0.01490.4392±0.0209
0.30.4641±0.01620.4764±0.00830.4740±0.01390.4910±0.01460.4620±0.01560.4542±0.0168
0.50.4832±0.00970.5058±0.01130.4938±0.01250.5462±0.01300.4866±0.00940.4840±0.0114
0.70.5049±0.01050.5628±0.01650.5310±0.01340.5992±0.00990.5156±0.01570.5224±0.0139
Birds0.10.3055±0.02430.2868±0.01100.2992±0.03010.2884±0.03840.2899±0.03270.2847±0.0486
0.30.3690±0.01950.2884±0.01730.3116±0.04890.3039±0.02750.3163±0.04110.2884±0.0308
0.50.3938±0.01010.3256±0.02980.3411±0.02550.3364±0.07110.3395±0.03580.3163±0.0482
0.70.4035±0.06120.3597±0.03720.3752±0.02620.3473±0.03650.3612±0.06040.3411±0.0436
Computers0.10.3664±0.05910.3498±0.01070.3480±0.01570.3548±0.01800.3550±0.01240.3362±0.0184
0.30.4038±0.00950.3660±0.00630.3636±0.01560.3814±0.01520.3810±0.00620.3486±0.0220
0.50.4223±0.00940.4022±0.01580.3754±0.01430.4196±0.01860.4302±0.01620.3578±0.0134
0.70.4412±0.03380.4684±0.00400.4092±0.01880.5164±0.01260.5028±0.02630.3896±0.0255
Entertainment0.10.3935±0.08380.3950±0.00600.3936±0.00940.3972±0.01460.3962±0.01880.3808±0.0205
0.30.4527±0.01190.4166±0.01350.4068±0.01730.4280±0.01210.4296±0.01240.3898±0.0249
0.50.4772±0.01910.4420±0.00910.4344±0.01540.4692±0.01090.4616±0.00550.4124±0.0251
0.70.5327±0.05180.4882±0.01060.4528±0.01600.5360±0.01520.5512±0.01970.4630±0.0103
Health0.10.2805±0.06100.2584±0.02020.2666±0.01910.2556±0.01380.2594±0.00820.2412±0.0243
0.30.3174±0.01000.2634±0.01730.2724±0.01950.2770±0.01330.2864±0.01780.2518±0.0151
0.50.3379±0.01150.2928±0.01620.2946±0.01480.3276±0.01240.3166±0.01760.2580±0.0181
0.70.3827±0.03690.3686±0.00520.3210±0.02140.4200±0.01390.4166±0.00390.2828±0.0174
Image0.10.3620±0.04760.4267±0.04260.3833±0.04620.3900±0.02320.4000±0.05450.3733±0.0684
0.30.3960±0.01850.4867±0.04370.3900±0.01930.3917±0.04050.3933±0.05040.3867±0.0722
0.50.4020±0.01600.4417±0.06010.4283±0.06680.4033±0.05150.4133±0.03890.4083±0.0602
0.70.5360±0.04250.5033±0.02870.4717±0.06320.4300±0.06760.4250±0.03370.4433±0.0588
Medical0.10.1840±0.05810.1585±0.02370.3766±0.03260.1452±0.00880.1421±0.01860.1298±0.0295
0.30.1480±0.00720.1667±0.02870.3829±0.31550.1616±0.02620.1687±0.01840.1717±0.0420
0.50.1684±0.01940.1963±0.03040.4380±0.35940.2055±0.01770.2117±0.04060.2025±0.0414
0.70.3723±0.09240.3446±0.03580.6116±0.24150.3364±0.01880.3538±0.03570.2300±0.0372
Recreation0.10.4588±0.09080.4566±0.01180.4492±0.00920.4662±0.01060.4664±0.01430.4376±0.0247
0.30.5195±0.01810.4844±0.01540.4682±0.01710.5046±0.00710.4948±0.01880.4590±0.0217
0.50.5460±0.01650.5258±0.01620.4960±0.00610.5500±0.01490.5374±0.00530.4844±0.0277
0.70.5936±0.05000.5844±0.01150.5212±0.00760.6242±0.02150.6174±0.01310.5518±0.0338
Reference0.10.3673±0.06150.3666±0.01030.3676±0.01230.3718±0.01570.3764±0.01370.3504±0.0132
0.30.4155±0.00950.3854±0.00740.3846±0.01790.4130±0.00340.4148±0.01400.3664±0.0238
0.50.4292±0.00890.4404±0.00570.4014±0.01300.4734±0.01130.4716±0.01340.3850±0.0208
0.70.4741±0.03190.5260±0.01160.4320±0.01780.5738±0.01290.5806±0.01130.4260±0.0193
Society0.10.3871±0.08020.3920±0.01780.3978±0.01390.4010±0.01270.4022±0.00880.3778±0.0186
0.30.4495±0.01170.4114±0.01750.4750±0.12070.4290±0.01390.4360±0.03100.3916±0.0177
0.50.4675±0.00600.4420±0.01400.4866±0.12140.4748±0.01830.4676±0.01750.4142±0.0159
0.70.4756±0.03800.5240±0.01560.5678±0.13560.5564±0.01590.5660±0.01600.4490±0.0268
Science0.10.4905±0.09520.4934±0.02550.4932±0.00700.4950±0.01240.4954±0.01050.4740±0.0250
0.30.5513±0.01240.5168±0.02450.5512±0.01680.5268±0.00800.5322±0.00880.5008±0.0180
0.50.5799±0.02090.5594±0.01680.5922±0.12910.6022±0.01540.5988±0.01120.5182±0.0172
0.70.6333±0.04270.6192±0.01320.6738±0.16720.7000±0.01570.6926±0.01380.5762±0.0251
Social0.10.2865±0.06110.2752±0.01240.2736±0.01450.2814±0.00980.2782±0.01950.2632±0.0187
0.30.3220±0.01020.2944±0.00410.2916±0.01150.3176±0.01620.3144±0.01400.2838±0.0130
0.50.3387±0.01160.3438±0.01460.3024±0.01480.3724±0.01370.3788±0.01120.2934±0.0181
0.70.4135±0.03420.4354±0.00620.3346±0.00910.4832±0.01250.4830±0.02190.3190±0.0151

表5

六种算法在12个数据集上的缺省实验结果(排序损失)"

DatasetsrGLOCALLRMMLMSWLLSMLJLCLSWSMDC
Arts0.10.1466±0.00640.1244±0.00450.1372±0.00430.1539±0.00680.1537±0.00440.1259±0.0060
0.30.1448±0.00710.1464±0.00740.1471±0.00370.1748±0.00440.1452±0.00730.1458±0.0079
0.50.1446±0.00580.1638±0.00530.1643±0.00830.2130±0.00750.1634±0.00530.1666±0.0097
0.70.1466±0.00520.1706±0.00380.1904±0.00670.2441±0.00560.1917±0.00670.1994±0.0091
Birds0.10.1164±0.01370.0933±0.00640.1099±0.01660.0964±0.00440.0945±0.01480.0878±0.0179
0.30.1074±0.00600.1087±0.00950.1101±0.00880.1207±0.01550.1191±0.01520.1088±0.0092
0.50.1200±0.00700.1268±0.00990.1293±0.01830.1406±0.02220.1358±0.01700.1128±0.0195
0.70.1349±0.01240.1568±0.00980.1496±0.02800.1554±0.01720.1637±0.02760.1410±0.0117
Computers0.10.1027±0.02590.0983±0.00260.1035±0.00430.1106±0.00710.1120±0.01070.0828±0.0091
0.30.1305±0.00240.1134±0.00420.1142±0.00520.1344±0.00620.1351±0.00520.1007±0.0075
0.50.1304±0.00260.1505±0.00530.1286±0.01180.1529±0.00330.1582±0.00370.1154±0.0094
0.70.1358±0.00310.1853±0.00720.1475±0.01060.1990±0.00580.1955±0.01170.1382±0.0088
Entertainment0.10.1194±0.03580.1730±0.00220.1142±0.00850.1272±0.00390.1268±0.00450.1007±0.0039
0.30.1517±0.00340.1289±0.00160.1236±0.00840.1492±0.00800.1521±0.00460.1164±0.0077
0.50.1562±0.00220.1339±0.00360.1385±0.01090.1738±0.00720.1744±0.00720.1352±0.0101
0.70.1595±0.00280.1553±0.00380.1569±0.00490.2168±0.01220.2231±0.00660.1618±0.0072
Health0.10.0647±0.02040.0662±0.00300.0622±0.00330.0743±0.00370.0766±0.00390.0564±0.0051
0.30.0865±0.00090.0808±0.00420.0672±0.00250.0905±0.00340.0936±0.00340.0668±0.0023
0.50.0857±0.00200.0981±0.00450.0793±0.00280.1153±0.00620.1124±0.00490.0752±0.0047
0.70.0841±0.00090.1353±0.00340.0993±0.00310.1525±0.00730.1475±0.00820.0922±0.0050
Image0.10.2085±0.02780.2279±0.03530.1992±0.01830.1874±0.01430.1976±0.02790.1897±0.0252
0.30.2294±0.00730.2483±0.03570.2197±0.02050.1989±0.02490.2038±0.02600.1996±0.0415
0.50.2354±0.01080.2310±0.03600.2431±0.03290.2128±0.02600.2086±0.01720.2089±0.0299
0.70.2644±0.01320.2953±0.02660.2765±0.03730.2228±0.04290.2264±0.02480.2264±0.0297
Medical0.10.0239±0.00740.0202±0.00620.1688±0.01460.0227±0.00560.0247±0.00370.0148±0.0050
0.30.0254±0.00220.0282±0.00680.1494±0.19340.0340±0.01160.0349±0.00530.0303±0.0099
0.50.0260±0.00350.0489±0.00910.1914±0.20940.0463±0.01000.0488±0.00590.0387±0.0167
0.70.0375±0.00570.0836±0.00990.2065±0.13340.0818±0.00670.0834±0.01030.0602±0.0190
Recreation0.10.1513±0.03810.1492±0.00400.1518±0.00490.1604±0.00560.1591±0.00580.1347±0.0082
0.30.1890±0.00180.1584±0.00390.1633±0.00580.1900±0.00550.1866±0.00560.1520±0.0082
0.50.1907±0.00520.1788±0.00510.1762±0.00560.2192±0.00800.2128±0.00550.1790±0.0099
0.70.2038±0.00480.1976±0.00340.2005±0.00560.2504±0.01010.2544±0.00510.2092±0.0142
Reference0.10.0873±0.03000.0943±0.00220.0944±0.00240.1061±0.00440.1072±0.00450.0782±0.0046
0.30.1243±0.00470.1149±0.00090.1040±0.00370.1356±0.00700.1319±0.00650.1012±0.0080
0.50.1184±0.00380.1453±0.00680.1208±0.00600.1597±0.00450.1629±0.01540.1178±0.0079
0.70.1206±0.00290.1852±0.00360.1422±0.00700.2030±0.00790.2036±0.00550.1428±0.0089
Society0.10.1434±0.03710.1509±0.00610.1401±0.00690.1676±0.00620.1636±0.00580.1361±0.0084
0.30.1817±0.00090.1706±0.00660.2110±0.12140.1893±0.00430.1912±0.00430.1567±0.0051
0.50.1820±0.00250.1989±0.00380.2254±0.11970.2115±0.00760.2161±0.00500.1715±0.0112
0.70.1860±0.00350.2328±0.00610.3058±0.13820.2467±0.00300.2491±0.00440.1960±0.0131
Science0.10.1310±0.03720.1300±0.00860.1285±0.00370.1485±0.00750.1496±0.00240.1105±0.0092
0.30.1652±0.00230.1560±0.00940.1774±0.00870.1733±0.00420.1768±0.00300.1356±0.0109
0.50.1684±0.00270.1795±0.01030.2172±0.11210.2088±0.00890.2094±0.00630.1536±0.0058
0.70.1740±0.00260.2003±0.01710.2873±0.13890.2623±0.00870.2529±0.00660.1920±0.0142
Social0.10.0709±0.02590.0746±0.00560.0644±0.00580.0830±0.00430.0839±0.00490.0595±0.0074
0.30.0985±0.00100.0935±0.00530.0757±0.00590.1038±0.00520.1048±0.00500.0723±0.0071
0.50.0957±0.00160.1187±0.00360.0863±0.00580.1257±0.00190.1278±0.00350.0845±0.0070
0.70.0969±0.00350.1565±0.00440.1089±0.00620.1662±0.01160.1672±0.01080.1038±0.0088

表6

六种算法在12个数据集上的缺省实验结果(覆盖率)"

DatasetsrGLOCALLRMMLMSWLLSMLJLCLSWSMDC
Arts0.10.2168±0.00650.2117±0.00860.1990±0.00520.2292±0.00800.2281±0.00830.1965±0.0084
0.30.2129±0.01230.2207±0.00980.2105±0.00520.2512±0.00430.2177±0.01320.2194±0.0112
0.50.2115±0.00730.2405±0.00760.2314±0.01100.2926±0.00630.2406±0.00800.2434±0.0157
0.70.2123±0.00870.2445±0.00440.2643±0.01070.3259±0.00670.2715±0.00720.2783±0.0075
Birds0.10.1727±0.01860.1447±0.01640.1656±0.02050.1478±0.01200.1478±0.02250.1369±0.0248
0.30.1691±0.00630.1662±0.01660.1616±0.01650.1789±0.01570.1771±0.02130.1647±0.0187
0.50.1786±0.00540.1815±0.01610.1865±0.02850.1985±0.02100.1902±0.01250.1675±0.0280
0.70.1945±0.01470.2140±0.01700.2164±0.03540.2178±0.01940.2189±0.02960.1971±0.0172
Computers0.10.1325±0.03270.1439±0.00350.1308±0.00060.1587±0.01050.1611±0.01380.1238±0.0108
0.30.1680±0.00200.1629±0.00410.1499±0.00570.1889±0.00640.1876±0.00960.1488±0.0098
0.50.1674±0.00310.2055±0.00470.1631±0.01450.2094±0.00430.2144±0.00150.1684±0.0119
0.70.1747±0.00420.2414±0.00720.1878±0.01100.2572±0.00740.2550±0.01610.1954±0.0072
Entertainment0.10.1601±0.03680.1615±0.00380.1510±0.00820.1737±0.00630.1734±0.00730.1435±0.0052
0.30.1940±0.00240.1744±0.00520.1627±0.01090.1969±0.00830.1990±0.00440.1618±0.0118
0.50.1976±0.00200.1790±0.00650.1795±0.01290.2223±0.00880.2230±0.00910.1835±0.0112
0.70.1993±0.00270.2024±0.00450.2012±0.00410.2665±0.01460.2727±0.00580.2100±0.0090
Health0.10.1145±0.02990.1271±0.00290.1220±0.00540.1368±0.00660.1413±0.00720.1135±0.0093
0.30.1499±0.00110.1463±0.00520.1304±0.00450.1588±0.00560.1605±0.00630.1267±0.0066
0.50.1460±0.00440.1679±0.00270.1464±0.00380.1883±0.00890.1831±0.00430.1398±0.0089
0.70.1405±0.00110.2108±0.00530.1635±0.00280.2270±0.01120.2225±0.01200.1616±0.0096
Image0.10.2200±0.02560.2330±0.02550.2110±0.01430.2023±0.01330.2107±0.02490.2053±0.0206
0.30.2408±0.00540.2503±0.02300.2273±0.02990.2110±0.02910.2170±0.03350.2133±0.0365
0.50.2450±0.00800.2370±0.02400.2460±0.02480.2233±0.02730.2203±0.01480.2200±0.0261
0.70.2714±0.01180.2857±0.01380.2703±0.03330.2307±0.03110.2333±0.02200.2320±0.0265
Medical0.10.0347±0.00780.0325±0.00870.1660±0.02440.0363±0.00790.0391±0.00660.0251±0.0064
0.30.0378±0.00270.0428±0.01010.1698±0.20550.0519±0.01540.0519±0.00800.0440±0.0147
0.50.0388±0.00430.0703±0.01110.2120±0.22280.0637±0.01210.0653±0.00880.0556±0.0201
0.70.0526±0.00710.1041±0.01310.2271±0.13670.1015±0.01150.1047±0.01200.0813±0.0238
Recreation0.10.1927±0.04190.1989±0.00620.1925±0.00710.2126±0.00650.2113±0.00970.1845±0.0086
0.30.2346±0.00130.2086±0.00730.2059±0.00510.2424±0.00840.2393±0.00910.2028±0.0087
0.50.2361±0.00540.2288±0.00510.2186±0.00490.2725±0.00670.2655±0.01050.2317±0.0110
0.70.2481±0.00370.2463±0.00430.2454±0.00870.3021±0.01050.3075±0.00600.2632±0.0139
Reference0.10.0997±0.03460.1186±0.00450.1122±0.00410.1316±0.00400.1331±0.00600.1015±0.0059
0.30.1436±0.00650.1397±0.00200.1219±0.00610.1611±0.00900.1578±0.00640.1258±0.0078
0.50.1510±0.01850.1708±0.00710.1399±0.00820.1853±0.00480.1884±0.01700.1426±0.0099
0.70.1682±0.03860.2108±0.00370.1634±0.00980.2273±0.00680.2277±0.00640.1680±0.0094
Society0.10.2227±0.04280.2381±0.00830.2180±0.01030.2556±0.00890.2516±0.00450.2204±0.0121
0.30.2673±0.00160.2590±0.00960.2864±0.11000.2802±0.00380.2824±0.00560.2437±0.0078
0.50.2673±0.00340.2899±0.00730.3022±0.10770.3029±0.00980.3059±0.00530.2599±0.0163
0.70.2680±0.00300.3207±0.00830.3861±0.12580.3352±0.00450.3375±0.00550.2851±0.0160
Science0.10.1754±0.03850.1798±0.01120.1711±0.00470.2000±0.00430.2015±0.00550.1557±0.0136
0.30.2114±0.00150.2098±0.01310.2322±0.00750.2303±0.00750.2317±0.00660.1872±0.0118
0.50.2129±0.00240.2344±0.01450.2665±0.11470.2644±0.00980.2672±0.00680.2080±0.0062
0.70.2174±0.00240.2576±0.02220.3393±0.14420.3197±0.01080.3126±0.00780.2497±0.0137
Social0.10.0980±0.03350.1081±0.00750.0905±0.00670.1189±0.00650.1186±0.00580.0886±0.0108
0.30.1341±0.00140.1297±0.00720.1056±0.00880.1396±0.00470.1415±0.00600.1039±0.0082
0.50.1305±0.00250.1566±0.00560.1197±0.00710.1636±0.00140.1653±0.00450.1185±0.0103
0.70.1303±0.00370.1958±0.00540.1464±0.00760.2061±0.01220.2051±0.01080.1411±0.0105

图2

Birds数据集上的参数敏感性分析"

图3

WSMDC和WSC算法在缺省率为0.1的12个多标签数据集上的消融性分析的结果"

表7

每个评价指标的弗里德曼检验值FF和临界值"

评价指标FF临界值
HL49.11692.252
AP33.6400
OE28.9474
RL29.0327
CV32.0061

图4

WSMDC算法与其他算法的五个评价指标的Nemenyi检验结果"

1 Wei W, Wu Q, Chen D,et al. Automatic image annotation based on an improved nearest neighbor technique with tag semantic extension model. Procedia Computer Science2021(183):616-623.
2 Qian T, Li F, Zhang M S,et al. Contrastive learning from label distribution:A case study on text classification. Neurocomputing2022(507):208-220.
3 Xia W Q, Zheng L Y, Fang J B,et al. PFmulDL:A novel strategy enabling multi?class and multi?label protein function annotation by integrating diverse deep learning methods. Computers in Biology and Medicine2022(145):105465.
4 Liu S H, Wang B, Liu B,et al. Multicommunity graph convolution networks with decision fusion for personalized recommendation∥Proceedings of the 26th Pacific?Asia Conference on Knowledge Discovery and Data Mining. Springer Berlin Heidelberg,2022:16-28.
5 Yu H F, Jain P, Kar P,et al. Large?scale multi?label learning with missing labels∥Proceedings of the 31st International Conference on International Conference on Machine Learning. Beijing,China:JMLR,2014:I?593-I?601.
6 Sun Y Y, Zhang Y, Zhou Z H. Multi?label learning with weak label∥Proceedings of the 24th AAAI Conference on Artificial Intelligence. Atlanta,GA,USA:AAAI,2010:593-598.
7 Jiang L, Yu G X, Guo M Z,et al. Feature selection with missing labels based on label compression and local feature correlation. Neurocomputing2020(395):95-106.
8 Boutell M R, Luo J B, Shen X P,et al. Learning multi?label scene classification. Pattern Recognition200437(9):1757-1771.
9 Zhang M L, Zhou Z H. ML?KNN:A lazy learning approach to multi?label learning. Pattern Recognition200740(7):2038-2048.
10 Zhang M L, Wu L. Lift:Multi?label learning with label?specific features. IEEE Transactions on pattern Analysis and Machine Intelligence201537(1):107-120.
11 Huang J, Li G R, Huang Q M,et al. Learning label specific features for multi?label classification∥2015 IEEE International Conference on Data Mining. Atlantic City,NJ,USA:IEEE,2015:181-190.
12 Zhang J, Li C D, Cao D L,et al. Multi?label learning with label?specific features by resolving label correlations. Knowledge?Based Systems2018(159):148-157.
13 Jia X Y, Zhu S S, Li W W. Joint label?specific features and correlation information for multi?label learning. Journal of Computer Science and Technology202035(2):247-258.
14 Wang Y B, Zheng W J, Cheng Y S,et al. Joint label completion and label?specific features for multi?label learning algorithm. Soft Computing202024(9):6553-6569.
15 Huang J, Qin F, Zheng X,et al. Improving multi?label classification with missing labels by learning label?specific features. Information Sciences2019(492):124-146.
16 Zhao D W, Li H, Lu Y X,et al. Multi?label weak?label learning via semantic reconstruction and label correlations. Information Sciences2023(623):379-401.
17 Cheng Y S, Zhao D W, Zhan W F,et al. Multi?label learning of non?equilibrium labels completion with mean shift. Neurocomputing2018(321):92-102.
18 Zhu Y, Kwok J T, Zhou Z H. Multi?label learning with global and local label correlation. IEEE Transactions on Knowledge and Data Engineering201830(6):1081-1094.
19 Wang Y B, Zheng W J, Cheng Y S,et al. Two?level label recovery?based label embedding for multi?label classification with missing labels. Applied Soft Computing2021(99):106868.
20 Kumar S, Rastogi R. Low rank label subspace transformation for multi?label learning with missing labels. Information Sciences2022(596):53-72.
21 Zhang J, Li S Z, Jiang M,et al. Learning from weakly labeled data based on manifold regularized sparse model. IEEE Transactions on Cybernetics202252(5):3841-3854.
22 Zhang Y, Zhou Z H. Multilabel dimensionality reduction via dependence maximization. ACM Transactions on Knowledge Discovery from Data20104(3):14.
23 Rodriguez A, Laio A. Clustering by fast search and find of density peaks. Science2014344(6191):1492-1496.
24 Qian K, Min X Y, Cheng Y S,et al. Weight matrix sharing for multi?label learning. Pattern Recognition2023(136):109156.
25 Beck A, Teboulle M. A fast iterative shrinkage?thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences20092(1):183-202.
26 Lin Z C, Ganesh A, Wright J,et al. Fast convex optimization algorithms for exact recovery of a corrupted low?rank matrix. Technical Report. Urbana:Coordinated Science Laboratory,2009:2214.
27 王一宾,郑伟杰,程玉胜,等. 基于PLSA学习概率分布语义信息的多标签分类算法. 南京大学学报(自然科学)202157(1):75-89.
Wang Y B, Zheng W J, Cheng Y S,et al. Multi?label classification algorithm based on PLSA learning probability distribution semantic information. Journal of Nanjing University (Natural Science)202157(1):75-89.
28 程玉胜,徐玉婷,王一宾,等. 基于共享子空间的潜在语义学习. 南京大学学报(自然科学)202258(5):816-826.
Cheng Y S, Xu Y T, Wang Y B,et al. Latent semantic learning based on shared subspace. Journal of Nanjing University (Natural Science)202258(5):816-826.
[1] 田小瑜, 秦永彬, 黄瑞章, 陈艳平. 基于相关性约束矩阵分解的多标签分类方法[J]. 南京大学学报(自然科学版), 2023, 59(1): 76-84.
[2] 吕佳, 刘强, 李帅军. 结合密度峰值和改进自然邻居的自训练算法[J]. 南京大学学报(自然科学版), 2022, 58(5): 805-815.
[3] 吕佳, 郭铭. 基于密度峰值聚类和局部稀疏度的过采样算法[J]. 南京大学学报(自然科学版), 2022, 58(3): 483-494.
[4] 王一宾, 郑伟杰, 程玉胜, 曹天成. 基于PLSA学习概率分布语义信息的多标签分类算法[J]. 南京大学学报(自然科学版), 2021, 57(1): 75-89.
[5] 胡学钢*,许尧,李培培,张玉红. 一种过滤式多标签特征选择算法[J]. 南京大学学报(自然科学版), 2015, 51(4): 723-730.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!