A multilabel feature selection algorithm by exploiting label correlations locally
Cai Yaping,Yang Ming*
Author information+
{{custom_zuoZheDiZhi}}
About authors:
College of Computer Science,Nanjing Normal University,Nanjing,210023,China
{{custom_authorNodes}}
{{custom_bio.content}}
{{custom_bio.content}}
{{custom_authorNodes}}
Collapse
History+
Published
2016-07-24
Issue Date
2016-07-24
Abstract
Different from traditional supervised learning framework in which each object is assigned to only one concept of label,the condition which one object may be associated with multiple labels simultaneously in multilabel learning is able to analyze the problems in the real world more effectively.In recent years,multilabel learning has been attracting a great deal of attention in machine learning.In multilabel learning,each object may be associated with multiple labels simultaneously,and these labels are related to each other.So how to effectively discover and exploit correlations among labels is the core research issue of multilabel learning.A series of multilabel learning algorithms by exploiting label correlations have been proposed and applied successfully in many application areas.However,there are a lot of redundant features and irrelevant features existing in high dimensional data which reduce the performance of classifiers,and few multilabel feature selection algorithms consider the label correlations.Meanwhile,most of the existed algorithms for multilabel feature selection exploit the global label correlations,which assuming that the label correlations are shared by all the instances.However,in realworld tasks,different instances may share different label correlations.With this respect,we focuse on how to exploit the label correlations locally to improve multilabel feature selection and help multilabel classification.In this paper,a novel multilabel feature selection algorithm by exploiting the label correlation locally(LocMLFS)is introduced.The algorithm takes advantages of local label correlations(the correlations are not shared by all instances)in multilabel feature selection algorithm.To achieve the use of local label correlations,LocMLFS divides the samples into groups by category clustering and use multilabel feature selection to each group.At the same time,the algorithm can be extended to a unified framework.Experimental results on the datasets demonstrate that LocMLFS achieves superior performance.
Cai Yaping,Yang Ming* .
A multilabel feature selection algorithm by exploiting label correlations locally[J]. Journal of Nanjing University(Natural Sciences), 2016, 52(4): 693
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
References
[1] Tsoumakas G,Katakis I.Multilabel classification:An overview.International Journal of Data Warehousing and Mining,2007,3(3):1-13. [2] Tsoumakas G,Katakis,Vlahavas I.Mining multilabel data.In:Maimon O,Rokach L.Data Mining and Knowledge Discovery Handbook,Part 6.The 2nd Edition.US:Springer,2010,67-685. [3] Schapire R E,Singer Y.Boostexter:A boostingbased system for text categorization.Machine Learning,2000,39(2/3):135-168. [4] Godbole S,Sarawagi S.Discriminative methods for multilabeled classification.In:PAKDD’04:The 8th PacificAsia Conference on Knowledge Discovery and Data Mining.Berlin:Springer,2004,22-30. [5] Fürnkranz J,Hüllermeier E,Mencía E L,et al.Multilabel classification via calibrated label ranking.Machine Learning,2008,73(2):133-153. [6] Clare A,King R D.Knowledge discovery in multilabel phenotype data.In:De Raedt L,Siebes A.Lecture Notes in Computer Science 2168.Berlin:Springer,2001,42-53. [7] Elisseeff A,Weston J.A kernel method for multilabelled classification.In:Dietteroch T G,Bercker S,Ghahramani Z.Advances in Neural Information Processing Systems 14.Cambridge,MA:MIT Press,2002,681-687. [8] Barutcuoglu Z,Schapire R E,Troyanskaya O G.Hierarchical multilabel prediction of gene function.Bioinformatics,2006,22(7):830-836. [9] Boutell M R,Luo J,Shen X,et al.Learning multilabel scene classification.Pattern Recognition.2004,37(9):1757-1771. [10] Qi G J,Hua X S,Rui Y,et al.Correlative multilabel video annotation.In:Proceedings of the 15th ACM International Conference on Multimedia.New York,NY:ACM Press,2007,17-26. [11] Snoek C G M,Worring M,Gemert J C V,et al.The challenge problem for automated detection of 101 semantic concepts in multimedia.In:Proceedings of the 14th ACM International Conference on Multimedia(ACM Multimedia’06).Santa Barbara,CA:ACM,2006,421-430. [12] Tang L,Rajan S,Narayanan V K.Large scale multilabel classification via metalabeler.In:Proceedings of the 19th International Conference on World Wide Web(WWW’09).Madrid,Spain:ACM,2009,211-220. [13] Yang B,Sun J T,Wang T,et al.Effective multilabel active learning for text classification.In:Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD’09).Paris,France:ACM,2009,917-926. [14] 黄圣君.多标记学习中标记关系利用的研究.博士学位论文.南京:南京大学,2014.(Huang S J.Research on the exploiting of the label relationship in multi label learning.Ph.D Dissertation.Nanjing:Nanjing University,2014.) [15] Andre C P L F,Freitas A A.A tutorial on multilabel classification techniques.In:Function Approximation and Classification.Berlin:Springer,2009,5:177-195. [16] Zhang M L,Peña J M,Robles V.Feature selection for multilabel naive Bayes classification.Information Sciences,2009,179(19):3218-3229. [17] Spolaor N,Cherman E,Monard M,et al.A comparison of multilabel feature selection methods using the problem transformation approach.Electronic Notes in Theoretical Computer Science,2013,292:135-151. [18] Kong D,Ding C H Q,Huang H,et al.Multilabel reliefF and Fstatistic feature selections for image annotation.In:2013 IEEE Conference on Computer Vision and Paffern Recognition,2012,2352-2359. [19] Pupo O,Morell C,Soto S.ReliefFML:An extension of reliefF algorithm to multilabel learning.In:Proceedings of 18th Iberoamerican Congress.Havana,Cuba:Springer,2013,528-535. [20] Cai Y P,Yang M,Gao Y,et al.ReliefFbased multilabel feature selection.International Journal of Database Theory and Application,2015,8(4):307-318. [21] Huang S J,Zhou Z H.Multilabel learning by exploiting label correlations locally.In:Proceedings of the 26th AAAI Conference on Artificial Intelligence.Toronto,Canada:Machine Learning,2012,949-955. [22] Zhang M L,Zhou Z H.A review on multilabel learning algorithms.IEEE Transactions on Knowledge and Data Engineering,2014,26(8):1819-1836. [23] Zhang M L,Zhou Z H.MLKNN:A lazy learning approach to multilabel learning.Pattern Recognition,2007,40(7):2038-2048.