南京大学学报(自然科学版) ›› 2024, Vol. 60 ›› Issue (1): 118129.doi: 10.13232/j.cnki.jnju.2024.01.012
• • 上一篇
Xianyu Hou(), Yuming Chen, Keshou Wu
摘要:
粒化是一种构建粒数据与粒模型的方法.近些年来,有多种粒化方法被提出,如基于样本相似度尺度的相似度粒化、基于邻域关系的邻域粒化和基于特征尺度变换的旋转粒化等.这些粒化方法都在监督与非监督任务中获得优秀的表现.但是这些粒化方法都是基于样本本身的度量关系构建的,会导致样本在粒化过程中的信息量呈现不同量级的扩展现象.这一特征使粒化后的粒子在一些情况下难以处理.因此,提出一种基于多采样方法构建近似粒子的粒化方式以保证粒化过程被限制在有限量级,并且在粒化过程中抛弃固定的度量关系式,粒化的结果会随着选取的近似集与近似基模型的不同而变化,使得样本在粒化为粒子时有着更高的灵活性.文中对多采样近似粒化和多种粒化方法进行详细比较,结果表明多采样近似粒化有着更高的分类性能,且与多种先进的集成算法做了详细比较,结果表明在分类任务上多采样近似粒集成模型拥有着更好的鲁棒性与泛化性.
中图分类号:
1 | Morente?Molinera J A, Mezei J, Carlsson C,et al. Improving supervised learning classification methods using multigranular linguistic modeling and fuzzy entropy. IEEE Transactions on Fuzzy Systems,2017,25(5):1078-1089. |
2 | Opitz D, Maclin R. Popular ensemble methods:An empirical study. Journal of Artificial Intelligence Research,1999,11(1):169-198. |
3 | Quadrianto N, Ghahramani Z. A very simple safe?Bayesian random forest. IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(6):1297-1303. |
4 | Jiang S H, Mao H Y, Ding Z M,et al. Deep decision tree transfer boosting. IEEE Transactions on Neural Networks and Learning Systems,2020,31(2):383-395. |
5 | Zadeh L A. Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets and Systems,1997,90(2):111-127. |
6 | Bhapkar H R, Mahalle P N, Shinde G R,et al. Rough sets in COVID?19 to predict symptomatic cases∥Santosh K C,Joshi A. COVID?19:Prediction,decision?making,and its impacts. Springer Berlin Heidelberg,2021:57-68. |
7 | Chen Y M, Zhu S Z, Li W,et al. Fuzzy granular convolutional classifiers. Fuzzy Sets and Systems,2022,426:145-162. |
8 | Niu J J, Chen D G, Li J H,et al. Fuzzy rule?based classification method for incremental rule learning. IEEE Transactions on Fuzzy Systems,2022,30(9):3748-3761. |
9 | Meher S K, Pal S K. Rough?wavelet granular space and classification of multispectral remote sensing image. Applied Soft Computing,2011,11(8):5662-5673. |
10 | Borowska K, Stepaniuk J. A rough?granular approach to the imbalanced data classification problem. Applied Soft Computing,2019,83:105607. |
11 | Hu X C, Pedrycz W, Wang X M. Fuzzy classifiers with information granules in feature space and logic?based computing. Pattern Recognition,2018,80:156-167. |
12 | Yao Y Y. Three perspectives of granular computing. Journal of Nanchang Institute of Technology,2006,25(2):16-21. |
13 | 胡清华,于达仁,谢宗霞. 基于邻域粒化和粗糙逼近的数值属性约简. 软件学报,2008,19(3):640-649. |
Hu Q H, Yu D R, Xie Z X. Numerical attribute reduction based on neighborhood granulation and rough approximation. Journal of Software,2008,19(3):640-649. | |
14 | 傅兴宇,陈颖悦,陈玉明,等. 一种全连接粒神经网络分类方法. 山西大学学报(自然科学版),2023,46(1):91-100. |
Fu X Y, Chen Y Y, Chen Y M,et al. A classification method of fully connected granular neural network. Journal of Shanxi University (Natural Science Edition),2023,46(1):91-100. | |
15 | Jiang H L, Chen Y M, Kong L R,et al. An LVQ clustering algorithm based on neighborhood granules. Journal of Intelligent & Fuzzy Systems:Applications in Engineering and Technology,2022,43(5):6109-6122. |
16 | Li W, Chen Y M, Song Y P. Boosted K?nearest neighbor classifiers based on fuzzy granules. Knowledge?Based Systems,2020,195:105606. |
17 | Lin S H, Zhang K B, Guan D,et al. An intrusion detection method based on granular autoencoders. Journal of Intelligent & Fuzzy Systems:Applications in Engineering and Technology,2023,44(5):8413-8424. |
18 | 陈玉明,蔡国强,卢俊文,等. 一种邻域粒K均值聚类方法. 控制与决策,2023,38(3):857-864. |
Chen Y M, Cai G Q, Lu J W,et al. A neighborhood granular K?means clustering method. Control and Decision,2023,38(3):857-864. | |
19 | Chen Y M, Qin N, Li W,et al. Granule structures,distances and measures in neighborhood systems. Knowledge?Based Systems,2019,165:268-281. |
20 | Chen Y M, Zhu Q X, Wu K S,et al. A binary granule representation for uncertainty measures in rough set theory. Journal of Intelligent & Fuzzy Systems:Applications in Engineering and Technology,2015,28(2):867-878. |
21 | Chen J F, Zhu J, Song L. Stochastic training of graph convolutional networks with variance reduction. 2018,arXiv:. |
22 | Chiang W L, Liu X Q, Si S,et al. Cluster?GCN:An efficient algorithm for training deep and large graph convolutional networks∥Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. Anchorage,USA:ACM,2019:257-266. |
23 | Feng K X, Lu Z Z, Ling C Y,et al. Fuzzy importance sampling method for estimating failure possibility. Fuzzy Sets and Systems,2021,424:170-184. |
24 | Müller T, McWilliams B, Rousselle F,et al. Neural importance sampling. ACM Transactions on Graphics,2019,38(5):145. |
25 | Grittmann P, Georgiev I, Slusallek P,et al. Variance?aware multiple importance sampling. ACM Transactions on Graphics,2019,38(6):152. |
26 | Huang X L, Li Z H, Jin Y L,et al. Fair?AdaBoost:Extending AdaBoost method to achieve fair classification. Expert Systems with Applications,2022,202:117240. |
27 | Liu B, Liu C D, Xiao Y S,et al. AdaBoost?based transfer learning method for positive and unlabelled learning problem. Knowledge?Based Systems,2022,241:108162. |
28 | Jiang X, Xu Y, Ke W,et al. An imbalanced multifault diagnosis method based on bias weights AdaBoost. IEEE Transactions on Instrumentation and Measurement,2022,71:3505908. |
29 | Guryanov A. Histogram?based algorithm for building gradient boosting ensembles of piecewise linear decision trees∥8th International Conference on Analysis of Images,Social Networks and Texts. Springer Berlin Heidelberg,2019:39-50. |
30 | Chen T Q, Guestrin C. Xgboost:A scalable tree boosting system∥Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco,USA:ACM,2016:785-794. |
31 | Dong W, Huang Y M, Lehane B,et al. XGBoost algorithm?based prediction of concrete electrical resistivity for structural health monitoring. Automation in Construction,2020,114:103155. |
[1] | 李小川, 张超, 李德玉, 上官学奎, 马瑾男, 陆文瑞. 基于TPOP法的犹豫模糊语言稳健型多粒度群决策[J]. 南京大学学报(自然科学版), 2023, 59(1): 22-34. |
[2] | 于子淳, 吴伟志. 用证据理论刻画协调的具有多尺度决策的信息系统的最优尺度选择[J]. 南京大学学报(自然科学版), 2022, 58(1): 71-81. |
[3] | 郑嘉文, 吴伟志, 包菡, 谭安辉. 基于熵的多尺度决策系统的最优尺度选择[J]. 南京大学学报(自然科学版), 2021, 57(1): 130-140. |
[4] | 郭英杰, 胡峰, 于洪, 张红亮. 基于时间粒的铝电解过热度预测模型[J]. 南京大学学报(自然科学版), 2019, 55(4): 624-632. |
[5] | 方 宇1,闵 帆1*,刘忠慧1,杨 新2. 序贯三支决策的代价敏感分类方法[J]. 南京大学学报(自然科学版), 2018, 54(1): 148-. |
[6] | 顾沈明, 张 昊, 吴伟志, 谭安辉, . 多标记序决策系统中基于局部最优粒度的规则获取[J]. 南京大学学报(自然科学版), 2017, 53(6): 1012-. |
[7] | 杨 洁1,2,王国胤1*,庞紫玲1. 密度峰值聚类相关问题的研究[J]. 南京大学学报(自然科学版), 2017, 53(4): 791-. |
[8] | 李同军1,2*,吴伟志1,2,顾沈明1,2. 双论域上基于Brouwer-正交补的粗糙近似[J]. 南京大学学报(自然科学版), 2016, 52(5): 871-. |
[9] | 顾沈明1,2*,万雅虹1,2,吴伟志1,2,徐优红1,2. 多粒度决策系统的局部最优粒度选择[J]. 南京大学学报(自然科学版), 2016, 52(2): 280-. |
[10] | 马对霞*,祝 峰,林姿琼. 基于等价关系的粗糙拟阵的性质[J]. 南京大学学报(自然科学版), 2016, 52(2): 300-. |
[11] | 戴志聪,吴伟志*. 不完备多粒度序信息系统的粗糙近似 [J]. 南京大学学报(自然科学版), 2015, 51(2): 361-367. |
[12] | 谢 珺*,秦 琴,续欣莹. 全覆盖粒计算模型的粒化、知识逼近及其算子性质研究[J]. 南京大学学报(自然科学版), 2015, 51(1): 105-110. |
[13] | 徐健锋1,2张远健1Duanning Zhou2,Dan Li 2,李宇1,. 基于粒计算的不确定性时间序列建模及其聚类[J]. 南京大学学报(自然科学版), 2014, 50(1): 86-. |
[14] | 顾沈明**,叶晓敏,吴伟志 . 多标记粒度不完备信息系统的粗糙近似*[J]. 南京大学学报(自然科学版), 2013, 49(2): 250-257. |
[15] | 据春华,帅朝谦**,封毅 . 基于粒计算的商业数据流概念漂移特征选择*[J]. 南京大学学报(自然科学版), 2011, 47(4): 391-397. |
|