南京大学学报(自然科学版) ›› 2021, Vol. 57 ›› Issue (5): 750756.doi: 10.13232/j.cnki.jnju.2021.05.004
• • 上一篇
Bin Ni1, Xiaolei Lu2(), Yiqi Tong3, Tao Ma1, Zhixian Zeng1
摘要:
通过引入BERT (Bidirectional Encoder Representation from Transformers)词向量和胶囊神经网络架构,建立期刊文本自动分类模型.选取三个不同规模的Web of Science数据集,以期刊领域的文本分类作为研究任务.在分析文本的基础上,对论文摘要进行多种深度学习算法训练.利用向量化的胶囊神经元和动态路由机制获取文本的局部?整体关系,最终实现更加精准的文本分类模型.实验结果表明,在该数据集上,基于胶囊神经网络的文本分类器的准确率、精准率、召回率和F1值等多项指标均领先于其他基线算法,同时动态路由的迭代次数需要综合考虑模型的损失与训练速度.
中图分类号:
1 | Johnson R,Watkinson A,Mabe M. The STM report:An overview of scientific and scholarly journal publishing. The 5th Edition. Hague,Netherlands:International Association of Scientific,Technical and Medical Publishers,2018. |
2 | 李广建,杨林. 大数据视角下的情报研究与情报研究技术. 图书与情报,2012(6):1-8. |
Li G J,Yang L. Intelligence analysis and intelligence technology in view of big data. Library and Information,2012(6):1-8. | |
3 | 邓三鸿,傅余洋子,王昊. 基于LSTM模型的中文图书多标签分类研究. 数据分析与知识发现,2017(7):52-60. |
Deng S H,Fu Y Y Z,Wang H. Multi?label classification of Chinese books with LSTM model. Data Analysis and Knowledge Discovery,2017(7):52-60. | |
4 | 邓要武,王连俊. 图书自动分类专家系统的设计尝试. 图书情报工作,1997(5):43-45. |
Deng Y W,Wang L J. The design of automatic book classification expert system. Library and Information Service,1997(5):43-45. | |
5 | McCallum A,Nigam K. A comparison of event models for naive bayes text classification∥AAAI?98 Workshop on Learning for Text Categorization. Madison,WI,USA:AAAI,1998:41-48. |
6 | Joachims T. Transductive inference for text classification using support vector machines∥Proceedings of the 16th International Conference on Machine Learning. Bled,Slovenia:Morgan Kaufmann Publishers Inc.,1999:200-209. |
7 | Forman G. An extensive empirical study of feature selection metrics for text classification.The Journal of Machine Learning Research,2003(3):1289-1305. |
8 | 陈白雪,宋培彦. 基于用户自然标注的TF?IDF辅助标引算法及实证研究. 图书情报工作,2018,62(1):132-139. (Chen B X,Song P Y. Empirical research on TF?IDF assisted indexing algorithm based on |
users' natural annotation. Library and Information Service,2018,62(1):132-139. | |
9 | Mikolov T,Sutskever I,Chen K,et al. Distributed representations of words and phrases and their compositionality∥Proceedings of the 26th Inter?national Conference on Neural Information Processing Systems. Sydney,Australia:Curran Associates Inc.,2013:3111-3119. |
10 | Kim Y. Convolutional neural networks for sentence classification. 2014,arXiv:. |
11 | Zhang X,Zhao J B,LeCun Y. Character?level convolutional networks for text classification∥Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal,Canada:MIT Press,2015:649-657. |
12 | Hochreiter S,Schmidhuber J. Long short?term memory. Neural Computation,1997,9(8):1735-1780. |
13 | Dai A M,Le Q V. Semi?supervised sequence learning∥Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal,Canada:MIT Press,2015:3079-3087. |
14 | Cho K,Van Merrienboer B,Bahdanau D,et al. On the properties of neural machine translation:Encoder?decoder approaches. 2014,arXiv:. |
15 | Kowsari K,Brown D E,Heidarysafa M,et al. HDLTex:Hierarchical deep learning for text classification∥2017 16th IEEE International Conference on Machine Learning and Applications. Cancun,Mexico:IEEE,2017:364-371. |
16 | Vaswani A,Shazeer N,Parmar N,et al. Attention is all you need∥Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach,CA,USA:Curran Associates Inc.,2017:6000-6010. |
17 | Hinton G,Alex K,Sida D. Transforming auto?encoders∥International Conference on Artificial Neural Networks. Springer Berlin Heidelberg,2011:44-51. |
18 | Sabour S,Frosst N,Hinton G E. Dynamic routing between capsules∥Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach,CA,USA:Curran Associates Inc.,2017:3859-3869. |
19 | Hinton G,Sabour S,Frosst N. Matrix capsules with EM routing∥6th International Conference on Learning Representations. Vancouver,Canada:ICLR Committee,2018:1-15. |
20 | Wang Y Q,Sun A X,Han J L,et al. Sentiment analysis by capsules∥Proceedings of 2018 World Wide Web Conference. Lyon,France:International World Wide Web Conferences Steering Committee,2018:1165-1174. |
21 | Kim J,Jang S,Choi S,et al. Text classification using capsules. 2018,arXiv:. |
22 | Zhao W,Ye J B,Yang M,et al. Investigating capsule networks with dynamic routing for text classification. 2018,arXiv:. |
23 | Devlin J,Chang M W,Lee K,et al. BERT: |
Pre?training of deep bidirectional transformers for language understanding. 2019,arXiv:. | |
24 | Chen K W,Zhang Z P,Long J,et al. Turning from TF?IDF to TF?IGM for term weighting in text classification. Expert Systems with Applications,2016(66):245-260. |
[1] | 陈磊, 孙权森, 王凡海. 基于深度对抗网络和局部模糊探测的目标运动去模糊[J]. 南京大学学报(自然科学版), 2021, 57(5): 735-749. |
[2] | 贾霄, 郭顺心, 赵红. 基于图像属性的零样本分类方法综述[J]. 南京大学学报(自然科学版), 2021, 57(4): 531-543. |
[3] | 普志方, 陈秀宏. 基于卷积神经网络的细胞核图像分割算法[J]. 南京大学学报(自然科学版), 2021, 57(4): 566-574. |
[4] | 段建设, 崔超然, 宋广乐, 马乐乐, 马玉玲, 尹义龙. 基于多尺度注意力融合的知识追踪方法[J]. 南京大学学报(自然科学版), 2021, 57(4): 591-598. |
[5] | 罗金屯, 滕飞, 周亚波, 池茂儒, 张海波. 数据驱动的高速铁路轮轨作用力反演模型[J]. 南京大学学报(自然科学版), 2021, 57(2): 299-308. |
[6] | 曾宪华, 陆宇喆, 童世玥, 徐黎明. 结合马尔科夫场和格拉姆矩阵特征的写实类图像风格迁移[J]. 南京大学学报(自然科学版), 2021, 57(1): 1-9. |
[7] | 余方超, 方贤进, 张又文, 杨高明, 王丽. 增强深度学习中的差分隐私防御机制[J]. 南京大学学报(自然科学版), 2021, 57(1): 10-20. |
[8] | 张萌, 韩冰, 王哲, 尤富生, 李浩然. 基于深度主动学习的甲状腺癌病理图像分类方法[J]. 南京大学学报(自然科学版), 2021, 57(1): 21-28. |
[9] | 李一凡, 朱斐, 凌兴宏, 刘全. 具有窗口结构Bi⁃LSTM网络的心电图QRS波检测方法[J]. 南京大学学报(自然科学版), 2021, 57(1): 42-51. |
[10] | 温玉莲, 林培光. 基于行业背景差异下的金融时间序列预测方法[J]. 南京大学学报(自然科学版), 2021, 57(1): 90-100. |
[11] | 潘越,王骏,李文飞,张建,王炜. 基于卷积神经网络的蛋白质折叠类型最小特征提取[J]. 南京大学学报(自然科学版), 2020, 56(5): 744-753. |
[12] | 朱伟,张帅,辛晓燕,李文飞,王骏,张建,王炜. 结合区域检测和注意力机制的胸片自动定位与识别[J]. 南京大学学报(自然科学版), 2020, 56(4): 591-600. |
[13] | 李康,谢宁,李旭,谭凯. 基于卷积神经网络和几何优化的统计染色体核型分析方法[J]. 南京大学学报(自然科学版), 2020, 56(1): 116-124. |
[14] | 韩普,刘亦卓,李晓艳. 基于深度学习和多特征融合的中文电子病历实体识别研究[J]. 南京大学学报(自然科学版), 2019, 55(6): 942-951. |
[15] | 张家精,夏巽鹏,陈金兰,倪友聪. 基于张量分解和深度学习的混合推荐算法[J]. 南京大学学报(自然科学版), 2019, 55(6): 952-959. |
|