Journal of Nanjing University (Natural Sciences) ›› 2019, Vol. 55 ›› Issue (6): 1000-1009. doi: 10.13232/j.cnki.jnju.2019.06.012
Yang Xu, Wenxuan Zhou, Huibin Ruan, Yu Sun, Yu Hong
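Abstract:
Discourse relation recognition aims to identify the semantic relations between discourse units within a text (called "arguments" for short, which may be phrases, sentences, or text spans). Existing studies use interactive attention mechanisms to strengthen the exchange of information between arguments and thereby improve classification performance. However, improving the interaction between arguments alone cannot capture the overall semantics of an argument pair, because existing methods tend to treat each argument pair as an isolated unit and ignore the influence of context on its meaning. To address this problem, we propose an implicit discourse relation recognition method based on hierarchical representation. A word-level interactive attention mechanism first extracts the more important words or phrases; an argument-level attention mechanism then assigns higher weights to key arguments; finally, a context-based attention mechanism incorporates information from the paragraph containing the argument pair, yielding an argument-pair representation that carries contextual semantic information. The method further strengthens the information interaction between arguments and also strengthens the interaction between the argument pair and its context. Experiments on the PDTB (Penn Discourse Treebank) corpus show that the method improves the F1 score over the baseline system by 4.94%, 5.43%, 4.57%, and 7.42% on the four top-level relation classes (Comparison, Contingency, Expansion, and Temporal), respectively.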
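The three attention levels named in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, not the authors' model: the dot-product scoring, the mean pooling used to form queries, applying argument-level attention to the two arguments of the pair, the concatenation at the end, and all function names are hypothetical. The abstract does not specify the encoder, the parametrization, or the classifier, so none of those appear.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def mean_pool(vectors):
    """Element-wise mean of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def attend(query, vectors):
    """Dot-product attention (an assumed scoring function).
    Returns the attention weights and the weighted sum of `vectors`."""
    weights = softmax([dot(query, v) for v in vectors])
    summary = [sum(w * x for w, x in zip(weights, col)) for col in zip(*vectors)]
    return weights, summary

def hierarchical_pair_representation(arg1_words, arg2_words, context_vectors):
    # Level 1: word-level interactive attention. Each argument's words are
    # scored against the mean-pooled *other* argument (assumed query).
    _, arg1_rep = attend(mean_pool(arg2_words), arg1_words)
    _, arg2_rep = attend(mean_pool(arg1_words), arg2_words)

    # Level 2: argument-level attention. The argument that scores higher
    # against the pooled pair receives the larger weight (assumed scope:
    # only the two arguments of the pair).
    arg_reps = [arg1_rep, arg2_rep]
    arg_weights, pair_rep = attend(mean_pool(arg_reps), arg_reps)

    # Level 3: context-level attention. The pair representation queries
    # the vectors of the surrounding paragraph.
    ctx_weights, ctx_rep = attend(pair_rep, context_vectors)

    # List concatenation: pair representation followed by context summary.
    return pair_rep + ctx_rep, arg_weights, ctx_weights

# Toy 2-dimensional "embeddings", invented purely for the demonstration.
arg1 = [[1.0, 0.0], [0.0, 1.0]]
arg2 = [[1.0, 0.0], [1.0, 0.0]]
context = [[0.5, 0.5], [1.0, 0.0], [0.0, 1.0]]
rep, arg_w, ctx_w = hierarchical_pair_representation(arg1, arg2, context)
print(len(rep), arg_w, ctx_w)
```

In a trained model each level would have learned parameters and the final vector would feed a relation classifier; the sketch only shows how the output of one attention level becomes the query or input of the next.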