南京大学学报(自然科学版) ›› 2019, Vol. 55 ›› Issue (4): 660666.doi: 10.13232/j.cnki.jnju.2019.04.016
Wei Wang(),Tingting Hu,Yaqin Feng
摘要:
语音是情感表达的重要途径,自然状态和表演状态下的语音所蕴含的情感信息并不完全相同.为了探索自然状态和表演状态下语音情感识别的差异,采用深度学习算法分析了IEMOCAP公用数据集,对自然状态和表演状态下的中性、愤怒、开心和悲伤等四类情绪语音数据进行实验:首先提取语音数据的声学特征(对比了emobase2010特征集和eGeMAPs特征集),然后利用卷积神经网络(Convolutional Neural Networks,CNN)对自然与表演状态下的语音情感进行识别,比较了两种状态下的情感识别率,再利用混淆矩阵分析两种状态下不同情绪之间的误分率和相似性.实验结果显示,自然状态下的情感识别率明显高于表演状态下,还发现愤怒和悲伤在两种状态下的误分率有明显区别.该现象对理解情绪的表达机制有启发意义.
中图分类号:
1 |
FanX H,ZhaoH M,ChenX Q,et al. Deceptive speech detection based on sparse representation∥2016 IEEE 12th International Colloquium on Signal Processing & Its Applications. Malacca City,Malaysia:IEEE,2016,DOI:10.1109/CSPA.2016.7515793.
doi: 10.1109/CSPA.2016.7515793 |
2 | PanX Y,ZhaoH M,ZhouY. The application of fractional Mel cepstral coefficient in deceptive speech detection. PreeJ,2015,3:e1194. |
3 |
LevitanS I,AnG Z,WangM D,et al. Cross?cultural production and detection of deception from speech∥Proceedings of the 2015 ACM on Workshop on Multimodal Deception Detection. New York,NY,USA:ACM,2015,DOI:10.1145/2823465.2823468.
doi: 10.1145/2823465.2823468 |
4 | DevillersL,VidrascuL,LamelL. Challenges in real?life emotion annotation and machine learning based detection. Neural Networks,2005,18(4):407-422. |
5 | HirschbergJ,BenusS,BrenierJ M,et al. Distinguishing deceptive from non?deceptive speech∥Interspeech 2005. Lisbon,Portugal:ISCA,2005:1833-1836. |
6 |
MendelsG,LevitanS I,LeeK Z,et al. Hybrid acoustic?lexical deep learning approach for deception detection∥Proceedings of Interspeech 2017. Stockholm,Sweden:ISCA,2017,DOI:10.21437/Interspeech.2017-1723.
doi: 10.21437/Interspeech.2017-1723 |
7 |
FanC,ZhaoH M,ChenX Q,et al. Distinguishing deception from non?deception in Chinese speech∥2015 6th International Conference on Intelligent Control and Information Processing. Wuhan,China:IEEE,2016,DOI:10.1109/ICICIP.2015. 7388181.
doi: 10.1109/ICICIP.2015. 7388181 |
8 | HirschbergJ. Deceptive speech:clues from spoken language. Chen F,Jokinen K. Speech Technology.Boston:Springer,2010,79-88. |
9 | EkmanP. Telling lies:clues to deceit in the marketplace,politics,and marriage (Revised Edition)New York:W. W. Norton & Company,2009,416. |
10 | Douglas?CowieE,CampbellN,CowieR,et al. Emotional speech:towards a new generation ofdatabases. Speech Communication,2003,40(1-2):33-60. |
11 | BatlinerA,FischerK,HuberR,et al. Desperately seeking emotions or:actors,wizards and human beings∥ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. Northern Ireland,United Kingdom:Newcastle,2000:195-200. |
12 | ChenchahF,LachiriZ. A bio?inspired emotion recognition system under real?life conditions. Applied Acoustics,2017,115:6-14. |
13 | NeumannM,VuN T. Attentive convolutional neural network based speech emotion recognition:s study on the impact of input features,signal length,and acted speech. 2017,arXiv:1706. 00612. |
14 | BussoC,BulutM,LeeC C,et al. IEMOCAP:interactive emotional dyadic motion capture database. LanguageResources and Evaluation,2008,42(4):335-359. |
15 | 胡婷婷,沈凌洁,冯亚琴等. 语音与文本情感识别中愤怒与开心误判分析. 计算机技术与发展,2018,28(11):130-133. |
Hu T T,Shen L J,Feng Y Q,et al.Research on anger and happy misclassification in speech and text emotion recognition. Computer Technology and Development,2018,28(11):130-133. | |
16 | SchullerB,SteidlS,BatlinerA,et al. The INTERSPEECH 2010 paralinguistic challenge?age,gender,and affect∥Interspeech 2010. Makuhari,Japan:ISCA,2010:2794-2797. |
17 | Eybe,SchererK R,SchullerB W,et al. The geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing. IEEE Transactions on Affective Computing,2016,7(2):190-202. |
[1] | 朱伟,张帅,辛晓燕,李文飞,王骏,张建,王炜. 结合区域检测和注意力机制的胸片自动定位与识别[J]. 南京大学学报(自然科学版), 2020, 56(4): 591-600. |
[2] | 李康,谢宁,李旭,谭凯. 基于卷积神经网络和几何优化的统计染色体核型分析方法[J]. 南京大学学报(自然科学版), 2020, 56(1): 116-124. |
[3] | 韩普,刘亦卓,李晓艳. 基于深度学习和多特征融合的中文电子病历实体识别研究[J]. 南京大学学报(自然科学版), 2019, 55(6): 942-951. |
[4] | 张家精,夏巽鹏,陈金兰,倪友聪. 基于张量分解和深度学习的混合推荐算法[J]. 南京大学学报(自然科学版), 2019, 55(6): 952-959. |
[5] | 钟琪,冯亚琴,王蔚. 跨语言语料库的语音情感识别对比研究[J]. 南京大学学报(自然科学版), 2019, 55(5): 765-773. |
[6] | 张鹏,黄毅,阮雅端,陈启美*. 基于稀疏特征的交通流视频检测算法[J]. 南京大学学报(自然科学版), 2015, 51(2): 264-270. |
|