跨语言语料库的语音情感识别对比研究

doi:10.13232/j.cnki.jnju.2019.05.008

[1]

宋鹏，郑文明，赵力 .

基于特征迁移学习方法的跨库语音情感识别

清华大学学报(自然科学版)，2016，56(11)：1179-1183.

[本文引用: 1]

Song

P

， Zheng

W M

， Zhao

L

.

Cross⁃corpus speech emotion recognition based on a feature transfer learning method

Journal of Tsinghua University (Natural Science Edition)，2016，56(11)：1179-1183.

[本文引用: 1]

[2]

Shah

M

， Chakrabarti

C

， Spanias

A

.

Within and cross⁃corpus speech emotion recognition using latent topic model⁃based features

EURASIP Journal on Audio，Speech，and Music Processing ，2015，2015(1)：4.

[本文引用: 1]

[3]

Schuller

B

， Vlasenko

B

， Eyben

F

，et al .

Cross⁃corpus acoustic emotion recognition：variances and strategies

IEEE Transactions on Affective Computing，2010，1(2)：119-131.

[本文引用: 1]

[4]

Schuller

B

， Zhang

Z X

， Weninger

F

，et al .

Using multiple databases for training in emotion recognition：to unite or to vote?∥Proceedings of the 12^th Annual Conference of the International Speech Communication Association

Florence，Italy，2011：1553-1556.

[本文引用: 1]

[5]

Abdelwahab

M

， Busso

C

.

Supervised domain adaptation for emotion recognition from speech∥2015 IEEE International Conference on Acoustics，Speech and Signal Processing

Brisbane，Australia：IEEE，2015：5058-5062.

[本文引用: 1]

[6]

Mao

Q R

， Xue

W T

， Rao

Q R

，et al .

Domain adaptation for speech emotion recognition by sharing priors between related source and target classes∥2016 IEEE International Conference on Acoustics，Speech and Signal Processing

Shanghai，China：IEEE，2016：2608-2612.

[本文引用: 1]

[7]

李爱军，邵鹏飞，党建武 .

情感表达的跨文化多模态感知研究

清华大学学报(自然科学版)，2009，49(S1)：1393-1401.

[本文引用: 1]

Li

A J

， Shao

P F

， Dang

J W

.

Intercultural multimodal perception of emotional expressions

Journal of Tsinghua University (Natural Science Edition)，2009，49(S1)：1393-1401.

[本文引用: 1]

[8]

Scherer

K R

， Banse

R

， Wallbott

H G

.

Emotion inferences from vocal expression correlate across languages and cultures

Journal of Cross⁃Cultural Psychology，2001，32(1)：76-92.

[本文引用: 2]

[9]

Pell

M D

， Paulmann

S

， Dara

C

，et al .

Factors in the recognition of vocally expressed emotions：a comparison of four languages

Journal of Phonetics，2009，37(4)：417-435.

[本文引用: 1]

[10]

Paulmann

S

， Uskul

A K

.

Cross⁃cultural emotional prosody recognition：evidence from Chinese and British listeners

Cognition and Emotion，2014，28(2)：230-244.

[本文引用: 1]

[11]

Koeda

M

， Belin

P

， Hama

T

，et al .

Cross⁃cultural differences in the processing of non⁃verbal affec⁃tive vocalizations by Japanese and Canadian listeners

Frontiers in Psychology，2013，4：105.

[本文引用: 2]

[12]

Sauter

D A

， Eisner

F

， Ekman

P

，et al .

Cross⁃cultural recognition of basic emotions through nonverbal emotional vocalizations

Proceedings of the National Academy of Sciences of the United States of America，2010，107(6)：2408-2412.

[本文引用: 1]

[13]

Lanjewar

R B

， Mathurkar

S

， Patel

N

.

Implementation and comparison of speech emotion recognition system using Gaussian Mixture Model (GMM) and K⁃Nearest Neighbor (K⁃NN) techni⁃ques

Procedia Computer Science，2015，49：50-57.

[本文引用: 1]

[14]

孙红进

.

基于GMM的语音情感信息识别

信息技术，2008(12)：138-140.

[本文引用: 1]

Sun

H J

.

Emotion recognition of speech based on GMM

Information Technology，2008(12)：138-140.

[本文引用: 1]

[15]

Chen

Y L

， Zhang

Z

.

Research on text sentiment analysis based on CNNs and SVM∥2018 13^th IEEE Conference on Industrial Electronics and Applications (ICIEA)

Wuhan，China：IEEE，2018：2731-2734.

[本文引用: 1]

[16]

任浩，叶亮，李月等 .

基于多级SVM分类的语音情感识别算法

计算机应用研究，2017，34(6)：1682-1684.

[本文引用: 1]

Ren

H

， Ye

L

， Li

Y

,et al .

Speech emotion recognition algorithm based on multi⁃layer SVM classification

Application Research of Computers，2017，34(6)：1682-1684.

[本文引用: 1]

[17]

Zhao

J F

， Xia

M

， Chen

L J

.

Learning deep features to recognise speech emotion using merged deep CNN

IET Signal Processing，2018，12(6)：713-721.

[本文引用: 2]

[18]

薄洪健，马琳，孔祥浩等 .

基于卷积神经网络学习的语音情感特征降维方法研究

高技术通讯，2017，27(11-12)：889-898.

[本文引用: 1]

Bo

H J

， Ma

L

， Kong

X H

,et al .

Research on a dimension reduction method of speech emotional feature based on convolution neural network

Chinese High Technology Letters，2017，27(11-12)：889-898.

[本文引用: 1]

[19]

Chao

L L

， Tao

J H

， Yang

M H

，et al .

Long short term memory recurrent neural network based encoding method for emotion recognition in video∥IEEE International Conference on Acoustics，Speech and Signal Processing

Shanghai，China：IEEE，2016：2752-2756.

[本文引用: 1]

[20]

刘畅，张一珂，张鹏远等 .

基于改进主题分布特征的神经网络语言模型

电子与信息学报，2018，40(1)：219-225.

[本文引用: 1]

Liu

C

， Zhang

Y K

， Zhang

P Y

,et al .

Neural network language modeling using an improved topic distribution feature

Journal of Electronics and Information Technology，2018，40(1)：219-225.

[本文引用: 1]

[21]

Eyben

F

， Wöllmer

M

， Schuller

B

.

Opensmile：The munich versatile and fast open⁃source audio feature extractor∥Proceedings of the 18^th ACM International Conference on Multimedia

.Firenze，Italy：ACM，2010：1459-1462.

[本文引用: 1]

[22]

Milton

A

， Roy

S S

， Selvi

S T

.

SVM scheme for speech emotion recognition using MFCC feature

International Journal of Computer Applications，2013，69(9)：34-39.

[本文引用: 1]

[23]

Wollmer

M

， Schuller

B

， Eyben

F

，et al .

Combining long short⁃term memory and dynamic Bayesian networks for incremental emotion⁃sensitive artificial listening

IEEE Journal of Selected Topics in Signal Processing，2010，4(5)：867-881.

[本文引用: 1]

[24]

Busso

C

， Bulut

M

， Lee

C C

，et al .

IEMOCAP：interactive emotional dyadic motion capture database

Language Resources and Evaluation，2008，42(4)：335-359.

[本文引用: 1]

[25]

Pan

S F

， Tao

J H

， Li

Y

.

The CASIA audio emotion recognition method for audio/visual emotion challenge 2011∥Proceedings of the 4^th International Conference on Affective Computing and Intelligent Interaction

.Memphis，TN,USA：ACM，2011：388-395.

[本文引用: 1]

[26]

Burkhardt

F

， Paeschke

A

， Rolfes

M

，et al .

A database of German emotional speech∥Proceedings of Interspeech 2005

Lisbon，Portugal，2005：1517-1520.

[本文引用: 1]

[27]

Juth

P

， Lundqvist

D

， Karlsson

A

，et al .

Looking for foes and friends：perceptual and emotional factors when finding a face in the crowd

Emotion，2005，5(4)：379-395.

[本文引用: 1]

[28]

Shimamura

A P

， Ross

J G

， Bennett

H D

.

Memory for facial expressions：the power of a smile

Psychonomic Bulletin & Review，2006，13(2)：217-222.

[本文引用: 1]

[29]

Scherer

K R

.

The role of culture in emotion⁃antecedent appraisal

Journal of Personality & Social Psychology，1997，73(5)：902-922.

[本文引用: 1]

基于特征迁移学习方法的跨库语音情感识别

1

2016