Open Access Repository

Combination of CNN and RNN for speech emotion recognition

Vo, TH, Yeom, S ORCID: 0000-0002-5843-101X, Na, IS, Oh, A, Lee, GS, Yang, HJ and Kim, SH 2019 , 'Combination of CNN and RNN for speech emotion recognition', in SMA2019 - The International Conference on Smart Media & Applications , Smart Media - Korean Institute of Smart Media, South Korea, 163 -166 .

Full text not available from this repository.

Abstract

Speech Emotion Recognition has several potential applications in health care and human-computer interaction. In recent years, some studies use deep learning to deal with this task, but most of them use shallow neural networks due to the overfitting problem. We proposed an architecture that combines CNN and RNN to learn not only to capture the spatial but also the time-series information. The experiments on VGG-16 and ResNet-101 like networks gave a compatible result with some recent studies.

Item Type: Conference Publication
Authors/Creators:Vo, TH and Yeom, S and Na, IS and Oh, A and Lee, GS and Yang, HJ and Kim, SH
Keywords: emotion, speech, human computer interaction, RNN, CNN, Deep learning
Journal or Publication Title: SMA2019 - The International Conference on Smart Media & Applications
Publisher: Smart Media - Korean Institute of Smart Media
ISSN: 2287-4348
Copyright Information:

Copyright unknown

Related URLs:
Item Statistics: View statistics for this item

Actions (login required)

Item Control Page Item Control Page
TOP