A New Network Structure for Speech Emotion Recognition Research.

Abstract

Deep learning promotes the breakthrough of emotion recognition in many fields, especially speech emotion recognition (SER). As an important part of speech emotion recognition, the most relevant acoustic feature extraction has always attracted the attention of existing researchers. Aiming at the problem that the emotional information contained in the current speech signals is distributed dispersedly and cannot comprehensively integrate local and global information, this paper presents a network model based on a gated recurrent unit (GRU) and multi-head attention. We evaluate our proposed emotion model on the IEMOCAP and Emo-DB corpora. The experimental results show that the network model based on Bi-GRU and multi-head attention is significantly better than the traditional network model at detecting multiple evaluation indicators. At the same time, we also apply the model to a speech sentiment analysis task. On the CH-SIMS and MOSI datasets, the model shows excellent generalization performance.

Show Full Text

A New Network Structure for Speech Emotion Recognition Research.

Researchers

Journal

Modalities

Models

Abstract

Prediction of knee adduction moment using innovative instrumented insole and deep learning neural networks in healthy female individuals.

Deep learning to distinguish Best vitelliform macular dystrophy (BVMD) from adult-onset vitelliform macular degeneration (AVMD).

How Artificial Intelligence Unravels the Complex Web of Cancer Drug Response.

A Spectral-Spatial-Dependent Global Learning Framework for Insufficient and Imbalanced Hyperspectral Image Classification.

Volumetric tumor tracking from a single cone-beam X-ray projection image enabled by deep learning.

PlantPAD: a platform for large-scale image phenomics analysis of disease in plant science.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply