Deep Cross-Corpus Speech Emotion Recognition: Recent Advances and Perspectives.

Abstract

Automatic speech emotion recognition (SER) is a challenging component of human-computer interaction (HCI). Existing literatures mainly focus on evaluating the SER performance by means of training and testing on a single corpus with a single language setting. However, in many practical applications, there are great differences between the training corpus and testing corpus. Due to the diversity of different speech emotional corpus or languages, most previous SER methods do not perform well when applied in real-world cross-corpus or cross-language scenarios. Inspired by the powerful feature learning ability of recently-emerged deep learning techniques, various advanced deep learning models have increasingly been adopted for cross-corpus SER. This paper aims to provide an up-to-date and comprehensive survey of cross-corpus SER, especially for various deep learning techniques associated with supervised, unsupervised and semi-supervised learning in this area. In addition, this paper also highlights different challenges and opportunities on cross-corpus SER tasks, and points out its future trends.Copyright © 2021 Zhang, Liu, Tao and Zhao.

Show Full Text

Deep Cross-Corpus Speech Emotion Recognition: Recent Advances and Perspectives.

Researchers

Journal

Modalities

Models

Abstract

A deep learning framework for segmentation and pose estimation of pedicle screw implants based on C-arm fluoroscopy.

RadioBERT: A deep learning-based system for medical report generation from chest x-ray images using contextual embeddings.

Research on Data Analysis Network of TCM Tongue Diagnosis Based on Deep Learning Technology.

Affective State Recognition with Convolutional Autoencoders.

Breast cancer histopathology image-based gene expression prediction using spatial transcriptomics data and deep learning.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply