Implementing machine learning techniques for continuous emotion prediction from uniformly segmented voice recordings.

Researchers

Hannes Diemerling, Leonie Stresemann, Timo von Oertzen, Tina Braun

Models

Convolutional Neural Networks (CNN), Deep Neural Networks (DNN), Hybrid Model (C-DNN)

Abstract

Emotion recognition from audio recordings is a rapidly advancing field, with significant implications for artificial intelligence and human-computer interaction. This study introduces a novel method for detecting emotions from short, 1.5 s audio samples, aiming to improve accuracy and efficiency in emotion recognition technologies.

We used 1,510 unique audio samples from two databases, in German and English, to train our models. We extracted various features for emotion prediction, employing Deep Neural Networks (DNN) for general feature analysis, Convolutional Neural Networks (CNN) for spectrogram analysis, and a hybrid model combining both approaches (C-DNN). The study addressed challenges associated with dataset heterogeneity, language differences, and the complexities of audio sample trimming.

Our models demonstrated accuracy significantly surpassing random guessing and aligning closely with human evaluative benchmarks. This indicates the effectiveness of our approach in recognizing emotional states from brief audio clips.

Despite the challenges of integrating diverse datasets and managing short audio samples, our findings suggest considerable potential for this methodology in real-time emotion detection from continuous speech. This could contribute to improving the emotional intelligence of AI and its applications in various areas.

Copyright © 2024 Diemerling, Stresemann, Braun and von Oertzen.
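The uniform segmentation described in the abstract (cutting a recording into 1.5 s clips) can be illustrated with a minimal sketch. This is not the authors' code: the function name `segment_uniform`, the 16 kHz sample rate, the non-overlapping windows, and the handling of the trailing remainder are all assumptions made for illustration, since the abstract specifies none of them.

```python
from typing import List, Sequence


def segment_uniform(
    samples: Sequence[float],
    sample_rate: int,
    segment_seconds: float = 1.5,   # clip length stated in the abstract
    drop_remainder: bool = True,    # assumption: discard a short final clip
) -> List[List[float]]:
    """Split a mono signal into consecutive, non-overlapping fixed-length clips."""
    if sample_rate <= 0 or segment_seconds <= 0:
        raise ValueError("sample_rate and segment_seconds must be positive")

    seg_len = int(round(sample_rate * segment_seconds))
    if seg_len < 1:
        raise ValueError("segment is shorter than one sample")

    segments: List[List[float]] = []
    for start in range(0, len(samples), seg_len):
        chunk = list(samples[start:start + seg_len])
        if len(chunk) < seg_len:
            if drop_remainder:
                break
            # Alternative assumption: zero-pad the final clip to full length.
            chunk.extend([0.0] * (seg_len - len(chunk)))
        segments.append(chunk)
    return segments


# Hypothetical input: 4 s of silence at an assumed 16 kHz sample rate.
signal = [0.0] * (4 * 16000)
clips = segment_uniform(signal, sample_rate=16000)
print(len(clips), len(clips[0]))
```

Each resulting clip would then be passed to feature extraction or spectrogram computation. Whether a short final clip is dropped or padded changes how many training samples a recording yields, so that choice should be made explicitly.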
