Evaluating the Diagnostic Potential of Connected Speech for Benign Laryngeal Disease Using Deep Learning Analysis.

Researchers

Hee Chan Kim Jae Yeong Kim Jeong Hoon Lee Jungirl Seok Tack-Kyun Kwon

Journal

Journal of voice : official journal of the Voice Foundation

Modalities

Models

Convolutional Neural Networks (CNNs)Time series models

Abstract

This study aimed to evaluate the performance of artificial intelligence (AI) models using connected speech and vowel sounds in detecting benign laryngeal diseases.Retrospective.Voice samples from 772 patients, including 502 with normal voices and 270 with vocal cord polyps, cysts, or nodules, were analyzed. We employed deep learning architectures, including convolutional neural networks (CNNs) and time series models, to process the speech data. The primary endpoint was the area under the receiver’s operating characteristic curve for binary classification.CNN models analyzing speech segments significantly outperformed those using vowel sounds in distinguishing patients with and without benign laryngeal diseases. The best-performing CNN model achieved areas under the receiver operating characteristic curve of 0.895 and 0.845 for speech and vowel sounds, respectively. Correlations between AI-generated disease probabilities and perceptual assessments were more pronounced in the connected-speech analyses. However, the time series models performed worse than the CNNs.Connected speech analysis is more effective than traditional vowel sound analysis for the diagnosis of laryngeal voice disorders. This study highlights the potential of AI technologies in enhancing the diagnostic capabilities of speech, advocating further exploration, and validation in this field.Copyright © 2024 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

Show Full Text

Evaluating the Diagnostic Potential of Connected Speech for Benign Laryngeal Disease Using Deep Learning Analysis.

Researchers

Journal

Modalities

Models

Abstract

In Contemporary Reproductive Medicine Human Beings are Not Yet Dispensable.

PaleAle 5.0: prediction of protein relative solvent accessibility by deep learning.

Hip osteoarthritis: A novel network analysis of subchondral trabecular bone structures.

Automatic semantic segmentation of EHG recordings by deep learning: An approach to a screening tool for use in clinical practice.

Effect of Bodybuilding and Fitness Exercise on Physical Fitness Based on Deep Learning.

High through-plane resolution CT imaging with self-supervised deep learning.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply