Alaryngeal Speech Enhancement for Noisy Environments Using a Pareto Denoising Gated LSTM.

Researchers

Audrius Kulikajevas, Kipras Pribuišis, Robertas Damaševičius, Rytis Maskeliūnas, Virgilijus Uloza

Journal

Journal of Voice: Official Journal of the Voice Foundation

Abstract

Loss of the larynx significantly alters natural voice production, requiring alternative communication modalities and rehabilitation methods to restore speech intelligibility and improve the quality of life of affected individuals. This paper explores advances in alaryngeal speech enhancement that improve signal quality and reduce background noise, focusing on individuals who have undergone laryngectomy. Speech samples were obtained from 23 Lithuanian males who had undergone laryngectomy with secondary implantation of a tracheoesophageal prosthesis (TEP). A Pareto-optimized gated long short-term memory (LSTM) network was trained on tracheoesophageal speech data to capture complex temporal dependencies and contextual information in speech signals. The system was able to distinguish between actual speech and various forms of noise and artifacts, resulting in a 25% drop in the mean signal-to-noise ratio compared to other approaches. According to acoustic analysis, the system significantly decreased the proportion of unvoiced frames from 40% to 10% while maintaining stable values for the proportion of voiced speech frames and the average voicing evidence in voiced frames, indicating that the approach selectively attenuates noise and undesired speech artifacts while preserving important speech information. Copyright © 2024. Published by Elsevier Inc.
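The abstract reports two kinds of figures: a signal-to-noise ratio and the share of frames classified as unvoiced. It does not say how either was computed, so the following is only a minimal sketch of the textbook definitions, assuming a clean reference signal is available and that a per-frame voicing decision has already been made. The function names `snr_db` and `unvoiced_fraction` and the toy data are hypothetical and are not taken from the paper.

```python
import math


def snr_db(clean, noisy):
    """Signal-to-noise ratio in dB, treating (noisy - clean) as the noise.

    Assumption: a clean reference aligned sample-by-sample with the noisy
    signal exists; the abstract does not state the authors' procedure.
    """
    signal_power = sum(s * s for s in clean)
    noise_power = sum((n - s) ** 2 for s, n in zip(clean, noisy))
    if noise_power == 0:
        return math.inf   # identical signals: no noise at all
    if signal_power == 0:
        return -math.inf  # silent reference: only noise present
    return 10.0 * math.log10(signal_power / noise_power)


def unvoiced_fraction(voicing_flags):
    """Fraction of frames flagged as unvoiced (False) by some voicing detector."""
    if not voicing_flags:
        raise ValueError("need at least one frame")
    unvoiced = sum(1 for voiced in voicing_flags if not voiced)
    return unvoiced / len(voicing_flags)


# Toy data (hypothetical): a square-like signal with a constant 0.1 offset as noise,
# and ten frames of which four are flagged unvoiced.
clean = [1.0, -1.0, 1.0, -1.0]
noisy = [1.1, -0.9, 1.1, -0.9]
flags = [True] * 6 + [False] * 4

print(round(snr_db(clean, noisy), 3))
print(unvoiced_fraction(flags))
```

With the toy data, the signal power is 100 times the noise power and four of the ten frames are unvoiced, which mirrors the 40% starting proportion quoted in the abstract.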
