A Deep Learning Based Approach to Synthesize Intelligible Speech with Limited Temporal Envelope Information.

Researchers

Ching-Ju Hsiao Fei Chen Ji-Yan Han Wei-Zhong Zheng Ying-Hui Lai

Journal

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

Modalities

Models

deep learning

Abstract

Envelope waveforms can be extracted from multiple frequency bands of a speech signal, and envelope waveforms carry important intelligibility information for human speech communication. This study aimed to investigate whether a deep learning-based model with features of temporal envelope information could synthesize an intelligible speech, and to study the effect of reducing the number (from 8 to 2 in this work) of temporal envelope information on the intelligibility of the synthesized speech. The objective evaluation metric of short-time objective intelligibility (STOI) showed that, on average, the synthesized speech of the proposed approach provided higher STOI (i.e., 0.8) scores in each test condition; and the human listening test showed that the average word correct rate of eight listeners was higher than 97.5%. These findings indicated that the proposed deep learning-based system can be a potential approach to synthesize a highly intelligible speech with limited envelope information in the future.

Show Full Text

A Deep Learning Based Approach to Synthesize Intelligible Speech with Limited Temporal Envelope Information.

Researchers

Journal

Modalities

Models

Abstract

3D morphometric quantification of maxillae and defects for patients with unilateral cleft palate via deep learning-based CBCT image auto-segmentation.

Deep Learning Model for Automatic Contouring of Cardiovascular Substructures on Radiotherapy Planning CT Images: Dosimetric Validation and Reader Study based Clinical Acceptability Testing.

Classification of VLF/LF Lightning Signals Using Sensors and Deep Learning Methods.

Preparation and Performance Analysis of Transformer Aramid Nanopaper-Based Insulating Material Based on Deep Learning.

Deep learning for patient-specific quality assurance: Identifying errors in radiotherapy delivery by radiomic analysis of gamma images with convolutional neural networks.

A Deep Learning-based Framework for Intersectional Traffic Simulation and Editing.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply