Other

Long short-term memory for speaker generalization in supervised speech separation.

July 9, 2019 Other

Researchers

DeLiang Wang Jitong Chen

Journal

The Journal of the Acoustical Society of America

Modalities

Models

deep neural networks (DNNs)Long short-term memory (LSTM)

Abstract

Speech separation can be formulated as learning to estimate a time-frequency mask from acoustic features extracted from noisy speech. For supervised speech separation, generalization to unseen noises and unseen speakers is a critical issue. Although deep neural networks (DNNs) have been successful in noise-independent speech separation, DNNs are limited in modeling a large number of speakers. To improve speaker generalization, a separation model based on long short-term memory (LSTM) is proposed, which naturally accounts for temporal dynamics of speech. Systematic evaluation shows that the proposed model substantially outperforms a DNN-based model on unseen speakers and unseen noises in terms of objective speech intelligibility. Analyzing LSTM internal representations reveals that LSTM captures long-term speech contexts. It is also found that the LSTM model is more advantageous for low-latency speech separation and it, without future frames, performs better than the DNN model with future frames. The proposed model represents an effective approach for speaker- and noise-independent speech separation.

Show Full Text

Long short-term memory for speaker generalization in supervised speech separation.

Researchers

Journal

Modalities

Models

Abstract

Supervised Deep Learning Techniques for Image Description: A Systematic Review.

Regularizing Deep Neural Networks by Enhancing Diversity in Feature Extraction.

Churn prediction of mobile and online casual games using play log data.

Backpropagation-free training of deep physical neural networks.

Improving Thermostability and Catalytic Activity of Glycosyltransferase From by Semi-Rational Design for Rebaudioside D Synthesis.

DeepMoCap: Deep Optical Motion Capture Using Multiple Depth Sensors and Retro-Reflectors.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply