A systematic review of the application of machine learning techniques to ultrasound tongue imaging analysis.

Abstract

B-mode ultrasound has emerged as a prevalent tool for observing tongue motion in speech production, gaining traction in speech therapy applications. However, the effective analysis of ultrasound tongue image frame sequences (UTIFs) encounters many challenges, such as the presence of high levels of speckle noise and obscured views. Recently, the application of machine learning, especially deep learning techniques, to UTIF interpretation has shown promise in overcoming these hurdles. This paper presents a thorough examination of the existing literature, focusing on UTIF analysis. The scope of our work encompasses four key areas: a foundational introduction to deep learning principles, an exploration of motion tracking methodologies, a discussion of feature extraction techniques, and an examination of cross-modality mapping. The paper concludes with a detailed discussion of insights gleaned from the comprehensive literature review, outlining potential trends and challenges that lie ahead in the field.© 2024 Acoustical Society of America.

Show Full Text

A systematic review of the application of machine learning techniques to ultrasound tongue imaging analysis.

Researchers

Journal

Modalities

Models

Abstract

Automatic sleep staging for the young and the old – Evaluating age bias in deep learning.

Ensemble classifier fostered detection of arrhythmia using ECG data.

Toward Automatic Detection of Radiation-Induced Cerebral Microbleeds Using a 3D Deep Residual Network.

Extraction of multi-scale features enhances the deep learning-based daily PM forecasting in cities.

Intracerebral hemorrhage detection on computed tomography images using a residual neural network.

Spatial interplay of tissue hypoxia and T-cell regulation in ductal carcinoma in situ.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply