Research on gesture recognition algorithm based on MME-P3D.

Abstract

A Multiscale-Motion Embedding Pseudo-3D (MME-P3D) gesture recognition algorithm has been proposed to tackle the issues of excessive parameters and high computational complexity encountered by existing gesture recognition algorithms deployed in mobile and embedded devices. The algorithm initially takes into account the characteristics of gesture motion information, integrating the channel attention (CE) mechanism into the pseudo-3D (P3D) module, thereby constructing a P3D-C feature extraction network that can efficiently extract spatio-temporal feature information while reducing the complexity of the algorithmic model. To further enhance the understanding and learning of the global gesture movement’s dynamic information, a Multiscale Motion Embedding (MME) mechanism is subsequently designed. The experimental findings reveal that the MME-P3D model achieves recognition accuracies reaching up to 91.12% and 83.06% on the self-constructed conference gesture dataset and the publicly available Chalearn 2013 dataset, respectively. In comparison with the conventional 3D convolutional neural network, the MME-P3D model demonstrates a significant advantage in terms of parameter count and computational requirements, which are reduced by as much as 82% and 83%, respectively. This effectively addresses the limitations of the original algorithms, making them more suitable for deployment on embedded and mobile devices and providing a more effective means for the practical application of hand gesture recognition technology.

Show Full Text

Research on gesture recognition algorithm based on MME-P3D.

Researchers

Journal

Modalities

Models

Abstract

Using deeply time-series semantics to assess depressive symptoms based on clinical interview speech.

Integrating audio and visual modalities for multimodal personality trait recognition hybrid deep learning.

Deep learning model to predict complex stress and strain fields in hierarchical composites.

Optical patching scheme for optical convolutional neural networks based on wavelength-division multiplexing and optical delay lines.

UIdataGB: Multi-Class ultrasound images dataset for gallbladder disease detection.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply