Artificial Intelligence | Computer Vision

PA-Tran: Learning to Estimate 3D Hand Pose with Partial Annotation.

February 11, 2023 Artificial Intelligence, Computer Vision

Abstract

This paper tackles a novel and challenging problem-3D hand pose estimation (HPE) from a single RGB image using partial annotation. Most HPE methods ignore the fact that the keypoints could be partially visible (e.g., under occlusions). In contrast, we propose a deep-learning framework, PA-Tran, that jointly estimates the keypoints status and 3D hand pose from a single RGB image with two dependent branches. The regression branch consists of a Transformer encoder which is trained to predict a set of target keypoints, given an input set of status, position, and visual features embedding from a convolutional neural network (CNN); the classification branch adopts a CNN for estimating the keypoints status. One key idea of PA-Tran is a selective mask training (SMT) objective that uses a binary encoding scheme to represent the status of the keypoints as observed or unobserved during training. In addition, by explicitly encoding the label status (observed/unobserved), the proposed PA-Tran can efficiently handle the condition when only partial annotation is available. Investigating the annotation percentage ranging from 50-100%, we show that training with partial annotation is more efficient (e.g., achieving the best 6.0 PA-MPJPE when using about 85% annotations). Moreover, we provide two new datasets. APDM-Hand, is for synthetic hands with APDM sensor accessories, which is designed for a specific hand task. PD-APDM-Hand, is a real hand dataset collected from Parkinson’s Disease (PD) patients with partial annotation. The proposed PA-Tran can achieve higher estimation accuracy when evaluated on both proposed datasets and a more general hand dataset.

Show Full Text

PA-Tran: Learning to Estimate 3D Hand Pose with Partial Annotation.

Researchers

Journal

Modalities

Models

Abstract

Inferring gene regulatory networks from single-cell transcriptomics based on graph embedding.

A Small Object Detection Algorithm for Traffic Signs Based on Improved YOLOv7.

Insight into Automatic Image Diagnosis of Ear Conditions Based on Optimized Deep Learning Approach.

Cancer type prediction based on copy number aberration and chromatin 3D structure with convolutional neural networks.

TIAToolbox as an end-to-end library for advanced tissue image analytics.

Partial hard occluded target reconstruction of Fourier single pixel imaging guided through range slice.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply