H4MER: Human 4D Modeling by Learning Neural Compositional Representation with Transformer.

Abstract

Despite the impressive results achieved by deep learning based 3D reconstruction, the techniques of directly learning to model 4D human captures with detailed geometry have been less studied. This work presents a novel neural compositional representation for Human 4D Modeling with transformER (H4MER). Specifically, our H4MER is a compact and compositional representation for dynamic human by exploiting the human body prior from the widely used SMPL parametric model. Thus, H4MER can represent a dynamic 3D human over a temporal span with the codes of shape, initial pose, motion and auxiliaries. A simple yet effective linear motion model is proposed to provide a rough and regularized motion estimation, followed by per-frame compensation for pose and geometry details with the residual encoded in the auxiliary codes. We present a novel Transformer-based feature extractor and conditional GRU decoder to facilitate learning and improve the representation capability. Extensive experiments demonstrate our method is not only effective in recovering dynamic human with accurate motion and detailed geometry, but also amenable to various 4D human related tasks, including monocular video fitting, motion retargeting, 4D completion, and future prediction. The video demos are in the project homepage: https://boyanjiang.github.io/H4MER/.

Show Full Text

H4MER: Human 4D Modeling by Learning Neural Compositional Representation with Transformer.

Researchers

Journal

Modalities

Models

Abstract

Fluorescence microscopy datasets for training deep neural networks.

A novel sequential structure for lightweight multi-scale feature learning under limited available images.

How Robust is Your Fairness? Evaluating and Sustaining Fairness under Unseen Distribution Shifts.

Deep learning-based approach for identification of diseases of maize crop.

Evaluation of automatic video captioning using direct assessment.

Efficient Multi-Object Detection and Smart Navigation Using Artificial Intelligence for Visually Impaired People.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply