MagicPose4D: Crafting Articulated Models
with Appearance and Motion Control

Hao Zhang^1* Di Chang^2* Fang Li¹ Mohammad Soleymani² Narendra Ahuja¹

¹University of Illinois Urbana-Champaign ²University of Southern California

^*Equal Contribution

We introduce MagicPose4D, a novel framework for 4D generation providing more accurate and customizable 4D motion retargeting. We propose a dual-phase reconstruction process that initially uses accurate 2D and pseudo 3D supervision without skeleton constraints, and subsequently refines the model with skeleton constraints to ensure physical plausibility. We incorporate a novel Global-Local Chamfer loss function that aligns the overall distribution of mesh vertices with the supervision and maintains part-level alignment without additional annotations. Our method enables cross-category motion transfer using a kinematic-chain-based skeleton, ensuring smooth transitions between frames through dynamic rigidity and achieving robust generalization without the need for additional training.

Method

MagicPose4D accepts two distinct types of input prompts: (i) appearance prompts and (ii) motion prompts. Both images and textual descriptions can function as appearance prompts, delineating the desired object and its visual characteristics. In a departure from existing approaches, MagicPose4D enables users to specify precise motions and trajectories by providing a video/mesh sequence that represents the anticipated movement. MagicPose4D comprises three critical components: (i) the 4D Reconstruction module, (ii) the Cross-Category Motion Transfer module, and (iii) the Image-to-3D Generation module. Each module is tailored to facilitate distinct aspects of dynamic modeling, enabling adaptive 4D reconstructions that align with user-defined specifications.

Multi-View 4D Generation

Animal

Comparison to Other Pose Transfer Methods

Animals

Reference Motion Ours 3D-CoreNet X-DualNet

Humanoid

Reference Motion Ours 3D-CoreNet X-DualNet

Comparison to Image-to-4D Generation Method

"Hip-Hop Dancing"

Reference

Animate124

Ours

"Running like a rabit"

Reference

Animate124

Ours

"Attacking with head"

Reference

Animate124

Ours

BibTeX

@misc{zhang2024magicpose4d,
      title={MagicPose4D: Crafting Articulated Models with Appearance and Motion Control}, 
      author={Hao Zhang and Di Chang and Fang Li and Mohammad Soleymani and Narendra Ahuja},
      year={2024},
      eprint={2405.14017},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

MagicPose4D: Crafting Articulated Models with Appearance and Motion Control

Video

Method

Multi-View 4D Generation

Animal

Humanoid

Comparison to Other Pose Transfer Methods

Animals

Humanoid

Comparison to Image-to-4D Generation Method

"Hip-Hop Dancing"

"Running like a rabit"

"Attacking with head"

BibTeX

MagicPose4D: Crafting Articulated Models
with Appearance and Motion Control