

3DTrajMaster-CogVideoX

Introduction

3DTrajMaster controls the motion of one or more entities in 3D space using entity-specific 3D trajectories for text-to-video (T2V) generation. It has the following features:

  • 6 Degrees of Freedom (DoF): control both the 3D location and the orientation of each entity.
  • Diverse Entities: human, animal, robot, car, and even abstract entities such as fire or breeze.
  • Diverse Background: city, forest, desert, gym, sunset beach, glacier, hall, night city, etc.
  • Complex 3D Trajectories: 3D occlusion, rotating in place, 180° turns, consecutive 90° turns, etc.
  • Fine-grained Entity Prompt: change a human's hair, clothing, gender, figure size, accessories, etc.
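
To make the 6-DoF idea above concrete, here is a minimal sketch of how a per-frame entity pose (3D location plus orientation) and a "rotating in place" trajectory could be represented. The names `Pose6DoF`, `rotation_matrix`, and `turn_in_place`, the Euler-angle convention, and the frame count are all assumptions made for illustration; they are not the trajectory format used by 3DTrajMaster, which is documented in the project's GitHub repository.

```python
import math
from dataclasses import dataclass


@dataclass
class Pose6DoF:
    """Hypothetical per-frame pose: 3 location values plus 3 orientation angles."""
    x: float
    y: float
    z: float
    roll: float   # rotation about the x axis, in radians
    pitch: float  # rotation about the y axis, in radians
    yaw: float    # rotation about the z axis, in radians


def rotation_matrix(pose):
    """Return the 3x3 matrix R = Rz(yaw) * Ry(pitch) * Rx(roll) as nested lists."""
    cr, sr = math.cos(pose.roll), math.sin(pose.roll)
    cp, sp = math.cos(pose.pitch), math.sin(pose.pitch)
    cy, sy = math.cos(pose.yaw), math.sin(pose.yaw)
    return [
        [cy * cp, cy * sp * sr - sy * cr, cy * sp * cr + sy * sr],
        [sy * cp, sy * sp * sr + cy * cr, sy * sp * cr - cy * sr],
        [-sp, cp * sr, cp * cr],
    ]


def turn_in_place(num_frames, total_yaw_deg, position=(0.0, 0.0, 0.0)):
    """Build a trajectory that keeps the location fixed and sweeps the yaw linearly."""
    poses = []
    for i in range(num_frames):
        # Fraction of the turn completed at frame i (0.0 at start, 1.0 at end).
        t = i / (num_frames - 1) if num_frames > 1 else 0.0
        yaw = math.radians(total_yaw_deg) * t
        poses.append(Pose6DoF(*position, 0.0, 0.0, yaw))
    return poses


# Example: a 180° turn in place sampled over 5 frames (frame count is arbitrary).
trajectory = turn_in_place(num_frames=5, total_yaw_deg=180.0)
final_rotation = rotation_matrix(trajectory[-1])
```

In this sketch the location stays constant while only the yaw changes, which is what distinguishes 6-DoF control from position-only control: the entity's heading can be specified independently of where it is.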

Usage

This implementation is based on CogVideoX-5B. Please refer to our GitHub repository for usage details.

Citation

If you find this project useful, please consider citing:

@inproceedings{fu20243dtrajmaster,
    author    = {Fu, Xiao and Liu, Xian and Wang, Xintao and Peng, Sida and Xia, Menghan and Shi, Xiaoyu and Yuan, Ziyang and Wan, Pengfei and Zhang, Di and Lin, Dahua},
    title     = {3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation},
    booktitle = {ICLR},
    year      = {2025}
}