Cheng Lei12†, Jiayu Zhang2†‡, Yue Ma3*, Xinyu Wang4, Long Chen2, Liang Tang2, Yiqiang Yan2, Fei Su1, Zhicheng Zhao1*
1 Beijing University of Posts and Telecommunications, 2 Lenovo, 3 HKUST, 4 Tsinghua University
†Equal Contribution ‡ Project Lead *Corresponding Author
For more examples, please refer to our project page (https://xduzhangjiayu.github.io/DiTraj_Project_Page/).
[2025.9.29] Paper released!
[2025.12.10] Code released!
- Release Paper on arxiv
- Release Code
- Release Gradio demo with user-friendly interaction
Clone the repo:

```shell
git clone https://github.com/xduzhangjiayu/DiTraj.git
```
Then create the environment and install the dependencies:

```shell
conda create --name DiTraj python=3.11
conda activate DiTraj
pip install -r requirements.txt
```
Install diffusers v0.33.1 from source:

```shell
git clone --branch v0.33.1 https://github.com/huggingface/diffusers.git
cd diffusers
pip install -e .
```
Finally, replace `./diffusers/src/diffusers/models/transformers/transformer_wan.py` with the `./module/transformer_wan.py` file from this repo, e.g. `cp ./module/transformer_wan.py ./diffusers/src/diffusers/models/transformers/transformer_wan.py`.
- First, put your prompts in `test_prompts.txt`.
- Then, run the following commands:

```shell
python prompt_extend.py   # optional
python prompt_refine.py
```

This generates `demo/test_prompts_refined.json`, which contains the background/foreground (bg/fg) prompts.
- Define your trajectory in `run.py` (line 15). You can set the bbox at several keyframes: (x1, y1) is the top-left corner of the bbox and (x2, y2) is the bottom-right corner. Each keyframe uses the format `[frame_id, y1, y2, x1, x2]`. For example:

```python
bboxs = [
    [0, 0.3, 0.7, 0.1, 0.4],   # frame 0: left side
    [80, 0.3, 0.7, 0.7, 1.0],  # frame 80: right side
]
```
If you want a more complex trajectory, you can use something like the following:

```python
bboxs = [
    [0, 0.05, 0.55, 0.05, 0.45],   # frame 0: top-left
    [20, 0.05, 0.55, 0.55, 0.95],  # frame 20: top-right
    [40, 0.45, 0.95, 0.55, 0.95],  # frame 40: bottom-right
    [60, 0.45, 0.95, 0.05, 0.45],  # frame 60: bottom-left
    [80, 0.05, 0.55, 0.05, 0.45],  # frame 80: back to top-left
]
```
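Between keyframes, per-frame boxes are presumably obtained by interpolation inside `run.py`. As a rough mental model only (a minimal sketch, not the repo's actual code), linear interpolation over `[y1, y2, x1, x2]` looks like this:

```python
def interpolate_bbox(bboxs, frame_id):
    """Linearly interpolate a [y1, y2, x1, x2] box between keyframes.

    `bboxs` follows the [frame_id, y1, y2, x1, x2] format above,
    sorted by frame id; frames outside the range clamp to the ends.
    """
    if frame_id <= bboxs[0][0]:
        return bboxs[0][1:]
    if frame_id >= bboxs[-1][0]:
        return bboxs[-1][1:]
    for (f0, *b0), (f1, *b1) in zip(bboxs, bboxs[1:]):
        if f0 <= frame_id <= f1:
            t = (frame_id - f0) / (f1 - f0)  # fractional position between keyframes
            return [a + t * (b - a) for a, b in zip(b0, b1)]

bboxs = [
    [0, 0.3, 0.7, 0.1, 0.4],   # frame 0: left side
    [80, 0.3, 0.7, 0.7, 1.0],  # frame 80: right side
]
print(interpolate_bbox(bboxs, 40))  # halfway through, box is halfway across
```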
- Run the following command:

```shell
python run.py
```
- The videos will be saved to `demo/output.mp4` and `demo/output_box.mp4` (the latter with the bbox drawn).
Our codebase builds on diffusers; thanks for the great work!
If you find our work helpful, please star 🌟 this repo and cite 📑 our paper. Thanks for your support!
@misc{lei2025ditrajtrainingfreetrajectorycontrol,
title={DiTraj: training-free trajectory control for video diffusion transformer},
author={Cheng Lei and Jiayu Zhang and Yue Ma and Xinyu Wang and Long Chen and Liang Tang and Yiqiang Yan and Fei Su and Zhicheng Zhao},
year={2025},
eprint={2509.21839},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2509.21839},
}
This code is licensed under CC BY-NC 4.0 and intended for research use only — no commercial use allowed.