Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation

This repository is the official implementation of Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation. The paper is accepted for Oral Presentation at the Conference on Robot Learning (CoRL 2025). This code implementation is based on the official Diffusion Policy repository and can be easily integrated into the official repository as a branch.

Environment Configuration

We use the same environment provided by the official Diffusion Policy repository.

conda env create -f conda_environment_real.yaml

Start Training

Once we have confirmed that the environment has been configured, we can activate it with the following command:

conda activate robodiff

Then, we can run the main program to start training:

python train.py --config-name=sheaf_xarm_split_diffusion_workspace

The above is the same implementation we provided as in the main text, meaning that the two arms will have shared third-persion view. If you want each arm to have its own separate third-view camera, you can use the following command:

python train.py --config-name=sheaf_individual_camera_diffusion_workspace

Task Results

We have two experiments: cooperative push-T and coffee bean pouring. We are only providing the training data for coffee bean pouring here.

Reference

If this repository is helpful to you, please cite our work by:

@article{he2025latent,
  title={Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation},
  author={He, Chengyang and Camps, Gadiel Sznaier and Liu, Xu and Schwager, Mac and Sartoretti, Guillaume},
  journal={arXiv preprint arXiv:2505.09144},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.idea		.idea
data/xarm_multi		data/xarm_multi
diffusion_policy		diffusion_policy
.gitignore		.gitignore
README.md		README.md
conda_environment_real.yaml		conda_environment_real.yaml
overview.png		overview.png
result_task1_v2.png		result_task1_v2.png
result_task2.png		result_task2.png
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation

Environment Configuration

Start Training

Task Results

Reference

About

Uh oh!

Releases

Packages

Languages

StanfordMSL/LatentToM

Folders and files

Latest commit

History

Repository files navigation

Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation

Environment Configuration

Start Training

Task Results

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages