You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I’m investigating performance differences between building MuJoCo from source and installing the official wheel (via pip), both at version 3.3.4, under Conda environments. I’m seeing significantly different CPU usage and performance in RL training scenarios.
My setup
mujoco 3.3.4
python 3.10
ubuntu 22.04
torch 2.5.1
My question
✅ Test 1 – specs_test.py
pip-installed MuJoCo: Ran 32 tests in 0.031s
source-built MuJoCo: Ran 32 tests in 0.047s
This suggests pip-installed version is ~50% faster on basic tests.
✅ Test 2 – My custom multi-process RL code (2 environments)
Using htop, I observed:
pip version: each CPU core < 20% usage
source build: each core > 60%, multiple over 70%
🔍 Questions
What could cause this performance gap?
Is it possible to achieve comparable performance by re-building from source with appropriate flags?
Could you please share the full set of compiler flags used in the official wheel build?
I want to confirm if rebuilding with identical build configuration will allow my source-built version to match the performance of the official wheel.
Thanks for any advice or pointers!
Minimal model and/or code that explain my question
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Intro
Hi MuJoCo team and community,
I’m investigating performance differences between building MuJoCo from source and installing the official wheel (via pip), both at version 3.3.4, under Conda environments. I’m seeing significantly different CPU usage and performance in RL training scenarios.
My setup
mujoco 3.3.4
python 3.10
ubuntu 22.04
torch 2.5.1
My question
✅ Test 1 – specs_test.py
Ran 32 tests in 0.031s
Ran 32 tests in 0.047s
This suggests pip-installed version is ~50% faster on basic tests.
✅ Test 2 – My custom multi-process RL code (2 environments)
Using
htop
, I observed:🔍 Questions
I want to confirm if rebuilding with identical build configuration will allow my source-built version to match the performance of the official wheel.
Thanks for any advice or pointers!
Minimal model and/or code that explain my question
No response
Confirmations
Beta Was this translation helpful? Give feedback.
All reactions