Remove unnecessary SDE resampling in PPO update #1933

brn-dev · 2024-05-22T10:24:33Z

Description

Remove policy.reset_noise() call in PPO update

Motivation and Context

Resampling the SDE noise in the PPO update is unnecessary; see #1929 for details.

I have raised an issue to propose this change (required for new features and bug fixes)

closes #1929
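A minimal sketch of one reading of why the resampling has no effect, assuming the gSDE log-probability is computed from the learned standard deviation and the latent features only, and that the sampled exploration matrix is used only when drawing actions. TinyGSDE and all of its values are hypothetical and are not SB3 code; see #1929 for the actual discussion.

```python
import math
import random


class TinyGSDE:
    """Toy state-dependent noise distribution (hypothetical, not SB3 code)."""

    def __init__(self, std, seed=0):
        self.std = std  # shape (latent_dim, action_dim), assumed positive
        self._rng = random.Random(seed)
        self.reset_noise()

    def reset_noise(self):
        # Draw one noise weight per (latent feature, action dim) from N(0, std^2).
        self.exploration_mat = [[self._rng.gauss(0.0, s) for s in row] for row in self.std]

    def sample(self, mean, latent):
        # Rollout-time action: mean plus latent features projected through the noise matrix.
        return [
            mean[j] + sum(latent[i] * self.exploration_mat[i][j] for i in range(len(latent)))
            for j in range(len(mean))
        ]

    def log_prob(self, action, mean, latent):
        # Gaussian log-density; the variance uses std only, never exploration_mat.
        total = 0.0
        for j, (a, m) in enumerate(zip(action, mean)):
            var = sum(latent[i] ** 2 * self.std[i][j] ** 2 for i in range(len(latent)))
            total += -0.5 * ((a - m) ** 2 / var + math.log(2.0 * math.pi * var))
        return total


dist = TinyGSDE(std=[[0.5, 0.3], [0.2, 0.4]])
latent, mean = [1.0, -2.0], [0.1, -0.3]
action = dist.sample(mean, latent)  # action collected during the rollout

log_prob_before = dist.log_prob(action, mean, latent)
old_matrix = dist.exploration_mat
dist.reset_noise()  # the kind of call this PR removes from the update
log_prob_after = dist.log_prob(action, mean, latent)
```

In this toy model the matrix changes after reset_noise() but the log-probability of the stored action does not, so a loss built from that log-probability would be unaffected by the extra call.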

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Note: You can run most of the checks using make commit-checks.

Note: we are using a maximum length of 127 characters per line

araffin

Thanks for the PR =)
Could you do the same for the PPO variants in SB3 contrib?

araffin · 2024-06-29T17:01:59Z

I've created a report to check that this has no impact on performance (the results change because the state of the pseudo-random generator is no longer the same, but performance should not be affected):
https://wandb.ai/openrlbenchmark/sb3/reports/PR-1933-Remove-gSDE-resampling--Vmlldzo4NDk4Nzgx
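The point about the pseudo-random generator state can be illustrated with a small standard-library sketch. It assumes a single shared generator and uses a hypothetical rollout_noise helper; the extra draw stands in for a resampling call whose output is never used. It is not SB3 code.

```python
import random


def rollout_noise(seed, extra_draws):
    """Return three random values after consuming `extra_draws` unused draws."""
    rng = random.Random(seed)
    for _ in range(extra_draws):
        rng.random()  # stands in for a resampling call whose result is discarded
    return [rng.random() for _ in range(3)]


with_resample = rollout_noise(seed=0, extra_draws=1)
without_resample = rollout_noise(seed=0, extra_draws=0)
```

With the same seed, the two runs produce different numbers only because the stream is shifted by one draw: without_resample[1:] equals with_resample[:2]. Individual results therefore differ between the old and new code, while the distribution the values are drawn from stays the same.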

araffin

LGTM, thanks =)

* Remove unnecessary SDE resampling in PPO update
* Update changelog.rst
* Update version
* Update PyTorch version on CI
* Update ruff
* Limit NumPy version
* Reformat

---------

Co-authored-by: Antonin RAFFIN <[email protected]>

brn-dev added 2 commits May 22, 2024 12:00

Remove unnecessary SDE resampling in PPO update

02d7bbc

Update changelog.rst

6b8eace

brn-dev mentioned this pull request May 22, 2024

[Question] Why resample SDE noise matrices in PPO optimzation? #1929

Closed

4 tasks

araffin reviewed May 23, 2024

araffin added 2 commits June 5, 2024 22:55

Merge branch 'master' into patch-1

d1c5fb1

Merge branch 'master' into patch-1

3d80747

Update version

e4ebffa

araffin approved these changes Jun 29, 2024

araffin added 4 commits June 29, 2024 19:30

Update PyTorch version on CI

f09dc96

Update ruff

fad9a7e

Limit NumPy version

09f4b76

Reformat

a4839d2

araffin merged commit 24ebf1a into DLR-RM:master Jun 29, 2024
4 checks passed

araffin mentioned this pull request Jun 29, 2024

Update SB3 and remove gSDE resampling Stable-Baselines-Team/stable-baselines3-contrib#251

Merged

15 tasks
