diff --git a/index.html b/index.html index 30c2afa..aba3273 100644 --- a/index.html +++ b/index.html @@ -28,57 +28,57 @@
Learning curves sample efficiency comparison of TD3, SnapshotRL+TD3, and S3RL+TD3 on six MuJoCo environments. For details see wandb.
- +Learning curves sample efficiency comparison of TD3, SnapshotRL+TD3, and S3RL+TD3 on six MuJoCo environments. For details see wandb.
+Learning curves sample efficiency comparison of SAC, SnapshotRL+SAC, and S3RL+SAC on six MuJoCo environments. For details see wandb.
- +Learning curves sample efficiency comparison of SAC, SnapshotRL+SAC, and S3RL+SAC on six MuJoCo environments. For details see wandb.
+Learning curves sample efficiency comparison of PPO, SnapshotRL+PPO, and S3RL+PPO on six MuJoCo environments. For details see wandb.
- +Learning curves sample efficiency comparison of PPO, SnapshotRL+PPO, and S3RL+PPO on six MuJoCo environments. For details see wandb.
+Ablation study results showing the impact of key components on the sample efficiency of S3RL+TD3 on six MuJoCo environments. For details see wandb.
- +Ablation study results showing the impact of key components on the sample efficiency of S3RL+TD3 on six MuJoCo environments. For details see wandb.
+