Skip to content

Commit 3375a62

Browse files
authored
Update paper link.
1 parent 86b3897 commit 3375a62

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Efficient and Stable Offline-to-online Reinforcement Learning via Continual Policy Revitalization
22

3-
The official code for "Efficient and Stable Offline-to-online Reinforcement Learning via Continual Policy Revitalization", (IJCAI'24), to appear. The paper will soon be released.
3+
The official code for [Efficient and Stable Offline-to-online Reinforcement Learning via Continual Policy Revitalization](https://www.ijcai.org/proceedings/2024/0477.pdf), (IJCAI'24).
44

55
## Install Dependency
66

@@ -48,7 +48,7 @@ fi
4848
Run the following script to finish the offline experiments
4949

5050
```bash
51-
bash ./script/run_td3bc_offline.sh $task $quality $name $seed --device $device_id
51+
bash ./script/run_td3bc_offline.sh tasktask quality namename seed --device $device_id
5252
```
5353

5454
Value for the arguments
@@ -72,7 +72,7 @@ bash ./script/run_td3bc_offline.sh halfcheetah medium corl 0 --device "cuda:0"
7272
Run the following script to reproduce online experiments
7373

7474
```bash
75-
bash ./script/run_cpr_online.sh $task $quality original $seed --device $device_id
75+
bash ./script/run_cpr_online.sh tasktask quality original seed−−deviceseed --device device_id
7676
```
7777

7878
Value for the arguments

0 commit comments

Comments
 (0)