Optimize commands #45

Open
1 of 2 tasks
StepNeverStop opened this issue Jul 27, 2021 · 0 comments
@StepNeverStop
Owner

StepNeverStop commented Jul 27, 2021

  • Remove `docopt`
  • Add a command-line option to choose whether to use an RNN
@StepNeverStop StepNeverStop created this issue from a note in Tasks (In Progress) Jul 27, 2021
StepNeverStop added a commit that referenced this issue Jul 27, 2021
1. removed `docopt`
2. added `argparse`
3. updated README
4. removed redundant functions
5. optimized `run.py`
6. added configuration files in folder `configs`
7. optimized code-style
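The move from `docopt` to `argparse`, together with the requested RNN switch, could look roughly like the sketch below. This is an illustration only: the flag names (`--algorithm`, `--config`, `--use-rnn`) and their defaults are assumptions made for the example, not the project's actual command-line interface.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with hypothetical flags (names are illustrative only)."""
    parser = argparse.ArgumentParser(description="Hypothetical training entry point")
    # Assumed flag: which algorithm to train.
    parser.add_argument("-a", "--algorithm", default="dqn",
                        help="name of the algorithm to run")
    # Assumed flag: path to a configuration file such as one under `configs`.
    parser.add_argument("-c", "--config", default=None,
                        help="path to a YAML configuration file")
    # Assumed flag: boolean switch, False unless passed on the command line.
    parser.add_argument("--use-rnn", action="store_true",
                        help="enable a recurrent network")
    return parser


if __name__ == "__main__":
    # Parse an explicit argument list so the sketch runs without a real command line.
    args = build_parser().parse_args(["--algorithm", "ppo", "--use-rnn"])
    print(args.algorithm, args.config, args.use_rnn)
```

With `action="store_true"`, the RNN option needs no value: its presence sets `args.use_rnn` to `True`, and omitting it leaves the default `False`.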
@StepNeverStop StepNeverStop added the enhancement New feature or request label Jul 27, 2021
StepNeverStop added a commit that referenced this issue Jul 28, 2021
1. removed the SARL off-policy algorithm `pd_ddpg`, because it is not mainstream
2. updated README
3. removed `iql` and added script `IndependentMA.py` instead to implement independent multi-agent algorithms
4. optimized summary writing
5. moved `NamedDict` from `rls.common.config` to `rls.common.specs`
6. updated example config
7. updated `.gitignore`
8. added property `is_multi` to identify whether training task is for sarl or marl for both unity and gym
9. reconstructed inheritance relationships between algorithms and their superclasses
10. removed `1.e+18` in yaml files and used a large integer instead, because we want a large integer rather than a float
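The last item above is easier to see with a small example. A value written as `1.e+18` is a float, and floats of that size cannot represent every integer, nor can they be used where Python requires an integer (for instance in `range`). The helper `as_int_limit` below is hypothetical and exists only to illustrate the distinction; it is not part of the project.

```python
def as_int_limit(value):
    """Hypothetical helper: accept only true integers for step or size limits."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError(f"expected an integer limit, got {type(value).__name__}")
    return value


float_limit = 1.0e18   # the kind of value a float literal such as `1.e+18` produces
int_limit = 10 ** 18   # the same magnitude written as a large integer

# At this magnitude a float cannot distinguish neighbouring integers.
assert float_limit + 1 == float_limit
# An integer keeps exact arithmetic.
assert int_limit + 1 != int_limit

# Integer-only APIs such as range() reject the float outright.
try:
    range(float_limit)
except TypeError:
    pass

as_int_limit(int_limit)  # passes; as_int_limit(float_limit) would raise TypeError
```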
@StepNeverStop StepNeverStop self-assigned this Aug 25, 2021
StepNeverStop added a commit that referenced this issue Aug 30, 2021
…rn`. (#28, #45)

1. implemented function `n_step_return` to calculate $G_{t}^{n}$
2. implemented function `td_lambda_return` to calculate $TD(\lambda)$
3. renamed `no_save` to `is_save` and changed related command
4. removed `--prefill-steps`, `--info`, and `--save-frequency` from the command line; users can specify those parameters in configuration files instead
5. updated README
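For readers unfamiliar with the two quantities named in the commit above, here is a minimal sketch of the textbook definitions. It is not the repository's implementation: the signatures, the plain-list inputs, and the single-trajectory assumption (no episode boundary inside the segment, `values` holding one extra bootstrap entry for the state after the last reward) are all assumptions made for this example.

```python
def n_step_return(rewards, values, gamma, n):
    """G_t^n = sum_{k<n} gamma^k r_{t+k} + gamma^n V(s_{t+n}), truncated at the end.

    rewards has length T; values has length T + 1 (values[T] is the bootstrap).
    """
    T = len(rewards)
    returns = []
    for t in range(T):
        end = min(t + n, T)  # fewer than n steps remain near the end of the segment
        g = 0.0
        for k in range(t, end):
            g += gamma ** (k - t) * rewards[k]
        g += gamma ** (end - t) * values[end]  # bootstrap from the value estimate
        returns.append(g)
    return returns


def td_lambda_return(rewards, values, gamma, lam):
    """Lambda-return via the backward recursion
    G_t = r_t + gamma * ((1 - lam) * V(s_{t+1}) + lam * G_{t+1}), with G_T = V(s_T).
    """
    T = len(rewards)
    returns = [0.0] * T
    next_return = values[T]
    for t in reversed(range(T)):
        next_return = rewards[t] + gamma * (
            (1 - lam) * values[t + 1] + lam * next_return
        )
        returns[t] = next_return
    return returns


if __name__ == "__main__":
    rewards = [1.0, 1.0, 1.0]
    values = [0.0, 0.0, 0.0, 0.0]
    print(n_step_return(rewards, values, gamma=0.5, n=2))
    print(td_lambda_return(rewards, values, gamma=0.5, lam=1.0))
```

Two sanity checks follow from the definitions: with `lam=0` the lambda-return reduces to the one-step target `r_t + gamma * V(s_{t+1})`, and with `lam=1` it matches the n-step return for `n` at least as long as the segment.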