Skip to content

Metro1998/hppo-in-traffic-signal-control

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement learning for traffic signal control in hybrid action space

H-PPO

In this repository, utilizing Hybrid Proximal Policy Optimization1 (H-PPO), we have implemented the synchronous optimization of the signal staging (discrete action) and its corresponding duration (continuous parameter).

The rollout buffer, network architectures, and (H)PPO classes are defined in src\hppo\HPPO.py. In the specific implementation, we have referred to repo:OpenAI and repo:ikostrikov. We have also considered the tricks in revisit papers234.

In the envs\ , we define the traffic demand as well as the simulation configuration, for example, 4_4.rou.xml and 4_4.add.xml, to make the sumo work. The environment class SUMOEnv is also included, which have been encapsulated to provide interactive envs (Please make sure you have registered it locally). Of course, if you are not interested in traffic signal control, you can ignore this env and write on your own. High performance and vectorized env, is under developed.

Finally, we present the overview of H-PPO’s architecture as well as the partial training results. And you can cite our paper5 as well.

Average queue length of Env #9 Average delay of Env #9

HyAR(TODO)

Footnotes

  1. FAN Z, SU R, ZHANG W, et al. Hybrid actor-critic reinforcement learning in parameterized action space[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence. 2019: 2279-2285.

  2. HUANG S, DOSSA R F J, RAFFIN A, et al. The 37 implementation details of proximal policy optimization[J]. The ICLR Blog Track, 2022.

  3. ENGSTROM L, ILYAS A, SANTURKAR S, et al. Implementation Matters in Deep RL: A Case Study on PPO and TRPO[C]//International Conference on Learning Representations. 2020.

  4. ANDRYCHOWICZ M, RAICHUK A, STAŃCZYK P, et al. What Matters In On-Policy Reinforce�ment Learning? A Large-Scale Empirical Study[C]//ICLR 2021-Ninth International Conference on Learning Representations. 2021.

  5. LUO H, BIE Y, JIN S. Reinforcement Learning for Traffic Signal Control in Hybrid Action Space [J]. IEEE Transactions on Intelligent Transportation Systems, 2024: 1-17.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages