small experiments with agents learning atari games, implemented in jax/numpy
mostly following the 2013 NeurIPS paper Playing Atari with Deep Reinforcement Learning by Mnih et al.
- experimental code setup partly based on this pytorch implementation
- nice explanation on the preprocessing with frame skipping
This is a WIP