ferdinand-muetsch.de
CartPole with a Deep Q-Network
In my last post I developed a solution to OpenAI Gym’s CartPole environment, based on a classical Q-Learning algorithm. The best score I achieved with it was 120, although the score I uploaded to the