Code for each week's short video of Siraj Raval Course on Reinforcement Learning "AI for Video Games"
Value iteration algorithm built for the Taxi-v1 environment by OpenAI Gym library.
Monte Carlo Prediction algorithm built for the Blackjack-v0 environment by OpenAI Gym library.
Q-Learning algorithm built for the MountainCarContinuous-v0 environment by OpenAI Gym library.
Policy Gradients algorithm built for the Pong-v0 environment by OpenAI Gym library.
Actor-Critic model built for the Pendulum-v0 environment by OpenAI Gym library.
Proximal Policy Optimization algorithm built for the Pendulum-v0 environment by OpenAI Gym library.