huggingface-RL course

Unit 01. LunarLander-v2

Bonus Unit 01. Huggy Dogs

Unit 02. Q-Learning with FrozenLake-v1 and Taxi-v3

Unit 03. Deep Q-Learning with Atari games

Bonus Unit 02. Optuna Automatic Hyperparameter tuning

Unit 04. Policy Gradient with Pytorch

Unit 05. Introduce to Unity ML-agents

Unit 06. ACTOR CRITIC METHODS WITH ROBOTICS ENVIRONMENTS

Unit 07. Introduction to multi-agents and AI vs AI

Unit 08. PART 1 PROXIMAL POLICY OPTIMIZATION (PPO)

Unit 08. Introduction to multi-agents and AI vs AI