In this paper, we implement three state-of-art continuous reinforcement learning algorithms, Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO) and Policy Gradient (PG)in portfolio management.
This is a Gomoku AI based on curriculum learning and AlphaGo methods.
Deep reinforcement learning GPU libraries for NVIDIA Jetson with PyTorch, OpenAI Gym, and Gazebo robotics simulator.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Simple Reinforcement learning tutorials
中文整理的强化学习资料(Reinforcement Learning)
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Repo for the Deep Reinforcement Learning Nanodegree program