Introduction to Reinforcement Learning (DDPG and TD3).. TL;DR: Reinforcement Learning is the ideal framework for a recommendation system because it has Markov Property. The state is movies rated by a user. Action is the.

Introduction to Reinforcement Learning (DDPG and TD3).
Introduction to Reinforcement Learning (DDPG and TD3). from miro.medium.com

The first feature added to TD3 is the use of two critic networks. This was inspired by the technique seen in Deep Reinforcement Learning with Double Q-learning (Van Hasselt et.