Chapter 13

  1. Imagination in an agent specifies visualizing and planning before taking any action.
  2. Imagination core consists of policy network and environmental model for performing imagination.
  3. Agents repeatedly take feedback from the human and change its goal according to the human preference.
  4. DQfd uses some demonstration data for training where as DQN doesn't use any demonstrations data upfront.
  5. Refer section Hindsight Experience Replay (HER).
  1. Hierarchical reinforcement learning (HRL) is proposed to solve the curse of dimensionality where we decompress large problems into small subproblems in a hierarchy
  2. We tried to find the optimal policy given the reward function in RL whereas in inverse reinforcement learning, the optimal policy is given and we find the reward function

 

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.221.123.73