
The question list is as follows:

  1. What are policy gradients?
  2. Why are policy gradients effective?
  3. What is the use of the Actor-Critic network in DDPG?
  4. What is the constrained optimization problem?
  5. What is the trust region?
  6. How does PPO overcome the drawbacks of TRPO?