In policy gradient methods, which type utilizes a neural network to approximate the policy?
REINFORCE
Trust Region Policy Optimization
Deterministic Policy Gradient
Actor-Critic

Artificial Intelligence Exercises are loading ...