Smartmind.global
Among policy gradient methods, which one uses a neural network to approximate the policy?
REINFORCE
Trust Region Policy Optimization
Deterministic Policy Gradient
Actor-Critic