Smartmind
.global
What is the primary objective of policy improvement?
To find a policy that outperforms the current one
To train a new policy from the ground up
To evaluate the current policy
To identify the optimal policy
Reinforcement Learning Exercises are loading ...