What is the primary objective of policy improvement?
To find a policy that outperforms the current one
To train a new policy from the ground up
To evaluate the current policy
To identify the optimal policy

Reinforcement Learning Exercises are loading ...