Which of the following is NOT an advantage of using reinforcement learning for policy improvement?
It can be applied to a wide range of problems.
It can adapt to complex and dynamic environments.
Overlook minor misbehaviors
Impose harsh punishments for any infraction

Reinforcement Learning Exercises are loading ...