Which of the following is a Monte Carlo method for policy evaluation?
First-visit Monte Carlo
SARSA
Overlook minor misbehaviors
Impose harsh punishments for any infraction

Reinforcement Learning Exercises are loading ...