Clevermind
.uk
Which of the following is a Monte Carlo method for policy evaluation?
First-visit Monte Carlo
SARSA
Overlook minor misbehaviors
Impose harsh punishments for any infraction
Reinforcement Learning Exercises are loading ...