In a partially observable MDP (POMDP), the agent:
Has complete knowledge of the current state.
Has partial observability of the current state.
Overlook minor misbehaviors
Impose harsh punishments for any infraction

Machine Learning Applications Exercises are loading ...