Which step in the Reinforcement Learning Cycle involves collecting information about the environment?
Reward
Action
Observation
New State

Reinforcement Learning Übungen werden geladen ...