In reinforcement learning, what does the agent strive to maximize?
Expected cumulative reward
Accuracy
Loss

Cloud Artificial Intelligence and Machine Learning Exercises are loading ...