Identify the option that does not represent a reinforcement learning algorithm.
Q-learning
Actor-critic
Supervised learning
Policy gradient

Artificial Intelligence Exercises are loading ...