A type of Machine Learning paradigm.

  • The model interacts with an environment and learns by trial-and-error.
  • Rewards guide learning.
  • Example: Teaching a robot to walk or play chess.