A type of Machine Learning paradigm. The model interacts with an environment and learns by trial-and-error. Rewards guide learning. Example: Teaching a robot to walk or play chess.