Reinforcement Learning - Maximize reward and minimize loss. There are rewards and losses associated with actions (states, what is going on right now).