Imitation Learning - Learning paradigm in which an agent seeks to acquire a policy by observing and imitating the behavior of proficient agents, referred to as the expert
In Imitation Learning the expert provides Dataset D of expert-related behavioral data
In english: The format of the input the agent can learn from
- M is a positive number
- In the state the expert took action
- In the state the expert took action and the resulting state was
- pre-action state and post-action state
In english: For a specific state the probability distribution of all the possible actions
- S = the state space (set of all possible states)
- A = the action space (set of all possible actions)
- = the set of probability distributions over actions in A
Explicit Imitation - The Dataset provided only contains state-action pairs
Implicit Imitation - The Dataset ONLY contains state transitions