Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances

Imitation Learning - Learning paradigm in which an agent seeks to acquire a policy by observing and imitating the behavior of $1 \dots N$ proficient agents, referred to as the expert

In Imitation Learning the expert provides Dataset D of expert-related behavioral data

M \in N^{+} D = {(s_{i}, a_{i})}_{i = 1}^{M} D = {(s_{i}, a_{i}, s_{i}^{'}}_{i = 1}^{M} D = {(s_{i}, s_{i}^{'}}_{i = 1}^{M}

In english: The format of the input the agent can learn from

M is a positive number
In the state $s_{i}$ the expert took action $a_{i}$
In the state $s_{i}$ the expert took action $a_{i}$ and the resulting state was $s_{i}^{'}$
pre-action state $s_{i}$ and post-action state $s_{i}^{'}$

π_{e} : S \to △ (A)

In english: For a specific state the probability distribution of all the possible actions

S = the state space (set of all possible states)
A = the action space (set of all possible actions)
$△ (A)$ = the set of probability distributions over actions in A

Explicit Imitation - The Dataset provided only contains state-action pairs ${(s_{i}, a_{i}}_{i = 1}^{M}$

Implicit Imitation - The Dataset ONLY contains state transitions ${(s_{i})}_{i = 1}^{M}$

Ayush Garg

Recently Updated

Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances

ECE 222

Interpreter

Maximum Likelihood

Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances

Graph View

Backlinks