• Intuition: Measures the average absolute difference between predicted and true values.
  • Characteristics:
    • Produces a linear penalty → every unit of error is penalized equally.
    • More robust to outliers, since large errors don’t get squared.
    • Optimization is harder because the absolute function is not differentiable at 0 (though subgradients are used).