https://verifiers.readthedocs.io/en/latest/overview.html

TLDR: Verifiers are used in language model training where correctness isn’t objectively measurable

Verifiers provide a flexible framework for defining custom interaction protocols between LLMs and environments

3 key pieces of environments are:

  • Dataset (str)
  • Your Rubric (one or more reward function)
  • Interaction protocol