https://verifiers.readthedocs.io/en/latest/overview.html
TLDR: Verifiers are used in language model training where correctness isn’t objectively measurable
Verifiers provide a flexible framework for defining custom interaction protocols between LLMs and environments
3 key pieces of environments are:
- Dataset (str)
- Your Rubric (one or more reward function)
- Interaction protocol