README.md: 7 additions & 7 deletions
@@ -36,7 +36,7 @@ Verifiers: Environments for LLM Reinforcement Learning
 
 - [01/08/26] v0.1.9 is released, featuring a number of new experimental environment class types, monitor rubrics for automatic metric collection, improved workspace setup flow, improved error handling, bug fixes, and a documentation overhaul.
 - [11/19/25] v0.1.8 is released, featuring a major refactor of the rollout system to use trajectory-based tracking for token-in token-out training across turns, as well as support for truncated or branching rollouts.
-- [11/07/25] Verifiers v0.1.7 is released! This includes an improved quickstart configuration for training with [prime-rl], a new included "nano" trainer (`vf.RLTrainer`, replacing `vf.GRPOTrainer`), and a number of bug fixes and improvements to the documentation.
+- [11/07/25] Verifiers v0.1.7 is released! This includes an improved quickstart configuration for training with [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl), a new included "nano" trainer (`vf.RLTrainer`, replacing `vf.GRPOTrainer`), and a number of bug fixes and improvements to the documentation.
 - [10/27/25] A new iteration of the Prime Intellect [Environments Program](https://docs.google.com/spreadsheets/d/13UDfRDjgIZXsMI2s9-Lmn8KSMMsgk2_zsfju6cx_pNU/edit?gid=0#gid=0) is live!
 
 
@@ -159,17 +159,17 @@ prime eval run primeintellect/math-python
 
 ## Documentation
 
-**[Environments](environments.md)** — Create datasets, rubrics, and custom multi-turn interaction protocols.
+**[Environments](docs/environments.md)** — Create datasets, rubrics, and custom multi-turn interaction protocols.
 
-**[Evaluation](evaluation.md)** - Evaluate models using your environments.
+**[Evaluation](docs/evaluation.md)** - Evaluate models using your environments.
 
-**[Training](training.md)** — Train models in your environments with reinforcement learning.
+**[Training](docs/training.md)** — Train models in your environments with reinforcement learning.
 
-**[Development](development.md)** — Contributing to verifiers
+**[Development](docs/development.md)** — Contributing to verifiers
 
-**[API Reference](reference.md)** — Understanding the API and data structures
+**[API Reference](docs/reference.md)** — Understanding the API and data structures
 
-**[FAQs](faqs.md)** - Other frequently asked questions.
+**[FAQs](docs/faqs.md)** - Other frequently asked questions.