This repository was archived by the owner on Jan 16, 2025. It is now read-only.
What is the correct configuration for training your model/setup, given that you are using an entirely different model and eval dataset than the original paper?
I tried your default configuration and also increased the training steps 10x to max_steps=10000, but output_stage_3/checkpoint-10000 reaches an accuracy score of only 0.006 on your gsm8k_synthetic_cot. Many of the output predictions are None, as in the following example. It also gets the answer for your infer.py example wrong.
Input:
It was Trevor's job to collect fresh eggs from the family's 4 chickens every morning. He got 4 eggs from Gertrude and 3 eggs from Blanche. Nancy laid 2 eggs as did Martha. On the way, he dropped 2 eggs. How many eggs did Trevor have left?
Target: 9
Predicted: None
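For reference, here is a minimal sketch of how the None predictions could be separated from genuinely wrong answers when computing the score. The helper name `summarize` and the (target, predicted) record layout are hypothetical and not taken from the repo's eval code; it assumes exact string match on the final answer.

```python
def summarize(records):
    """Summarize (target, predicted) pairs; predicted may be None.

    Hypothetical helper: a None prediction counts as incorrect and is
    also tallied separately so parse failures are visible.
    """
    total = len(records)
    unparsed = sum(1 for _, pred in records if pred is None)
    correct = sum(
        1
        for tgt, pred in records
        if pred is not None and str(pred).strip() == str(tgt).strip()
    )
    return {
        "accuracy": correct / total if total else 0.0,
        "none_rate": unparsed / total if total else 0.0,
    }


# Target for the egg example: 4 + 3 + 2 + 2 eggs collected, minus 2 dropped.
target = 4 + 3 + 2 + 2 - 2

print(summarize([(target, None)]))
```

A high none_rate would suggest the answer is not being extracted from the generated text at all, rather than the model computing wrong values.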
Thank you for creating this repo!