Skip to content
This repository was archived by the owner on Jan 16, 2025. It is now read-only.
This repository was archived by the owner on Jan 16, 2025. It is now read-only.

Not able to generate meaningful results on gsm8k_synthetic_cot #4

@yongzx

Description

@yongzx

Thank you for creating this repo!

May I know what should be the correct configuration for training your model/setup (since you are using an entirely different model and eval dataset than the original paper)?

I tried your default configuration as well as increasing the training steps 10x to max_steps=10000, but the output_stage_3/checkpoint-10000 has an accuracy score of 0.006 on your gsm8k_synthetic_cot. Many of the output predictions are None such as the following. It also got the answer for your infer.py example wrong.

Input: 
It was Trevor's job to collect fresh eggs from the family's 4 chickens every morning.   He got 4 eggs from Gertrude and 3 eggs from Blanche.  Nancy laid 2 eggs as did Martha.  On the way, he dropped 2 eggs.  How many eggs did Trevor have left?
Target: 9
Predicted: None

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions