Quality of VM

Thanks for your impressive work.

I have trained mistral VM following `PRM/train_VM_mistral.py` and use it to guide ToT evaluation in `evaluate.py`. But after training 2 epochs following recommended setting, the test accuracy is only **0.1582**. And it outputs unreliable scores to tree nodes.

Since the default depth and branch is limited and exploring nodes are ranked by values, an relatively accurate score seems necessary. So I wonder if this is a normal situation, and how do you handle this problem?

 Looking forward for your help, thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quality of VM #24

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Quality of VM #24

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions