The safety-aligned model

Dear authors,
Thank you for open-sourcing your great work.

Could you please release the model checkpoints after safety training? 

Alternatively, could you provide guidance on how you performed safety training on HH-RLHF 
(e.g., minimal instructions to reproduce your setup on the OpenRLHF codebase) 
so I can check detailed setups other than those already specified in your Appendix B, such as LR scheduler, etc.?

Appreciate it.

Best, 
Arthur

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The safety-aligned model #2

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

The safety-aligned model #2

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions