add paper: LLMs Meet Finance (SFT+DPO+RL for financial NLP) by WhymustIhaveaname · Pull Request #86 · opendilab/awesome-RLHF

WhymustIhaveaname · 2026-03-28T15:53:31Z

This PR adds our paper on fine-tuning Qwen2.5 and DeepSeek-R1 for the Open FinLLM Leaderboard. We apply SFT first, then DPO to reduce overlength outputs (54.7% → 1.7%), then RL with synthetic CoT data for tasks that lack training sets.
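For readers unfamiliar with the metric, here is a minimal sketch of how an overlength rate like the 54.7% → 1.7% figure could be measured. The function name `overlength_rate`, the `max_tokens` limit, and the whitespace-based token counter are illustrative assumptions, not the paper's evaluation code; a real setup would likely count tokens with the model's own tokenizer.

```python
from typing import Callable, Sequence


def overlength_rate(
    outputs: Sequence[str],
    max_tokens: int,
    # Assumption: whitespace splitting stands in for a real tokenizer.
    count_tokens: Callable[[str], int] = lambda text: len(text.split()),
) -> float:
    """Return the fraction of outputs whose token count exceeds max_tokens."""
    if not outputs:
        return 0.0
    too_long = sum(1 for text in outputs if count_tokens(text) > max_tokens)
    return too_long / len(outputs)


# Hypothetical model outputs, used only to exercise the function.
samples = ["short answer", "a much longer answer that keeps going on"]
print(f"{overlength_rate(samples, max_tokens=4):.1%}")
```

Running the same function on generations from before and after the DPO stage would give the two numbers being compared.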

add(Youran Sun): add LLMs Meet Finance paper on DPO and RL for financial NLP

9006d6e

add paper: LLMs Meet Finance (SFT+DPO+RL for financial NLP) #86
WhymustIhaveaname wants to merge 1 commit into opendilab:main from WhymustIhaveaname:add-finance-paper

WhymustIhaveaname commented Mar 28, 2026
