Skip to content

add paper: LLMs Meet Finance (SFT+DPO+RL for financial NLP)#86

Open
WhymustIhaveaname wants to merge 1 commit into
opendilab:mainfrom
WhymustIhaveaname:add-finance-paper
Open

add paper: LLMs Meet Finance (SFT+DPO+RL for financial NLP)#86
WhymustIhaveaname wants to merge 1 commit into
opendilab:mainfrom
WhymustIhaveaname:add-finance-paper

Conversation

@WhymustIhaveaname
Copy link
Copy Markdown

Our paper on fine-tuning Qwen2.5 and DeepSeek-R1 for the Open FinLLM Leaderboard. We do SFT first, then DPO to fix overlength outputs (54.7% → 1.7%), then RL with synthetic CoT data for tasks without training sets.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant