This repository was archived by the owner on Jan 16, 2025. It is now read-only.

Commit 5519bf4

add derivative/similar work
1 parent: 6ac1b5c

1 file changed: README.md

Lines changed: 5 additions & 0 deletions
@@ -4,6 +4,11 @@

OpenCoconut aims to replicate the Chain of Continuous Thought (COCONUT) paper, which introduces a novel latent reasoning paradigm. The main idea is to generate thoughts in latent space by reusing the hidden states during prefilling, before we start decoding the response. We build on the public math dataset from the paper: [casperhansen/gsm8k_synthetic_cot](https://huggingface.co/datasets/casperhansen/gsm8k_synthetic_cot).
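
To make the latent reasoning idea more concrete, here is a minimal toy sketch in plain Python. It is not the OpenCoconut API and not a Transformer: `ToyLatentModel`, `generate`, `HIDDEN`, and `VOCAB` are hypothetical names, the weights are random, and a small recurrent cell stands in for the real model. The only thing it tries to show is the control flow: prefill the prompt, feed the last hidden state back in as the next input embedding for a few "continuous thought" steps without sampling a token, and only then decode.

```python
import math
import random

HIDDEN = 4  # toy hidden size (assumption, for illustration only)
VOCAB = 6   # toy vocabulary size (assumption, for illustration only)


def make_matrix(rows, cols, rng):
    """Build a rows x cols matrix of small random weights."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyLatentModel:
    """Hypothetical stand-in for a language model with random weights."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.embed = make_matrix(VOCAB, HIDDEN, rng)   # token id -> input embedding
        self.w_in = make_matrix(HIDDEN, HIDDEN, rng)   # input embedding -> hidden
        self.w_rec = make_matrix(HIDDEN, HIDDEN, rng)  # previous hidden -> hidden
        self.w_out = make_matrix(VOCAB, HIDDEN, rng)   # hidden -> vocabulary logits

    def step(self, x, state):
        """Consume one input embedding and return the new hidden state."""
        a = matvec(self.w_in, x)
        b = matvec(self.w_rec, state)
        return [math.tanh(p + q) for p, q in zip(a, b)]

    def logits(self, hidden):
        return matvec(self.w_out, hidden)


def generate(model, prompt_ids, num_latent_thoughts, max_new_tokens):
    state = [0.0] * HIDDEN

    # 1. Prefill: run the prompt tokens through the model.
    for tok in prompt_ids:
        state = model.step(model.embed[tok], state)

    # 2. Latent thoughts: the last hidden state becomes the next input
    #    embedding, so no token is sampled during these steps.
    thoughts = []
    for _ in range(num_latent_thoughts):
        state = model.step(state, state)
        thoughts.append(state)

    # 3. Decode the response greedily, one token at a time.
    output_ids = []
    for _ in range(max_new_tokens):
        scores = model.logits(state)
        tok = max(range(VOCAB), key=scores.__getitem__)
        output_ids.append(tok)
        state = model.step(model.embed[tok], state)

    return thoughts, output_ids


if __name__ == "__main__":
    thoughts, output_ids = generate(ToyLatentModel(seed=0), [1, 2, 3], 2, 3)
    print(len(thoughts), "latent thoughts, decoded token ids:", output_ids)
```

In a real Transformer the recurrent `state` would be replaced by the key/value cache, and the hidden state fed back in would be the final-layer output at the last position. The training and inference scripts in `examples` are the place to look for the actual implementation.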

+## Derivative/Similar Work
+
+1. Derivative: A clean demonstration of how a modified OpenCoconut using Gemma 2 leads to improved performance on translation tasks: https://github.com/vicksEmmanuel/latent-gemma
+2. Similar: LucidRains implements a custom Transformer from scratch using the Coconut paradigm: https://github.com/lucidrains/coconut-pytorch
+
## Getting started

Install the package and then go look in `examples` for how to run training and inference.
