Commit 5d718d8

polish
1 parent 255941f commit 5d718d8

3 files changed

Lines changed: 19 additions & 6 deletions

content/blog/annotation/index.md

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@ show_word_count: true
 ---
 {{< button href="https://github.com/terminal-agent/reptile" label="GITHUB" external=true >}}
 {{< button href="https://github.com/terminal-agent/AnnotationGuidelines" label="GUIDELINE" external=true >}}
-{{< button href="https://www.notion.so/On-Policy-SFT-Annotation-How-Minimal-Human-Edits-Unlock-Massive-Gains-in-LLM-Agents-2b80ba07baa880d6ba7fca50816d33f2?source=copy_link" label="NOTION-Version" external=true >}}
+{{< button href="https://www.notion.so/On-Policy-SFT-Annotation-How-Minimal-Human-Edits-Unlock-Massive-Gains-in-LLM-Agents-2b80ba07baa880d6ba7fca50816d33f2" label="NOTION-Version" external=true >}}
 
 
 

content/blog/tool/index.md

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@ show_word_count: true
 ---
 {{< button href="https://github.com/sail-sg/tty-use" label="GITHUB" external=true >}}
 {{< button href="https://x.com/mavenlin/status/1977758827366817929" label="TWITTER" external=true >}}
-{{< button href="https://tinyurl.com/vrwcmpks " label="NOTION-Version" external=true >}}
+{{< button href="https://tinyurl.com/vrwcmpks" label="NOTION-Version" external=true >}}
 
 
 

content/blog/workflow/index.md

Lines changed: 17 additions & 4 deletions
@@ -41,20 +41,29 @@ show_word_count: true
 
 ## Introduction
 
-We propose **Reptile**, a terminal agent that operates under an extended **REPL (Read-Execute-Print-Learn)** protocol, where human feedback is seamlessly integrated into the agent's execution loop.
+We propose **Reptile**, a terminal agent that operates under an extended **REPL (Read-Execute-Print-Learn Loop)** protocol, where human feedback is seamlessly integrated into the agent's execution loop.
 
-Unlike traditional REPL (Read-Execute-Print-Loop) environments that focus solely on code evaluation, our REPL protocol emphasizes the iterative cycle of human-agent collaboration, transforming the terminal from a passive command executor into an interactive learning environment.
+Unlike traditional REPL (Read-Execute-Print Loop) environments that focus solely on code evaluation, our REPL protocol emphasizes the iterative cycle of human-agent collaboration, transforming the terminal from a passive command executor into an interactive learning environment.
 
 
 <figure style="text-align: center; margin: 1rem 0;">
 <img src="https://hackmd.io/_uploads/SkGiD2BWWl.png" style="width: 60%; display: block; margin: 0 auto;">
 </figure>
 
 
-This blog focus on **workflow** and **evaluation**. We detail the **on-policy annotation** and **SFT training** in https://terminal-agent.github.io/blog/annotation/
+This blog focus on **workflow** and **evaluation**. We detail the **on-policy annotation** and **SFT training** in https://terminal-agent.github.io/blog/annotation/.
 
+## What Makes Reptile Workflow Special?
 
-## Insights
+Compared with other CLI agents (e.g., Claude Code and Mini SWE-Agent), Reptile stands out because of:
+
+- **Terminal-only beyond Bash-only**: Simple and stateful execution, which is more efficient than bash-only (you don't need to specify the environment in every command). It doesn't require the complicated MCP protocol—just a naive bash tool under the REPL protocol.
+> See [TTY-use Post](https://terminal-agent.github.io/blog/tool/) for more details on making terminal backend work.
+- **Human-in-the-Loop Learning**: Users can inspect every step and provide prompt feedback, i.e., give feedback under the USER role or edit the LLM generation under the ASSISTANT role.
+> See [On-policy Annotation Post](https://terminal-agent.github.io/blog/annotation/) for more details on annotation and naive SFT training.
+
+
+## Our Insights in Building General Agents
 
 
 **Workflow**: Build the universal action space for the LLM, reserving specialized workflows only for high-risk operations.
@@ -218,3 +227,7 @@ If you find Reptile useful in your research or applications, please cite:
 note={Blog}
 }
 ```
+
+> **Fun fact**: The name "Reptile" has a dual meaning: it refers to the REPL (Read-Eval-Print-Learning Loop) workflow in terminal interactions, and also pays homage to OpenAI's Reptile meta-learning algorithm (2018), which pioneered few-shot adaptation. Like its namesake, our Reptile learns to quickly adapt to new tasks—but through human-in-the-loop collaboration rather than pure algorithmic optimization. Both share the same philosophy: learning efficiently from minimal examples to master diverse tasks.
+>
+> *Reference: [On First-Order Meta-Learning Algorithms](https://arxiv.org/abs/1803.02999)*
