Skip to content

Commit 37a71b6

Browse files
committed
docs(apollo11): add images to relevant files
1 parent 3853d58 commit 37a71b6

4 files changed

Lines changed: 7 additions & 1 deletion

File tree

test_dataset_apollo11/.DS_Store

6 KB
Binary file not shown.

test_dataset_apollo11/RATIONALE.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,6 +42,8 @@ The excerpted length balances comprehensiveness with practical testability.
4242

4343
## Why These Excerpted Passages?
4444

45+
![image](images/test-selection.png)
46+
4547
**Continuous Narrative:**
4648

4749
Selected passages flow from descent through surface activities, forming a natural
@@ -64,7 +66,7 @@ and analytical reasoning.
6466

6567
**Verified Coverage:**
6668

67-
All 15 test prompts confirmed answerable with excerpted passages through
69+
All 21 test prompts confirmed answerable with excerpted passages through
6870
preliminary testing.
6971

7072
**Length Management:**

test_dataset_apollo11/README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -85,6 +85,8 @@ reasoning, RAG, paraphrasing and creative generation tasks.
8585

8686
## 📝 Test Structure
8787

88+
![image](images/evaluation-process.png)
89+
8890
The test includes **21 standardized prompts** distributed across **five categories**.
8991
In addition, a **Master Instruction** and **task-specific guidance prompts** are
9092
provided to ensure consistency and clarity across all tasks.

test_dataset_apollo11/test_prompts.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3,6 +3,8 @@
33
This document contains **21 standardized test prompts** for evaluating language
44
models using the Apollo 11 lunar landing context.
55

6+
![image](images/prompt-sequence.png)
7+
68
Please follow the instructions in order:
79
first **Master Prompt**,
810
then **task-specific instructions**,

0 commit comments

Comments
 (0)