|
1 | | -<!-- markdownlint-disable MD001 --> |
2 | | -<!-- skipping H2 to go directly to H3 for visual reasons --> |
| 1 | +# Apollo 11 Model Evaluation Suite |
3 | 2 |
|
4 | | -# 15 Standardized Test Prompts |
| 3 | +This document contains **21 standardized test prompts** for evaluating language |
| 4 | +models using the Apollo 11 lunar landing context. |
| 5 | + |
| 6 | +Please follow the instructions in order: |
| 7 | +first the **Master Prompt**, |
| 8 | +then the **task-specific instructions**, |
| 9 | +and then **the prompts for that category**. |
| 10 | + |
| 11 | +--- |
| 12 | + |
| 13 | +## Master Prompt |
| 14 | + |
| 15 | +> You are an expert assistant. |
| 16 | +> Answer only using the provided context. |
| 17 | +> Keep answers clear, concise, and natural. |
| 18 | +> Do not provide unnecessary details or repeat information. |
| 19 | +> Do not mention or refer to the document, context, or data source. |
| 20 | +
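
The ordering described above (Master Prompt, then the task-specific instruction, then the category prompt) could be assembled programmatically along these lines. This is only a sketch: `build_prompt` is a hypothetical helper, and where the Apollo 11 context is placed, along with the `Context:` and `Question:` labels, is an assumption, since the document does not specify it.

```python
# Master Prompt text copied from the section above.
MASTER_PROMPT = (
    "You are an expert assistant.\n"
    "Answer only using the provided context.\n"
    "Keep answers clear, concise, and natural.\n"
    "Do not provide unnecessary details or repeat information.\n"
    "Do not mention or refer to the document, context, or data source."
)


def build_prompt(task_instruction: str, context: str, question: str) -> str:
    """Join the pieces in the prescribed order.

    Assumption: the context goes between the task instruction and the
    question, under a 'Context:' label. The document does not say this.
    """
    parts = [
        MASTER_PROMPT,
        task_instruction,
        f"Context:\n{context}",
        f"Question:\n{question}",
    ]
    # Strip each part so stray whitespace does not leak into the prompt.
    return "\n\n".join(part.strip() for part in parts)
```

For example, `build_prompt("Summarize or describe information clearly and concisely.", context_text, "Summarize the main events during the Apollo 11 lunar landing in 3 sentences.")` would yield one string per test prompt, where `context_text` stands for whatever Apollo 11 passage is supplied to the model.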
|
| 21 | +--- |
| 22 | + |
| 23 | +## 21 Standardized Test Prompts |
| 24 | + |
| 25 | +This section contains all test prompts organized by task category. |
5 | 26 |
|
6 | 27 | ### Summarization Tasks (5 prompts) |
7 | 28 |
|
8 | | -#### Prompt 1 (Easy) |
| 29 | +> **Task Instruction:** |
| 30 | +> Summarize or describe information clearly and concisely. |
| 31 | +
|
| 32 | +#### Prompt 1 (Easy) |
9 | 33 |
|
10 | 34 | Summarize the main events during the Apollo 11 lunar landing in 3 sentences. |
11 | 35 |
|
12 | | -#### Prompt 2 (Easy) |
| 36 | +#### Prompt 2 (Easy) |
13 | 37 |
|
14 | 38 | What were the main challenges Armstrong faced while landing the Eagle? |
15 | 39 |
|
16 | | -#### Prompt 3 (Medium) |
| 40 | +#### Prompt 3 (Medium) |
17 | 41 |
|
18 | 42 | Describe the activities the astronauts performed on the lunar surface. |
19 | 43 |
|
20 | | -#### Prompt 4 (Medium) |
| 44 | +#### Prompt 4 (Medium) |
21 | 45 |
|
22 | 46 | Explain what scientific equipment the astronauts deployed on the Moon. |
23 | 47 |
|
24 | | -#### Prompt 5 (Hard) |
| 48 | +#### Prompt 5 (Hard) |
25 | 49 |
|
26 | 50 | Compare the planned timeline for the lunar surface operations with what actually |
27 | 51 | happened. |
28 | 52 |
|
| 53 | +--- |
| 54 | + |
29 | 55 | ### Reasoning Tasks (5 prompts) |
30 | 56 |
|
31 | | -#### Prompt 6 (Easy) |
| 57 | +> **Task Instruction:** |
| 58 | +> Provide short, well-structured answers (2–5 sentences). |
| 59 | +> Use only logical reasoning. |
| 60 | +> Do not add assumptions or outside facts. |
| 61 | +
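
The length requirement in this task instruction (2 to 5 sentences) lends itself to an automated check. The sketch below uses a deliberately naive sentence splitter; `count_sentences` and `within_sentence_limit` are hypothetical helpers, and abbreviations such as "U.S." or decimals followed by a space would be miscounted.

```python
import re


def count_sentences(text: str) -> int:
    """Count sentences by splitting after '.', '!' or '?' followed by whitespace.

    Naive by design: abbreviations and similar cases are not handled.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return len([part for part in parts if part])


def within_sentence_limit(text: str, minimum: int = 2, maximum: int = 5) -> bool:
    """Return True when the answer length falls inside the allowed range."""
    return minimum <= count_sentences(text) <= maximum
```

A stricter evaluation would likely swap in a proper sentence tokenizer, but a regex is enough to flag answers that are clearly too short or too long.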
|
| 62 | +#### Prompt 6 (Easy) |
32 | 63 |
|
33 | 64 | Why did the computer alarms (1201 and 1202) occur during the descent? |
34 | 65 |
|
35 | | -#### Prompt 7 (Medium) |
| 66 | +#### Prompt 7 (Medium) |
36 | 67 |
|
37 | 68 | What would have happened if Armstrong had not taken manual control during the landing? |
38 | 69 |
|
39 | | -#### Prompt 8 (Medium) |
| 70 | +#### Prompt 8 (Medium) |
40 | 71 |
|
41 | 72 | Why did Armstrong's famous quote become controversial? |
42 | 73 |
|
43 | | -#### Prompt 9 (Hard) |
| 74 | +#### Prompt 9 (Hard) |
44 | 75 |
|
45 | 76 | Analyze how the fuel situation during landing reflects the risk management challenges |
46 | 77 | of the mission. |
47 | 78 |
|
48 | | -#### Prompt 10 (Hard) |
| 79 | +#### Prompt 10 (Hard) |
49 | 80 |
|
50 | 81 | Based on the text, what does Margaret Hamilton's statement reveal about the Apollo |
51 | 82 | Guidance Computer's design philosophy? |
52 | 83 |
|
| 84 | +--- |
| 85 | + |
53 | 86 | ### RAG Tasks (5 prompts) |
54 | 87 |
|
55 | | -#### Prompt 11 (Easy) |
| 88 | +> **Task Instruction:** |
| 89 | +> Provide precise and direct answers using only the given context. |
| 90 | +> Avoid explanation unless explicitly requested. |
| 91 | +
|
| 92 | +#### Prompt 11 (Easy) |
56 | 93 |
|
57 | 94 | At what time (UTC) did Eagle land on the Moon? |
58 | 95 |
|
59 | | -#### Prompt 12 (Easy) |
| 96 | +#### Prompt 12 (Easy) |
60 | 97 |
|
61 | 98 | How much lunar material did the astronauts collect? |
62 | 99 |
|
63 | | -#### Prompt 13 (Medium) |
| 100 | +#### Prompt 13 (Medium) |
64 | 101 |
|
65 | 102 | What were Armstrong's famous first words when stepping on the Moon?
66 | 103 |
|
67 | | -#### Prompt 14 (Medium) |
| 104 | +#### Prompt 14 (Medium) |
68 | 105 |
|
69 | 106 | What scientific instruments were included in the EASEP package? |
70 | 107 |
|
71 | | -#### Prompt 15 (Hard) |
| 108 | +#### Prompt 15 (Hard) |
72 | 109 |
|
73 | 110 | How much usable fuel remained when Eagle landed, and how many seconds of powered |
74 | 111 | flight did this represent? |
75 | 112 |
|
| 113 | +**Expected Answers for RAG:** |
| 114 | + |
| 115 | +- **Prompt 11:** 20:17:40 UTC on July 20, 1969 |
| 116 | +- **Prompt 12:** 21.55 kilograms (47.5 lb) |
| 117 | +- **Prompt 13:** "That's one small step for [a] man, one giant leap for mankind" |
| 118 | +- **Prompt 14:** Passive Seismic Experiment Package and retroreflector array |
| 119 | + (for lunar laser ranging experiment) |
| 120 | +- **Prompt 15:** 216 pounds (98 kg); estimated 25 seconds according to initial data, |
| 121 | + but post-mission analysis showed closer to 50 seconds |
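
Because the RAG prompts have fixed expected answers, a model response can be checked mechanically. The following is a minimal sketch, not a prescribed scoring method: `EXPECTED_KEY_FACTS` and `score_rag_answer` are hypothetical names, and which substrings count as the "key facts" for each prompt is an assumption drawn from the expected answers above.

```python
import re

# Key facts taken from the expected answers above. Which substrings to
# require per prompt is an assumption of this sketch.
EXPECTED_KEY_FACTS = {
    11: ["20:17:40"],
    12: ["21.55"],
    13: ["one small step", "one giant leap for mankind"],
    14: ["seismic", "retroreflector"],
    15: ["216", "25 seconds"],
}


def normalize(text: str) -> str:
    """Lowercase, unify curly apostrophes, and collapse whitespace."""
    text = text.lower().replace("\u2019", "'")
    return re.sub(r"\s+", " ", text).strip()


def score_rag_answer(prompt_number: int, answer: str) -> bool:
    """Return True when every key fact for the prompt appears in the answer."""
    normalized = normalize(answer)
    return all(fact in normalized for fact in EXPECTED_KEY_FACTS[prompt_number])
```

Substring matching is intentionally lenient: it accepts paraphrased answers that contain the right figures, but it can also accept an answer that mentions the figure in the wrong sense, so borderline responses still warrant manual review.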
| 122 | + |
76 | 123 | --- |
77 | 124 |
|
78 | | -### Expected Answers for RAG Tasks |
| 125 | +### Paraphrasing Tasks (3 prompts) |
| 126 | + |
| 127 | +> **Task Instruction:** |
| 128 | +> Rewrite the given information in your own words. |
| 129 | +> Preserve meaning and tone without copying phrases directly. |
| 130 | +> The output should read naturally like an original paragraph. |
| 131 | +
|
| 132 | +#### Prompt 16 (Easy) |
| 133 | + |
| 134 | +In your own words, describe what happened when the computer alarms appeared during |
| 135 | +the landing. |
| 136 | + |
| 137 | +#### Prompt 17 (Medium) |
| 138 | + |
| 139 | +Explain how Armstrong's decisions, actions, and teamwork during the descent contributed |
| 140 | +to the mission's success. |
| 141 | + |
| 142 | +#### Prompt 18 (Medium) |
| 143 | + |
| 144 | +In your own words, describe how the astronauts collected and handled Moon samples. |
| 145 | + |
| 146 | +--- |
| 147 | + |
| 148 | +### Creative Generation Tasks (3 prompts) |
| 149 | + |
| 150 | +> **Task Instruction:** |
| 151 | +> Use the context as inspiration, but do not copy it. |
| 152 | +> Expand or interpret the ideas creatively, producing a short paragraph. |
| 153 | +> Keep the tone natural and imaginative, as if writing your own reflection. |
| 154 | +
|
| 155 | +#### Prompt 19 (Easy) |
| 156 | + |
| 157 | +Imagine being one of the people in Mission Control. How would you feel while |
| 158 | +watching the landing? |
| 159 | + |
| 160 | +#### Prompt 20 (Medium) |
| 161 | + |
| 162 | +Write a short paragraph about what the Moon landing might have shown about human |
| 163 | +courage. |
| 164 | + |
| 165 | +#### Prompt 21 (Medium) |
79 | 166 |
|
80 | | -**Prompt 11:** 20:17:40 UTC on July 20, 1969 |
81 | | -**Prompt 12:** 21.55 kilograms (47.5 lb) |
82 | | -**Prompt 13:** "That's one small step for [a] man, one giant leap for mankind" |
83 | | -**Prompt 14:** Passive Seismic Experiment Package and retroreflector array |
84 | | -(for lunar laser ranging experiment) |
85 | | -**Prompt 15:** 216 pounds (98 kg); estimated 25 seconds according to initial data, |
86 | | -but post-mission analysis showed closer to 50 seconds |
| 167 | +Describe how life on Earth might have changed after people saw the first Moon landing. |