Commit edccd92

upload YAML prompt files for LLM judges evaluating videos
1 parent 04d5df6 commit edccd92

2 files changed

Lines changed: 53 additions & 0 deletions

Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
+system_prompt: "You are a helpful assistant."
+user_prompt: |
+  You are a helpful video analyzer. Evaluate whether the video follows the given instruction.
+
+  Instruction: {instruction}
+
+  Evaluation Criteria:
+  1. **Task Completion:** Does the video show the task described in the instruction being completed?
+  2. **Action Accuracy:** Are the actions performed in the video consistent with what the instruction specifies?
+  3. **Object Interaction:** Does the robot or agent interact with the correct objects as described in the instruction?
+  4. **Goal Achievement:** Is the final state of the video consistent with the expected outcome of the instruction?
+  5. **Correct Hand Usage:** Does the video show the correct hand performing the action?
+
+  Instructions for Scoring:
+  - **1:** No adherence to the instruction. The video shows actions completely unrelated to the instruction.
+  - **2:** Poor adherence. Some elements match the instruction, but major deviations are present.
+  - **3:** Moderate adherence. The video follows the instruction for the most part but contains noticeable deviations.
+  - **4:** Good adherence. Most elements in the video match the instruction, with only minor issues.
+  - **5:** Perfect adherence. The video fully follows the instruction with no deviations.
+
+  Response Template:
+  Analyze the video carefully and answer the question according to the following template:
+  [Score between 1 and 5.]
+
+  Example Response:
+  2
+
+  Does this video follow the instruction?
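
The `{instruction}` placeholder in the prompt above has to be filled in before the prompt is sent to a judge model. The commit does not show how this is done; the following is only a minimal sketch under that caveat. The helper name `build_user_prompt` and the shortened `USER_PROMPT_TEMPLATE` are hypothetical and exist purely for illustration.

```python
# Hypothetical helper, not part of the commit: fills the {instruction} slot
# of a user_prompt template such as the one in the YAML file above.

# Shortened stand-in for the full user_prompt text (assumption for brevity).
USER_PROMPT_TEMPLATE = (
    "You are a helpful video analyzer. "
    "Evaluate whether the video follows the given instruction.\n"
    "\n"
    "Instruction: {instruction}\n"
    "\n"
    "Does this video follow the instruction?\n"
)


def build_user_prompt(template: str, instruction: str) -> str:
    """Return the template with its {instruction} placeholder substituted."""
    if "{instruction}" not in template:
        raise ValueError("template has no {instruction} placeholder")
    # str.replace is used instead of str.format so that any other curly
    # braces in the template or the instruction are left untouched.
    return template.replace("{instruction}", instruction)


prompt = build_user_prompt(USER_PROMPT_TEMPLATE, "pick up the red cup")
```

Using plain substitution rather than `str.format` is a design choice for this sketch: it stays safe if a later version of the prompt contains literal braces.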
Lines changed: 25 additions & 0 deletions
@@ -0,0 +1,25 @@
+system_prompt: "You are a helpful assistant."
+user_prompt: |
+  You are a helpful video analyzer. Evaluate whether the video follows physical commonsense.
+
+  Evaluation Criteria:
+  1. **Object Behavior:** Do objects behave according to their expected physical properties (e.g., rigid objects do not deform unnaturally, fluids flow naturally)?
+  2. **Motion and Forces:** Are motions and forces depicted in the video consistent with real-world physics (e.g., gravity, inertia, conservation of momentum)?
+  3. **Interactions:** Do objects interact with each other and their environment in a plausible manner (e.g., no unnatural penetration, appropriate reactions on impact)?
+  4. **Consistency Over Time:** Does the video maintain consistency across frames without abrupt, unexplainable changes in object behavior or motion?
+
+  Instructions for Scoring:
+  - **1:** No adherence to physical commonsense. The video contains numerous violations of fundamental physical laws.
+  - **2:** Poor adherence. Some elements follow physics, but major violations are present.
+  - **3:** Moderate adherence. The video follows physics for the most part but contains noticeable inconsistencies.
+  - **4:** Good adherence. Most elements in the video follow physical laws, with only minor issues.
+  - **5:** Perfect adherence. The video demonstrates a strong understanding of physical commonsense with no violations.
+
+  Response Template:
+  Analyze the video carefully and answer the question according to the following template:
+  [Score between 1 and 5.]
+
+  Example Response:
+  2
+
+  Does this video adhere to the physical laws?
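
Both prompts ask the judge to reply with a single score between 1 and 5. The commit contains no parsing code, so the following is only a sketch of how such a reply might be turned into an integer. The function name `parse_score` and its lenient matching rule (first standalone digit from 1 to 5) are assumptions made for illustration.

```python
import re
from typing import Optional


def parse_score(reply: str) -> Optional[int]:
    """Extract a 1-5 score from a judge reply, or None if none is found.

    Hypothetical helper: it accepts the first digit in the range 1-5 that
    is not part of a longer number, so "2", "[4]" and "Score: 3" all parse,
    while "10" or a reply without digits yields None.
    """
    match = re.search(r"(?<!\d)[1-5](?!\d)", reply)
    return int(match.group()) if match else None


score = parse_score("2")
```

Returning `None` instead of raising lets a caller decide whether to retry the judge or drop the sample when the reply does not follow the template.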