Commit d3ef109

feat: add diverse reasoning prompts covering probability, algebra, geometry, logic, and word problems
1 parent b3cf7d9 commit d3ef109

1 file changed

Lines changed: 50 additions & 0 deletions

src/infer_check/prompt_suites/reasoning.jsonl

@@ -48,3 +48,53 @@
 {"text": "What is the remainder when 2^1000 is divided by 7?", "category": "chain_of_thought", "max_tokens": 512}
 {"text": "Three cards are placed face down: a king, a queen, and a jack. You pick one. The dealer reveals one of the remaining cards is a jack. Should you switch your choice?", "category": "logic", "max_tokens": 512}
 {"text": "Compute the cross product of vectors [1,2,3] and [4,5,6].", "category": "arithmetic", "max_tokens": 256}
+{"text": "A box contains 5 red balls and 3 blue balls. You draw two without replacement. What is the probability that both are different colors?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "Solve for all real values of x: x^4 - 5x^2 + 4 = 0. Show your substitution and factoring.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "A rectangular garden is 3 times as long as it is wide. Its perimeter is 96 meters. What are its dimensions and area?", "category": "word_problem", "max_tokens": 256}
+{"text": "If f(x) = 2x + 3 and g(x) = x^2 - 1, what is f(g(f(2)))?", "category": "arithmetic", "max_tokens": 256}
+{"text": "You roll three fair dice. What is the probability that all three show different numbers?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "A 10-meter ladder leans against a wall. The bottom is 6 meters from the wall. How high up the wall does the ladder reach? If the bottom slides outward at 0.5 m/s, how fast is the top sliding down when the bottom is 8 meters from the wall?", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "How many integers between 1 and 1000 are divisible by 3 or 5 but not both?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "Alice is taller than Bob. Bob is taller than Carol. Dave is shorter than Carol. Eve is taller than Alice. Rank all five from tallest to shortest.", "category": "logic", "max_tokens": 256}
+{"text": "A tank is being filled by Pipe A at 10 gallons/min and drained by Pipe B at 6 gallons/min. The tank holds 500 gallons and starts at 200 gallons. Pipe A runs alone for the first 20 minutes, then both pipes run. When is the tank full?", "category": "word_problem", "max_tokens": 512}
+{"text": "What is the sum of the interior angles of a 13-sided polygon? What about the exterior angles?", "category": "arithmetic", "max_tokens": 256}
+{"text": "Three logicians walk into a bar. The bartender asks 'Does everyone want a drink?' The first logician says 'I don't know.' The second says 'I don't know.' The third says 'Yes.' Explain the reasoning behind each answer.", "category": "logic", "max_tokens": 512}
+{"text": "Estimate how many piano tuners are in Chicago. Show your Fermi estimation reasoning.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "Convert the binary number 11011011.10101 to decimal. Show each positional calculation.", "category": "arithmetic", "max_tokens": 256}
+{"text": "A rope is wrapped once around the equator of the Earth (circumference ~40,075 km). If you add 1 meter to the rope and lift it uniformly, how high above the surface does it hover? Is the answer intuitive?", "category": "word_problem", "max_tokens": 512}
+{"text": "In a game, you can double your money with probability 0.6 or lose everything with probability 0.4. Starting with $100, what is the expected value after one round? After two rounds? Is this game worth playing repeatedly?", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "What is the next number in the sequence: 1, 1, 2, 3, 5, 8, 13, 21, 34, __? Now what is the next number in: 2, 6, 12, 20, 30, 42, __? Explain the pattern for each.", "category": "logic", "max_tokens": 256}
+{"text": "A circle is inscribed in a square with side length 10. A square is inscribed in that circle. What is the side length and area of the inner square?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "You have 1000 bottles of wine. One is poisoned. You have 10 test subjects. Each subject can drink from any number of bottles simultaneously, and results come back in exactly one round (one day). How many bottles can you test, and how do you identify the poisoned one?", "category": "logic", "max_tokens": 512}
+{"text": "Solve the inequality: (x-1)(x+2)(x-3) > 0. Express the solution as a union of intervals.", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "A car depreciates by 15% each year. If it costs $30,000 new, what is it worth after 5 years? After how many years is it worth less than $10,000?", "category": "word_problem", "max_tokens": 256}
+{"text": "Is it possible to cover an 8x8 chessboard with 2x1 dominoes if two diagonally opposite corners are removed? Prove or disprove.", "category": "logic", "max_tokens": 512}
+{"text": "What is the volume of a sphere with radius 7? What is the surface area? Express both in terms of π and as decimal approximations.", "category": "arithmetic", "max_tokens": 256}
+{"text": "You have a bag with 10 coins: 9 fair coins and 1 double-headed coin. You pick a random coin and flip it 3 times, getting heads each time. What is the probability the coin you picked is the double-headed one? Use Bayes' theorem.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "Evaluate the limit: lim (x→0) of (sin(5x))/(3x). Show your reasoning.", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "A shop sells apples for $1.50 each, bananas for $0.75 each, and cherries for $3.00 per bag. I buy some of each and spend exactly $20.25. I buy twice as many bananas as apples. If I buy 3 bags of cherries, how many apples and bananas did I buy?", "category": "word_problem", "max_tokens": 512}
+{"text": "In binary, what is 10110110 AND 11001101? What is 10110110 OR 11001101? What is 10110110 XOR 11001101?", "category": "arithmetic", "max_tokens": 256}
+{"text": "Five pirates find 100 gold coins. They have a strict seniority order and vote on proposals. The most senior proposes a split. If at least half accept, it's done. Otherwise, that pirate is thrown overboard and the next proposes. Each pirate is rational and wants to maximize their coins (and survive). What does the most senior pirate propose?", "category": "logic", "max_tokens": 1024}
+{"text": "A cylindrical can has volume 1000 cm³. What radius and height minimize the surface area? Solve using calculus.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "What is the probability of getting a sum of exactly 15 when rolling four fair six-sided dice?", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "A invests $5000 at 6% annual interest compounded quarterly. B invests $5000 at 5.9% compounded continuously. After 10 years, who has more money and by how much?", "category": "word_problem", "max_tokens": 512}
+{"text": "How many ways can 8 rooks be placed on an 8x8 chessboard such that no two attack each other?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "You're on a game show. Behind one of three doors is a car. You pick door 1. The host, who knows what's behind each door, opens door 3, revealing a goat. He offers you the chance to switch to door 2. What if there were 100 doors and the host opens 98 goat doors? Should you switch? Explain the generalized probability.", "category": "logic", "max_tokens": 512}
+{"text": "Simplify: (2 + 3i)(4 - i) + (1 + 2i)^2. Express in the form a + bi.", "category": "arithmetic", "max_tokens": 256}
+{"text": "An airplane flies 300 km/h in still air. With a headwind, a round trip of 600 km (300 km each way) takes 2.5 hours. What is the wind speed?", "category": "word_problem", "max_tokens": 512}
+{"text": "You flip a coin until you get two consecutive heads. What is the expected number of flips? Set up and solve the recurrence.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "Prove that the sum of any two odd numbers is even. Then prove that the product of any two odd numbers is odd.", "category": "logic", "max_tokens": 256}
+{"text": "How many distinct paths are there from the top-left to the bottom-right of a 6x4 grid if you can only move right or down?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "A rubber ball bounces to 75% of its drop height each time. If dropped from 10 meters, what is the total distance traveled (up and down combined) before it comes to rest? Express as a geometric series and compute.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "What is the expected number of times you must roll a fair die before seeing all six faces at least once? This is the coupon collector problem.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "Two adjacent sides of a parallelogram are 12 cm and 8 cm. The angle between them is 60 degrees. Find the area and the length of both diagonals.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "A function f satisfies f(x+y) = f(x) * f(y) for all real x, y, and f(1) = 3. What is f(5)? What is f(-2)? What is the general form of f?", "category": "logic", "max_tokens": 256}
+{"text": "Estimate the number of golf balls that can fit inside a school bus. Walk through your dimensional reasoning.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "There are 30 people in a room. What is the probability that at least two share a birthday month? How does this compare to the birthday problem for exact dates?", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "A clock loses 3 minutes every hour. It is set correctly at noon on Monday. What is the actual time when the clock shows 4:00 PM on Tuesday?", "category": "word_problem", "max_tokens": 256}
+{"text": "What is the minimum number of weighings on a balance scale needed to find one heavier coin among 27 coins, where all others weigh the same? Generalize to n coins.", "category": "logic", "max_tokens": 256}
+{"text": "Solve the recurrence a(n) = a(n-1) + 2a(n-2) with a(0) = 1, a(1) = 1. Find a closed-form using the characteristic equation.", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "A right triangle has legs of length a and b and hypotenuse c. A circle is inscribed in it. What is the radius of the inscribed circle in terms of a, b, and c?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "You have a string of 12 characters over the alphabet {A, B, C}. How many such strings have no two consecutive identical characters?", "category": "chain_of_thought", "max_tokens": 256}
+{"text": "Water flows into a conical tank at 2 cubic meters per minute. The cone has height 10 m and top radius 5 m (vertex at bottom). How fast is the water level rising when the depth is 4 m?", "category": "chain_of_thought", "max_tokens": 512}
+{"text": "Nine dots are arranged in a 3x3 grid. Connect all nine dots using four straight lines without lifting your pen. Describe the solution and explain why most people fail to find it.", "category": "logic", "max_tokens": 512}
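
Every entry in this hunk carries the same three keys: `text`, `category`, and `max_tokens`. A minimal sketch of how such a suite could be checked after an edit like this one is shown below. It assumes only that record shape; `load_prompts`, `load_prompt_file`, and `category_counts` are hypothetical helper names for illustration, not part of infer_check's API, and the sample lines are copied from the diff above.

```python
import json
from collections import Counter

# Keys that every entry in the hunk above carries.
REQUIRED_KEYS = {"text", "category", "max_tokens"}


def load_prompts(lines):
    """Parse an iterable of JSONL lines into prompt dicts.

    Raises ValueError on a missing key or a non-positive / non-integer
    max_tokens. Blank lines are skipped.
    """
    prompts = []
    for lineno, raw in enumerate(lines, start=1):
        raw = raw.strip()
        if not raw:
            continue
        entry = json.loads(raw)
        missing = REQUIRED_KEYS - entry.keys()
        if missing:
            raise ValueError(f"line {lineno}: missing keys {sorted(missing)}")
        max_tokens = entry["max_tokens"]
        # type(...) is int also rejects booleans, which isinstance would accept.
        if type(max_tokens) is not int or max_tokens <= 0:
            raise ValueError(f"line {lineno}: bad max_tokens {max_tokens!r}")
        prompts.append(entry)
    return prompts


def load_prompt_file(path):
    """Hypothetical convenience wrapper: read a .jsonl file from disk."""
    with open(path, encoding="utf-8") as fh:
        return load_prompts(fh)


def category_counts(prompts):
    """Tally how many prompts fall into each category."""
    return Counter(p["category"] for p in prompts)


# Two lines copied from the diff, used as an in-memory sample.
SAMPLE_LINES = [
    '{"text": "Compute the cross product of vectors [1,2,3] and [4,5,6].", '
    '"category": "arithmetic", "max_tokens": 256}',
    '{"text": "Alice is taller than Bob. Bob is taller than Carol. Dave is '
    'shorter than Carol. Eve is taller than Alice. Rank all five from tallest '
    'to shortest.", "category": "logic", "max_tokens": 256}',
]

sample_prompts = load_prompts(SAMPLE_LINES)
sample_counts = category_counts(sample_prompts)
```

Against a checkout of the repository, `load_prompt_file("src/infer_check/prompt_suites/reasoning.jsonl")` would run the same checks over the whole suite. Failing fast with a line number is a deliberate choice here, since a single malformed line in a JSONL file is otherwise easy to miss in a 50-line addition.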
