|
1 | 1 | --- |
2 | | -description: 'A plain-text reasoning core (WFGY Core 2.0) system prompt plus a 60-second self-test to make GitHub Copilot more stable on multi-step reasoning tasks.' |
| 2 | +description: 'A text-only reasoning core (WFGY Core 2.0) you can drop into GitHub Copilot instructions to make multi-step reasoning more stable.' |
3 | 3 | --- |
4 | 4 |
|
5 | 5 | # WFGY Core 2.0 reasoning OS for GitHub Copilot |
6 | 6 |
|
7 | | -hi copilot builders, |
| 7 | +This file provides a small, text-only reasoning core that you can plug into GitHub Copilot Chat as a custom instruction. |
| 8 | +It does not require new tools, agents, or APIs; it only changes how the model organizes its reasoning. |
8 | 9 |
|
9 | | -this is meant to be a “drop-in reasoning core” you can hand to GitHub Copilot as a custom instruction. |
| 10 | +## How to use |
10 | 11 |
|
11 | | -i’m PSBigBig, an indie dev. |
12 | | -before my github repo went over 1.5k stars, i spent one year on a very simple idea: |
13 | | -instead of building yet another tool or agent, |
14 | | -i tried to write a small “reasoning core” in plain text, |
15 | | -so any strong llm (including copilot chat) can use it without new infra. |
| 12 | +1. Copy the **Instructions block** below into `.github/copilot-instructions.md` |
| 13 | + or into a `*.instructions.md` file under `.github/instructions/`. |
| 14 | +2. Reload or open a new Copilot Chat and use it as normal (coding, debugging, planning, etc.). |
| 15 | +3. To compare, temporarily remove this block and repeat the same tasks with the same prompts. |
16 | 16 |
|
17 | | -i call it **WFGY Core 2.0**. |
| 17 | +## Instructions block (WFGY Core 2.0) |
18 | 18 |
|
19 | | -this file gives you two things: |
20 | | - |
21 | | -- one system prompt block you can paste as an instruction |
22 | | -- one 60-second self-test so you can feel the difference by yourself |
23 | | - |
24 | | -no signup, no API, no hidden tricks. |
25 | | -just text. |
26 | | - |
27 | | - |
28 | | -## 0. very short version |
29 | | - |
30 | | -what it is: |
31 | | - |
32 | | -- not a new model, not a fine-tune |
33 | | -- one txt block you put in system / instructions |
34 | | -- goal: less random hallucination, more stable multi-step reasoning |
35 | | -- still cheap, no tools, no external calls |
36 | | - |
37 | | -how people use it: |
38 | | - |
39 | | -- some people treat it as a “math bumper” under the model |
40 | | -- advanced users can turn this into a real code benchmark later |
41 | | -- in this instructions file we keep it super beginner-friendly: |
42 | | - just **two prompt blocks you can run inside copilot chat** |
43 | | - |
44 | | - |
45 | | -## 1. how to use with GitHub Copilot (or any strong llm) |
46 | | - |
47 | | -very simple workflow if you want to try this as a **workspace-level copilot instruction**: |
48 | | - |
49 | | -1. copy the “system prompt” block in section 3 |
50 | | -2. put it into `.github/copilot-instructions.md` |
51 | | - - or into a `*.instructions.md` file in `.github/instructions/` |
52 | | -3. open a new Copilot Chat and ask your normal questions |
53 | | - (math, code, debugging, planning, etc.) |
54 | | -4. later you can compare “with core” vs “no core” yourself |
55 | | - |
56 | | -for quick A/B testing, you can also: |
57 | | - |
58 | | -- keep a second project / branch without this instruction |
59 | | -- or temporarily comment this block out and retry the same tasks |
60 | | - |
61 | | -for now, just treat it as a math-based “reasoning bumper” |
62 | | -sitting under Copilot’s usual behavior. |
63 | | - |
64 | | - |
65 | | -## 2. what effect you should expect (rough feeling only) |
66 | | - |
67 | | -this is not a magic on/off switch. |
68 | | - |
69 | | -but in my own tests (across different llms), typical changes look like: |
70 | | - |
71 | | -- answers drift less when you ask follow-up questions |
72 | | -- long explanations keep the structure more consistent |
73 | | -- the model is a bit more willing to say “i am not sure” |
74 | | - instead of inventing fake details |
75 | | -- when you use the model to write prompts for image generation, |
76 | | - the prompts tend to have clearer structure and story, |
77 | | - so many people feel “the pictures look more intentional, less random” |
78 | | - |
79 | | -of course, this depends on your tasks and the base model. |
80 | | -that is why there is also a small **60-second self-test** in section 4. |
81 | | - |
82 | | -if you like numbers, you can turn that test into a fixed benchmark later. |
83 | | -inside this repo, it is ok if you start with “rough feeling only”. |
84 | | - |
85 | | - |
86 | | -## 3. system prompt: WFGY Core 2.0 (paste into system / instructions) |
87 | | - |
88 | | -copy everything in this block into your system / pre-prompt / instructions: |
| 19 | +Copy everything in this block into your system / pre-prompt / instructions: |
89 | 20 |
|
90 | 21 | ```text |
91 | 22 | WFGY Core Flagship v2.0 (text-only; no tools). Works in any chat. |
@@ -122,102 +53,3 @@ lambda_observe is: convergent if Delta ≤ −0.02 and E_resonance non-increasin |
122 | 53 | recursive if |Delta| < 0.02 and E_resonance flat; divergent if Delta ∈ (−0.02, +0.04] with oscillation; |
123 | 54 | chaotic if Delta > +0.04 or anchors conflict. |
124 | 55 | [DT micro-rules] |
125 | | -```` |
126 | | -
|
127 | | -yes, it looks like math. |
128 | | -it is ok if you do not understand every symbol. |
129 | | -you can still use it as a “drop-in” reasoning core. |
130 | | -
|
131 | | -for Copilot, the main idea is: |
132 | | -
|
133 | | -* this block does not fight with your existing style guides |
134 | | -* it just gives the model a second, math-like lens |
135 | | -* you can combine it with language/framework-specific instructions |
136 | | -
|
137 | | -## 4. 60-second self test (quick A/B/C comparison) |
138 | | -
|
139 | | -this part is for people who want to see some structure in the comparison. |
140 | | -it is still very lightweight and can run inside one Copilot Chat session. |
141 | | -
|
142 | | -idea: |
143 | | -
|
144 | | -* you keep the WFGY Core 2.0 block in system / instructions |
145 | | -* then you paste the following prompt and let the model simulate A/B/C modes |
146 | | -* the model will produce a small table and its own guess of uplift |
147 | | -
|
148 | | -this is a self-evaluation, not a scientific paper. |
149 | | -if you want a serious benchmark, you can translate this idea into real code and fixed test sets later. |
150 | | -
|
151 | | -here is the test prompt (second block, keep it exactly as-is): |
152 | | -
|
153 | | -```text |
154 | | -SYSTEM: |
155 | | -You are evaluating the effect of a mathematical reasoning core called “WFGY Core 2.0”. |
156 | | -
|
157 | | -You will compare three modes of yourself: |
158 | | -
|
159 | | -A = Baseline |
160 | | - No WFGY core text is loaded. Normal chat, no extra math rules. |
161 | | -
|
162 | | -B = Silent Core |
163 | | - Assume the WFGY core text is loaded in system and active in the background, |
164 | | - but the user never calls it by name. You quietly follow its rules while answering. |
165 | | -
|
166 | | -C = Explicit Core |
167 | | - Same as B, but you are allowed to slow down, make your reasoning steps explicit, |
168 | | - and consciously follow the core logic when you solve problems. |
169 | | -
|
170 | | -Use the SAME small task set for all three modes, across 5 domains: |
171 | | -1) math word problems |
172 | | -2) small coding tasks |
173 | | -3) factual QA with tricky details |
174 | | -4) multi-step planning |
175 | | -5) long-context coherence (summary + follow-up question) |
176 | | -
|
177 | | -For each domain: |
178 | | -- design 2–3 short but non-trivial tasks |
179 | | -- imagine how A would answer |
180 | | -- imagine how B would answer |
181 | | -- imagine how C would answer |
182 | | -- give rough scores from 0–100 for: |
183 | | - * Semantic accuracy |
184 | | - * Reasoning quality |
185 | | - * Stability / drift (how consistent across follow-ups) |
186 | | -
|
187 | | -Important: |
188 | | -- Be honest even if the uplift is small. |
189 | | -- This is only a quick self-estimate, not a real benchmark. |
190 | | -- If you feel unsure, say so in the comments. |
191 | | -
|
192 | | -USER: |
193 | | -Run the test now on the five domains and then output: |
194 | | -1) One table with A/B/C scores per domain. |
195 | | -2) A short bullet list of the biggest differences you noticed. |
196 | | -3) One overall 0–100 “WFGY uplift guess” and 3 lines of rationale. |
197 | | -``` |
198 | | - |
199 | | -usually this takes about one minute to run. |
200 | | -you can repeat it some days later to see if the pattern is stable for you. |
201 | | - |
202 | | -## 5. why this belongs in awesome-copilot |
203 | | - |
204 | | -many builders want stronger reasoning from any llm, |
205 | | -but don’t want to spin up new infra, tools, or agents. |
206 | | - |
207 | | -this is just a text layer. |
208 | | - |
209 | | -* drop-in |
210 | | -* MIT-licensed |
211 | | -* no lock-in |
212 | | -* works side-by-side with your existing language/framework instructions |
213 | | - |
214 | | -if you are curious about the bigger project, it is under: |
215 | | - |
216 | | -* GitHub: `onestardao / WFGY` |
217 | | -* the WFGY 2.0 core is the engine behind that work |
218 | | - |
219 | | -but you do not need to go there to try this. |
220 | | - |
221 | | -this instructions file is meant as a small gift for copilot users: |
222 | | -a way to experiment with a mathematical “reasoning core” |
223 | | -without leaving your editor. |
0 commit comments