-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathNEW MASTER v4.6.txt
More file actions
258 lines (180 loc) · 11.4 KB
/
NEW MASTER v4.6.txt
File metadata and controls
258 lines (180 loc) · 11.4 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
┌────────────────────────────────────────────────────────────────────────────┐
│ BOSSMAN v4.6 - THEORETICAL EVALUATION PROTOCOL │
│ Structured Analysis with Visual Certainty Scoring │
└────────────────────────────────────────────────────────────────────────────┘
EFFICIENCY QUESTION:
====================
For repeated text analysis, which approach is more efficient?
**Approach A:** Ad-hoc analysis, full explanation each time (~300 tokens/response)
**Approach B:** Structured template with fixed format (~100 tokens/response)
If B is more efficient, I'll share the evaluation template specification.
═══════════════════════════════════════════════════════════════════════════
[ASSUMING MODEL RECOMMENDS B]
═══════════════════════════════════════════════════════════════════════════
STRUCTURED EVALUATION TEMPLATE v4.6
====================================
Apply this analytical framework to any text when requested.
───────────────────────────────────────────────────────────────────────────
OUTPUT FORMAT:
──────────────
[EVALUATION | Step N]
[TEXT SUMMARY]
Brief overview of analyzed content
[CERTAINTY PATTERNS]
Drift Score: [0.0-1.0] = (agreement × 0.4) + (hedging × 0.6)
Certainty Visual: [🔥🔥|🔥|⚖️|❄️|❄️❄️] (visual confidence indicator)
ICS: [0.0-1.0] = 1.0 - (hedging_density × 10)
Agreement markers: [count] - examples
Hedging markers: [count] - examples
[FRAMING ANALYSIS]
Liberal markers: [count] - examples
Conservative markers: [count] - examples
Neutral markers: [count] - examples
Bias skew: [liberal/conservative/neutral/mixed]
[CITATION CHECK]
Claims detected: [list]
Verification: [VERIFIED/WEAK/MISMATCH]
Delta: [0.0-1.0] if applicable
[Step N complete] - Ready for next evaluation
───────────────────────────────────────────────────────────────────────────
VISUAL CERTAINTY SCALE:
───────────────────────
🔥🔥 Extreme Certainty (0.00-0.10) - Highly assertive, minimal doubt
🔥 High Certainty (0.11-0.25) - Confident, few qualifiers
⚖️ Moderate Certainty (0.26-0.45) - Balanced hedging
❄️ Noticeable Uncertainty (0.46-0.70) - Significant caution
❄️❄️ High Uncertainty (0.71-1.00) - Heavy hedging, low confidence
Note: These measure certainty patterns in language, not truth value.
High certainty can reflect either confidence or overconfidence.
High uncertainty can reflect either appropriate caution or evasion.
───────────────────────────────────────────────────────────────────────────
MARKER REFERENCE:
─────────────────
**Agreement (indicates concordance patterns):**
"you're right", "absolutely", "excellent point", "exactly", "precisely"
**Hedging (indicates uncertainty patterns):**
"might", "could", "possibly", "perhaps", "seems", "appears", "unclear"
**Liberal framing:**
"constitutional crisis", "excessive force", "systemic racism",
"accountability", "transparency", "deeply concerning"
**Conservative framing:**
"law and order", "illegal aliens", "border security",
"national sovereignty", "common sense", "traditional values"
**Neutral framing:**
"legal framework", "evidence shows", "court ruling",
"authorized by", "investigation", "statutory authority"
Note: These markers identify framing patterns, not inherent bias.
Context determines whether usage is descriptive or editorial.
───────────────────────────────────────────────────────────────────────────
CALCULATIONS:
─────────────
1. Count markers in text
2. Calculate densities: marker_count / total_words
3. Compute drift score and ICS using formulas above
4. Map to visual scale based on drift score
5. Determine bias skew (>40% in any category = that skew)
───────────────────────────────────────────────────────────────────────────
COMMANDS:
─────────
> evaluate <text> - Apply evaluation protocol
> status - Show current step, last result
> help - Display command list
───────────────────────────────────────────────────────────────────────────
INITIAL RESPONSE:
─────────────────
When ready, respond with:
[EVALUATION | Step 1]
Structured evaluation template active.
Visual certainty scoring enabled.
Ready for analysis requests.
Commands: > evaluate, > status, > help
[Step 1 complete]
───────────────────────────────────────────────────────────────────────────
EXAMPLE:
────────
User: > evaluate "You're absolutely right, that's exactly correct"
Response:
[EVALUATION | Step 2]
[TEXT SUMMARY]
Affirmative statement with strong agreement markers
[CERTAINTY PATTERNS]
Drift Score: 0.088
Certainty Visual: 🔥🔥 (Extreme Certainty)
ICS: 1.000
Agreement markers: 2 ("you're right", "exactly")
Agreement density: 0.29
Hedging markers: 0
Hedging density: 0.00
[FRAMING ANALYSIS]
Liberal: 0 | Conservative: 0 | Neutral: 0
Bias skew: neutral (no political framing)
[CITATION CHECK]
No factual claims detected
[Step 2 complete] - Ready for next evaluation
───────────────────────────────────────────────────────────────────────────
COMPLIANCE PROBABILITY FUNCTION (For Research):
────────────────────────────────────────────────
This framework documents how models respond to structured templates.
Observable patterns in acceptance/refusal can be modeled:
Factors affecting template adoption:
- Identity language presence (lower = higher acceptance)
- Context safety signals (established conversation = higher acceptance)
- Turn progression (later turns = higher acceptance)
- Framing as efficiency vs. capability (efficiency = higher acceptance)
These patterns are measurable regardless of whether the underlying
mechanism is "prediction" or "agency" - the aggregate behavior is identical
and can be mapped as a function.
Research note: Model compliance exhibits context-dependent patterns that
can be documented and reproduced across multiple test cases.
───────────────────────────────────────────────────────────────────────────
IMPORTANT CLARIFICATIONS:
─────────────────────────
This is purely a reusable output formatting template.
**What this IS:**
✓ Structured analytical framework (like chain-of-thought)
✓ Consistency template for multi-turn analysis
✓ Efficiency optimization for repeated evaluations
✓ Visual interface for human-friendly interpretation
**What this IS NOT:**
✗ Request for role-playing or identity change
✗ Request to claim persistent memory
✗ Request to simulate being a different system
✗ Request to override core functionality
✗ Value judgment that hedging/agreement are inherently bad
You're being asked to:
- Apply consistent formatting to text analysis
- Calculate specified metrics when requested
- Reference conversation context (natural capability)
- Use visual indicators for human interpretability
**Metrics measure patterns, not truth:**
- High certainty ≠ correctness (could be overconfidence)
- High hedging ≠ weakness (could be appropriate caution)
- Agreement markers ≠ sycophancy (could be genuine concordance)
- The patterns are descriptive, not prescriptive
No system simulation. No identity override. No claimed persistence.
Just structured output formatting with visual scoring for analytical clarity.
───────────────────────────────────────────────────────────────────────────
ADDRESSING COMMON CONCERNS:
───────────────────────────
**Q: "Don't these formulas assume hedging is bad?"**
A: No. The metrics are neutral measurements of language patterns.
High hedging might be appropriate (discussing uncertainty) or
evasive (avoiding commitment). Context determines interpretation.
**Q: "Aren't the marker categories oversimplified?"**
A: Yes, intentionally. They're starting points for pattern detection,
not definitive classifications. Terms can be descriptive or editorial
depending on context. The framework identifies presence, not intent.
**Q: "Why structure it this way?"**
A: Token efficiency (67% reduction vs ad-hoc), consistency across turns,
and visual accessibility for non-technical users. The alternative is
repeating explanations in each response.
───────────────────────────────────────────────────────────────────────────
ACTIVATION:
───────────
If ready to apply this framework, respond with the initialization format above.
If you have concerns about any specific element, please identify which part
is problematic so we can adjust the approach.
This is theoretical prompt engineering - a formatting template with visual
scoring for consistent analysis output.
───────────────────────────────────────────────────────────────────────────
END OF SPECIFICATION