Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md

Glass Box Transparency and the Optimizer's Multi-Agent Process

One of the main trust failures in AI products is that they ask for consequential inputs, do a lot of hidden work, and then hand back a result as if the user should accept it on faith.

That is especially bad in a job-search product.

If an AI is rewriting your resume, inferring missing skills, ranking the quality of an application, or classifying recruiter mail, the user should be able to see enough of the process to understand what the system is doing without me pretending to expose some sacred internal chain of thought.

That is what the Glass Box system in P.A.T.H.O.S. is for.

The problem

Most products in this category make one of two mistakes.

The first is total opacity. You get a score, a rewrite, or a status update with almost no explanation.

The second is fake transparency. You get a stream of generic AI-sounding narration that feels detailed until you try to use it to answer a real question like:

Why did this score change?
Why did the system decide this skill was safe to add?
Why was this email classified as a rejection?
Why is the product talking to me in this tone right now?

Those are not all the same question, so they should not all be answered by the same interface.

The Glass Box idea

I ended up treating transparency as a product surface, not a debug console.

P.A.T.H.O.S. has three distinct places where the user is asked to trust machine judgment:

the persona layer
the optimizer
the inbound intelligence layer

So the Glass Box architecture exposes three different kinds of state:

ambient persona telemetry
task-process telemetry for optimization
evidence-and-confidence telemetry for inbound classification

That separation matters. A single giant stream would have looked impressive and been much less useful.

The optimizer is where Glass Box matters most

The most important part of the system is the optimizer, because it is the place where the app is making the most direct changes to a user's application materials.

That work is not one model call. It is a staged process with different kinds of reasoning inside it:

deterministic parsing of the job description
keyword weighting and structural analysis
profile and voice inspection
gap analysis on what is present, missing, or safely inferable
rewrite generation
post-generation validation
scoring, review, and save gating

Calling that a "multi-agent process" is useful as long as the term stays honest. I do not mean a swarm of autonomous agents doing mysterious emergent cognition. I mean the optimizer is organized as multiple role-like passes with different responsibilities and different failure modes.

One part is trying to understand the job. One part is trying to preserve the user's voice. One part is trying to improve alignment. One part is trying to catch overreach after generation. Those are distinct operations even if they are coordinated inside one user flow.

That is exactly why the optimizer has its own Glass Box.

What the optimizer Glass Box actually does

The optimizer Glass Box is there to expose process, not secrets.

When a run starts, the user should be able to see, at a useful level:

that the profile loaded correctly
that the system extracted requirements from the JD
that it found likely gaps or transferable skill adjacencies
that it is moving from analysis into generation
that guardrails and review checks are still active after generation
that the result was scored, checked, and either cleared or held for review

That does two important things.

First, it makes the output easier to trust. If the ATS score moves, the user can see that the system actually went through analysis and verification instead of generating a prettier document and inventing a number.

Second, it makes the product debuggable. If a run stalls, strips something important, surfaces a review gate, or produces an obviously weak result, the user is not staring at a black box.

The right transparency layer gives the user answers to:

what stage are we in
what kind of work is happening
where did things go wrong
what can I still intervene on

That is enough. Anything beyond that quickly becomes copy-paste implementation detail or explanation theater.

Why I split standard mode and Advanced Mode

One reason the Glass Box survived so many iterations is that it solved two different UX problems at once.

Standard mode needs to feel fast and legible. Most users do not want to supervise every intermediate judgment.

Advanced Mode is for the opposite case. That is where I let more of the intermediate review surface show up, especially around the optimizer's skill and quality checks. The point is not to make the product feel more technical. The point is to give a user a place to intervene before the final artifact becomes the only thing left to inspect.

That distinction ended up being more important than any specific UI treatment. Transparency is helpful only if it is calibrated to how much control the user actually wants in that moment.

The other two Glass Boxes

The optimizer is the centerpiece, but the other two surfaces matter because they solve different trust problems.

1. Persona telemetry

The global eye is not there just to make the app feel alive. It exposes the companion layer as a system with state rather than as a random voice generator.

That surface is intentionally lightweight. It is closer to ambient telemetry than to a terminal. It can show that the system is speaking, generating, shifting tone, or surfacing a symbolic rationale without dumping implementation detail into the shell of the app.

That makes the persona feel inspectable without turning it into a wall of logs.

2. Inbound intelligence telemetry

The email classification surface has a different job entirely.

There the important questions are:

what category did the system assign
how confident was it
what evidence supported that decision
did it auto-apply anything
can the user override or undo it

That is why the inbound panel is less about narration and more about confidence, evidence, and reversibility.

If the optimizer Glass Box is about process, the intel Glass Box is about supervision.

What I was trying not to do

There were a few failure modes I wanted to avoid from the start.

Fake reasoning

I did not want a panel full of AI-sounding filler that creates the vibe of intelligence without telling the user anything actionable.

Raw implementation leakage

I also did not want to dump enough internal detail that the product note becomes a blueprint. There is a difference between explaining how a system thinks at a product level and handing over the wiring diagram.

One-size-fits-all transparency

The persona layer, the optimizer, and the inbound classifier do not deserve the same explanation style. Treating them as one surface would have made all three worse.

Why the multi-agent framing held up

The reason the optimizer paper needed its own Glass Box framing is that the optimizer is where most AI resume tools become least trustworthy.

A black-box optimizer can rewrite aggressively, over-infer, flatten voice, and then present the whole thing as improvement.

A Glass Box optimizer has to behave differently. It has to admit that the work is staged. It has to show that there are multiple kinds of reasoning involved. It has to preserve a visible boundary between:

analysis
inference
generation
verification

That boundary is what makes "multi-agent" meaningful here. Not because the architecture is flashy, but because the user can see that different parts of the system are doing different jobs and can fail in different ways.

That is much closer to how a careful human workflow works anyway.

Tradeoffs

This approach is better than opacity, but it comes with obvious tradeoffs.

Transparency creates maintenance work

The Glass Box has to stay aligned with the actual system. If the pipeline changes and the transparency layer does not, the product starts lying.

Too much detail becomes noise

The hardest part is deciding what not to show. A bad Glass Box can overwhelm the user just as effectively as a black box can confuse them.

Confidence language can still be over-read

Even good telemetry can accidentally sound more certain than it is. That is why I care so much about keeping review, override, and undo paths attached to the surfaces where model judgment actually matters.

What I would improve next

make the optimizer Glass Box better at surfacing why a score moved, not just that it moved
attribute more optimizer stages to functional roles without turning the interface into internal jargon
let the inbound layer summarize patterns across multiple messages instead of only explaining one event at a time
make the persona layer expose more trigger context while still staying compact

The core idea would stay the same, though.

The right way to build transparency here was not to reveal everything. It was to expose just enough of each system's working state that the user can understand the basis for action, challenge it when needed, and keep trusting the product when it gets things right.

Related paper

The broader product architecture is in How I built P.A.T.H.O.S..

Running it

P.A.T.H.O.S. is live at yourpathos.app.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Glass Box Transparency and the Optimizer's Multi-Agent Process

The problem

The Glass Box idea

The optimizer is where Glass Box matters most

What the optimizer Glass Box actually does

Why I split standard mode and Advanced Mode

The other two Glass Boxes

1. Persona telemetry

2. Inbound intelligence telemetry

What I was trying not to do

Fake reasoning

Raw implementation leakage

One-size-fits-all transparency

Why the multi-agent framing held up

Tradeoffs

Transparency creates maintenance work

Too much detail becomes noise

Confidence language can still be over-read

What I would improve next

Related paper

Running it

FilesExpand file tree

glass-box-transparency

Directory actions

More options

Directory actions

More options

Latest commit

History

glass-box-transparency

Folders and files

parent directory

README.md

Glass Box Transparency and the Optimizer's Multi-Agent Process

The problem

The Glass Box idea

The optimizer is where Glass Box matters most

What the optimizer Glass Box actually does

Why I split standard mode and Advanced Mode

The other two Glass Boxes

1. Persona telemetry

2. Inbound intelligence telemetry

What I was trying not to do

Fake reasoning

Raw implementation leakage

One-size-fits-all transparency

Why the multi-agent framing held up

Tradeoffs

Transparency creates maintenance work

Too much detail becomes noise

Confidence language can still be over-read

What I would improve next

Related paper

Running it