mssm-sftechstack
diff --git a/README.md b/README.md
Lines changed: 69 additions & 120 deletions
diff --git a/__pycache__/main.cpython-314.pyc b/__pycache__/main.cpython-314.pyc
18.4 KB
@@ -4,16 +4,16 @@
 ![Python 3.9+](https://img.shields.io/badge/python-3.9%2B-blue)
 ![License: MIT](https://img.shields.io/badge/license-MIT-green)
 
-Developers waste hours manually correlating Salesforce deployment logs, coverage reports, and PMD violations across separate tools to find a root cause that should take minutes to identify.
-This tool feeds those signals into Claude simultaneously and returns a ranked diagnosis — what broke, which component caused it, and what to fix first — in seconds.
+When a Salesforce deployment fails, this tool identifies the root cause in seconds — not hours.
+It feeds deployment errors, coverage data, and PMD violations into Claude and returns a ranked diagnosis: exactly what broke, which component caused it, and what to fix first.
 
-> **Why this matters:** There are over 150,000 active Salesforce customer organisations worldwide. Every org doing custom Apex development faces this exact debugging pattern when a deployment fails. Salesforce's own CLI tools — `sf project deploy`, `sf scanner` — produce structured JSON output that no open source tool currently cross-correlates into a ranked diagnosis. This project fills that gap using Claude's reasoning to turn three disconnected outputs into a single, actionable result. There is no equivalent open source tool in the Salesforce ecosystem today.
+> Salesforce's CLI tools produce structured JSON output that no open-source tool currently cross-correlates into a ranked diagnosis. This project fills that gap. There is no equivalent open-source tool in the Salesforce ecosystem today.
 
 ---
 
 ## Problem
 
-When a Salesforce deployment fails, engineers read raw logs, pull coverage reports, and run PMD scans — then manually piece together why it broke across three disconnected tools.
+When a deployment fails, engineers read raw logs, pull coverage reports, and run PMD scans — then manually piece together why it broke across three disconnected tools.
 
 - Deployment logs show *what* failed, not *why*
 - Coverage gaps and code violations have no visible connection to the error
@@ -23,38 +23,57 @@ When a Salesforce deployment fails, engineers read raw logs, pull coverage repor
 
 ## Why Claude
 
-PMD flags a violation. The coverage tool reports 62%. The deployment log shows a `NullPointerException` in `OpportunityService`. Three tools, three outputs — none of them tell you these signals share a single root cause.
+Salesforce CLI, PMD, and coverage tools each report one signal in isolation — they show *what* happened, not *why*, and have no awareness of each other.
+Claude reads all three signals together: when a `NullPointerException` in `OpportunityService` coincides with low coverage and critical PMD violations, it identifies a single `@TestSetup` gap as the common cause — not three separate problems requiring three separate fixes.
+The result is a specific root cause per component and a P0-ranked fix list — without opening a single log file.
 
-Claude reasons across all three inputs simultaneously and identifies the causal structure: the `@TestSetup` gap caused the exception, the exception suppressed test execution, and suppressed tests pulled coverage below the deployment threshold. That is not summarization — it is cross-signal reasoning that a rule-based tool cannot perform.
+---
 
-Two properties make Claude specifically well-suited for this:
+## Quick Start
 
-- **Structured output reliability.** The tool depends on Claude returning valid JSON that conforms to a strict schema on every call. Claude follows schema and formatting instructions precisely enough to be used in a pipeline — where a malformed response is a hard failure, not a warning.
-- **Apex domain knowledge.** Claude accurately identifies Salesforce-specific patterns — safe navigation (`?.`), `@TestSetup` data gaps, governor limit causes — without domain-specific fine-tuning. This means the tool works on real org failures out of the box.
+**No API key needed — run a preset scenario instantly:**
 
-The result is a risk score (0–10) and a P0-ranked fix list. Engineers know within seconds whether a deployment is blocked, which component caused it, and what to fix first — without reading a single log line.
+```bash
+python main.py 1            # Failure  — risk score 7 🔴
+python main.py 2            # Medium   — risk score 3 🟡
+python main.py 3            # Healthy  — risk score 0 🟢
+```
 
----
+**Live mode — send your own data to Claude:**
 
-## How It Reduces Debugging Time
+```bash
+pip install -r requirements-live.txt
 
-Instead of opening three tools and manually correlating their outputs, engineers submit one JSON payload and receive a ranked diagnosis — specific component, technical cause, and exact fix sequence.
-Claude surfaces the causal chain behind the failure, not a list of symptoms, so the P0 fix is clear before any code is opened.
-A 0–10 risk score tells the team immediately whether the deployment can proceed or is actively blocked, eliminating the manual triage step entirely.
+# Windows
+set ANTHROPIC_API_KEY=your_key_here
+# Mac / Linux
+export ANTHROPIC_API_KEY=your_key_here
+
+python main.py --input mydata.json --live
+```
+
+**End-to-end with a real Salesforce org:**
+
+```bash
+sf project deploy start --json > deploy_result.json
+sf scanner run --json > pmd_result.json
+python parse_deployment.py --deploy deploy_result.json --pmd pmd_result.json --out input.json
+python main.py --input input.json --live
+```
 
 ---
 
 ## What It Produces
 
-Input schema accepted by the tool:
+**Input** — three fields from your Salesforce org:
 
 | Field | Type | Description |
 |---|---|---|
 | `code_coverage` | `float` | Org-level Apex code coverage percentage |
 | `failed_deployments` | `list` | Each item: `component`, `error`, `failed_tests` count |
 | `code_quality_issues` | `dict` | `pmd_violations` (total) and `critical` (severity 1–2) |
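
Reviewer note: the input table above can be checked mechanically before a payload is sent anywhere. The following is a minimal sketch only, not the tool's actual validation code. The function name `validate_input` and the 0 to 100 range check on `code_coverage` are assumptions for illustration; the field names come from the table.

```python
from numbers import Real

# Keys each failed_deployments item is expected to carry, per the input table.
REQUIRED_ITEM_KEYS = ("component", "error", "failed_tests")


def validate_input(payload):
    """Return a list of problems found in the payload (empty list means valid).

    Hypothetical helper: illustrates the input contract, not the tool's own code.
    """
    problems = []

    coverage = payload.get("code_coverage")
    # bool is a subclass of int, so it is excluded explicitly.
    if isinstance(coverage, bool) or not isinstance(coverage, Real):
        problems.append("code_coverage must be a number")
    elif not 0 <= coverage <= 100:  # assumed range for a percentage
        problems.append("code_coverage must be between 0 and 100")

    failures = payload.get("failed_deployments")
    if not isinstance(failures, list):
        problems.append("failed_deployments must be a list")
    else:
        for index, item in enumerate(failures):
            if not isinstance(item, dict):
                problems.append(f"failed_deployments[{index}] must be an object")
                continue
            for key in REQUIRED_ITEM_KEYS:
                if key not in item:
                    problems.append(f"failed_deployments[{index}] is missing '{key}'")

    quality = payload.get("code_quality_issues")
    if not isinstance(quality, dict):
        problems.append("code_quality_issues must be a dict")
    else:
        for key in ("pmd_violations", "critical"):
            if key not in quality:
                problems.append(f"code_quality_issues is missing '{key}'")

    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every schema issue in a single pass.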
 
-Output returned by Claude:
+**Output** — returned by Claude:
 
 | Field | Description |
 |---|---|
@@ -68,68 +87,25 @@ Output returned by Claude:
 
 ## Risk Scoring
 
-Weights reflect Salesforce deployment reality. Coverage below 75% is a hard platform blocker with no override — so it carries the highest weight. Active runtime failures block the pipeline immediately. PMD critical violations are serious but do not prevent deployment on their own.
+Coverage below 75% is a hard Salesforce platform blocker with no override — so it carries the highest weight. Active runtime failures block the pipeline immediately. Critical PMD violations are serious but do not prevent deployment on their own.
 
 | Dimension | Max Points | Trigger |
 |---|---|---|
-| Code coverage | +4 | < 75% — hard Salesforce deployment blocker, no override possible |
-| Deployment failures | +3 | Runtime errors (+2) and failed tests (+1) stack — pipeline cannot proceed |
-| Critical PMD violations | +2 | SOQL in loops, hardcoded IDs — governor limit failures under production load |
+| Code coverage | +4 | < 75% — no override possible |
+| Deployment failures | +3 | Runtime errors (+2) and failed tests (+1) stack |
+| Critical PMD violations | +2 | SOQL in loops, hardcoded IDs — governor limit risk |
 
 | Score | Risk Level |
 |---|---|
 | 0–2 | 🟢 Low — deployment can proceed |
-| 3–5 | 🟡 Medium — quality issues present, no blockers |
-| 6–10 | 🔴 High — at least one active deployment blocker |
-
----
-
-## Requirements
-
-- Python 3.9 or higher
-- No dependencies for mocked mode (Python standard library only)
-- `anthropic` package required for `--live` mode: `pip install -r requirements-live.txt`
-- Salesforce CLI (`sf`) required only for end-to-end org integration
+| 3–5 | 🟡 Medium — quality issues, no blockers |
+| 6–10 | 🔴 High — at least one active blocker |
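
Reviewer note: the score-to-level table above maps directly to a small lookup. This sketch covers only the banding from a finished score to a risk level. It does not attempt to reproduce how points are awarded, because the README gives maximum points per dimension but not the partial-point rules. The function name `risk_level` is hypothetical.

```python
def risk_level(score):
    """Map a 0-10 risk score to the level named in the README table.

    Hypothetical helper: the band boundaries come from the table,
    the function itself is not taken from main.py.
    """
    if isinstance(score, bool) or not isinstance(score, int):
        raise ValueError("score must be an integer")
    if not 0 <= score <= 10:
        raise ValueError("score must be between 0 and 10")
    if score <= 2:
        return "Low"      # deployment can proceed
    if score <= 5:
        return "Medium"   # quality issues, no blockers
    return "High"         # at least one active blocker
```

With the three preset scenarios, scores 7, 3, and 0 fall into High, Medium, and Low respectively, which matches the colours shown in Quick Start.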
 
 ---
 
-## Running the Tool
-
-**Mocked mode** — no API key needed, uses pre-generated sample outputs for reproducible demos:
-
-```bash
-python main.py              # Interactive menu
-python main.py 1            # Failure  — risk score 7 🔴
-python main.py 2            # Medium   — risk score 3 🟡
-python main.py 3            # Healthy  — risk score 0 🟢
-```
-
-**Live mode** — sends your data to Claude and returns a real-time diagnosis:
-
-```bash
-pip install -r requirements-live.txt
-
-# Windows
-set ANTHROPIC_API_KEY=your_key_here
-# Mac / Linux
-export ANTHROPIC_API_KEY=your_key_here
+## Example: Debugging a Failed Salesforce Deployment
 
-python main.py 1 --live                   # Preset scenario via Claude
-python main.py --input mydata.json --live  # Your own JSON via Claude
-```
-
-**End-to-end with a real Salesforce org:**
-
-```bash
-sf project deploy start --json > deploy_result.json
-sf scanner run --json > pmd_result.json
-python parse_deployment.py --deploy deploy_result.json --pmd pmd_result.json --out input.json
-python main.py --input input.json --live
-```
-
----
-
-## Example: Failed Deployment — NullPointerException
+A deployment fails during a sprint release. The engineer has three data points: coverage is below threshold, `OpportunityService` threw an exception, and the static analyser flagged violations.
 
 ### Input
 
@@ -150,7 +126,7 @@ python main.py --input input.json --live
 }
 ```
 
-### Output (sample generated by Claude using the tool's prompt)
+### Output
 
 ```json
 {
@@ -163,67 +139,52 @@ python main.py --input input.json --live
       "component": "Overall Org"
     },
     {
-      "issue": "NullPointerException in OpportunityService blocking all deployment",
+      "issue": "NullPointerException in OpportunityService — all 3 dependent test methods failing, pipeline blocked",
       "severity": "Critical",
       "component": "OpportunityService"
     },
     {
-      "issue": "3 test methods failing — pipeline cannot proceed until resolved",
+      "issue": "15 critical PMD violations — SOQL-in-loop patterns will trigger governor limit failures under production load",
       "severity": "High",
       "component": "OpportunityService"
-    },
-    {
-      "issue": "15 critical PMD violations flagged — SOQL-in-loop patterns risk governor limit failures",
-      "severity": "High",
-      "component": "Overall Org"
     }
   ],
   "root_causes": [
     {
-      "cause": "OpportunityService accesses a relationship field (e.g. Opportunity.Account.Name) without a null-guard — the field is null because @TestSetup does not insert a parent Account before creating the Opportunity",
+      "cause": "OpportunityService calls opp.Account.Name without a null-guard — Account is null because @TestSetup creates Opportunity records without inserting a parent Account first",
       "component": "OpportunityService"
     },
     {
-      "cause": "@TestSetup inserts Opportunity records without a related Account — all methods that traverse the Account relationship encounter null at runtime, causing both the exception and the test failures",
-      "component": "OpportunityService"
-    },
-    {
-      "cause": "Coverage at 62% is a direct consequence of the test failures — the same @TestSetup gap suppresses coverage for every OpportunityService method that depends on related data",
+      "cause": "The same @TestSetup gap suppresses execution of every method that traverses the Account relationship — this is why coverage dropped to 62%, not a separate coverage problem",
       "component": "Overall Org"
     }
   ],
   "recommendations": [
     {
-      "action": "Add null-guard in OpportunityService before traversing Account relationship — use safe navigation operator (?.) on all lookup fields: opp.Account?.Name instead of opp.Account.Name",
+      "action": "In OpportunityService, replace opp.Account.Name with opp.Account?.Name — the safe navigation operator prevents the NullPointerException when Account is not loaded",
       "priority": "P0 - Immediate"
     },
     {
-      "action": "Fix @TestSetup in OpportunityServiceTest — insert Account record first, then create Opportunity with AccountId populated before any test method runs",
+      "action": "In OpportunityServiceTest @TestSetup, insert an Account record and set AccountId on each Opportunity before any test method runs — this unblocks all 3 failing tests and recovers coverage above 75% automatically",
       "priority": "P0 - Immediate"
     },
     {
-      "action": "After null-guard and @TestSetup are fixed, re-run deployment — coverage should recover above 75% automatically as the suppressed test paths now execute",
-      "priority": "P1 - High"
-    },
-    {
-      "action": "Run 'sf scanner run --category Design,Security' on OpportunityService — resolve SOQL-in-loop violations before next production release to avoid governor limit failures under load",
+      "action": "Run 'sf scanner run --category Performance --target force-app/main/default/classes/OpportunityService.cls' and move all SOQL calls outside loop bodies before the next production release",
       "priority": "P1 - High"
     }
   ]
 }
 ```
 
-**What went wrong:** Coverage at 62% and a `NullPointerException` in `OpportunityService` appear to be two independent blockers. The analysis identifies they share one root cause: a `@TestSetup` gap (missing parent `Account`) that causes the exception, fails the tests, and suppresses coverage for every related method.
-
-**What the analysis reveals beyond the raw input:** The input states `"error": "NullPointerException"` and `"failed_tests": 3`. Nothing in the input mentions `@TestSetup`, parent records, or the Account relationship. Claude identified the likely test data gap, the specific Apex anti-pattern (unsafe relationship traversal), and that the coverage failure is a *symptom* of the exception — not a separate problem. This is a sample output generated by Claude using the same structured prompt the tool sends in `--live` mode.
-
-**Fix first:** Add the null-guard and fix `@TestSetup` in one commit. Fixing them together recovers coverage automatically — no additional test writing required.
+- **One root cause, three symptoms.** A single `@TestSetup` gap caused the exception, the 3 test failures, and the coverage drop — one fix resolves all of them.
+- **Exact fix, not general advice.** P0 recommendations name the specific call to change (`opp.Account.Name` → `opp.Account?.Name`) and exactly what to insert in the test setup.
+- **Prioritised.** Two P0 actions unblock the deployment. The P1 SOQL fix is scoped to a specific file and command.
 
 ---
 
 ## Example: Multi-Component Failure — Governor Limit
 
-This example shows the tool handling a different failure class: a trigger hitting SOQL limits under load, with two components affected simultaneously.
+A trigger hitting SOQL limits under load, with two components affected simultaneously.
 
 ### Input
 
@@ -249,7 +210,7 @@ This example shows the tool handling a different failure class: a trigger hittin
 }
 ```
 
-### Output (sample generated by Claude using the tool's prompt)
+### Output
 
 ```json
 {
@@ -312,17 +273,7 @@ This example shows the tool handling a different failure class: a trigger hittin
 }
 ```
 
-**What the analysis reveals:** The input only lists two error strings and violation counts. Claude connected the `LimitException` to the SOQL-in-loop PMD violations — identifying the critical violations as the likely *cause* of the runtime error, not a separate issue. It also identified that the two failures independently suppress coverage, meaning both must be fixed before coverage recovers.
-
----
-
-## Scenarios at a Glance
-
-| # | Scenario | Coverage | Failures | PMD | Risk Score |
-|---|---|---|---|---|---|
-| 1 | Failure | 62% | `NullPointerException` in `OpportunityService` | 120 (15 critical) | 7 🔴 |
-| 2 | Medium | 75% | None | 40 (5 critical) | 3 🟡 |
-| 3 | Healthy | 90% | None | 5 (0 critical) | 0 🟢 |
+Claude connected the `LimitException` to the SOQL-in-loop PMD violations — identifying the critical violations as the *cause* of the runtime error, not a separate issue. The two failures independently suppress coverage, so both must be fixed before coverage recovers.
 
 ---
 
@@ -344,17 +295,15 @@ salesforce-devops-ai-assistant/
 
 ## Claude Integration
 
-In `--live` mode, `main.py` sends a structured prompt to the Anthropic API (`claude-sonnet-4-6`) containing the DevOps metrics and a strict JSON schema contract. Claude returns a ranked diagnosis — risks, root causes, and prioritised recommendations — which is validated against the expected schema before display. If the response violates the schema, the tool exits with a clear error rather than silently surfacing bad output.
+In `--live` mode, `main.py` sends a structured prompt to `claude-sonnet-4-6` with the DevOps metrics and a strict JSON schema. The response is validated against the schema before display — if Claude returns a malformed response, the tool exits with a clear error rather than silently surfacing bad output.
 
-The three preset scenarios ship with pre-generated outputs to support reproducible demos and offline testing — the same prompt and schema were used to generate them. Mocked and live modes share an identical input/output contract; switching between them requires only the `--live` flag.
+The preset scenarios ship with pre-generated outputs for reproducible demos and offline testing. Mocked and live modes share an identical input/output contract; switching requires only the `--live` flag.
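
Reviewer note: the validate-then-fail behaviour described above could look roughly like the sketch below. It is an illustration under stated assumptions, not the code in `main.py`. The name `parse_diagnosis` is hypothetical, and only the two sections whose keys are visible in the example output (`root_causes` and `recommendations`) are checked; a full validator would presumably also cover the remaining output fields.

```python
import json

# Per-item keys taken from the example output in this README.
EXPECTED_ITEM_KEYS = {
    "root_causes": ("cause", "component"),
    "recommendations": ("action", "priority"),
}


def parse_diagnosis(raw_text):
    """Parse a model response and raise ValueError if it breaks the contract.

    Hypothetical helper: shows the fail-fast idea, not the tool's real validator.
    """
    try:
        data = json.loads(raw_text)
    except json.JSONDecodeError as exc:
        raise ValueError(f"response is not valid JSON: {exc}") from exc

    if not isinstance(data, dict):
        raise ValueError("response must be a JSON object")

    for section, keys in EXPECTED_ITEM_KEYS.items():
        items = data.get(section)
        if not isinstance(items, list):
            raise ValueError(f"'{section}' must be a list")
        for index, item in enumerate(items):
            if not isinstance(item, dict):
                raise ValueError(f"{section}[{index}] must be an object")
            missing = [key for key in keys if key not in item]
            if missing:
                raise ValueError(f"{section}[{index}] is missing {missing}")

    return data
```

A caller can catch `ValueError`, print the message, and exit non-zero, so a malformed response is never displayed as a diagnosis.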
 
 ---
 
 ## Contributing
 
-Contributions are welcome. See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines on how to add new scenarios, extend the input schema, or improve the prompt.
-
-To run the tests locally:
+Contributions are welcome. See [CONTRIBUTING.md](CONTRIBUTING.md) for how to add scenarios, extend the schema, or improve the prompt.
 
 ```bash
 pip install pytest
@@ -365,16 +314,16 @@ python -m pytest tests/ -v
 
 ## Roadmap
 
-**Near-term (small, shippable)**
-- `--output` flag to write the JSON result to a file for CI pipeline integration
-- `--quiet` flag for machine-readable output (JSON only, no formatted display)
-- GitHub Actions example workflow showing end-to-end org analysis on deployment failure
+**Near-term**
+- `--output` flag to write JSON results to a file for CI pipeline integration
+- `--quiet` flag for machine-readable output (JSON only, no display formatting)
+- GitHub Actions example workflow for end-to-end org analysis on deployment failure
 
 **Longer-term**
-- **Apex stack trace parsing** — Accept raw `sf project deploy --json` exception stacks directly; extract line numbers and call chains for line-level diagnosis
-- **Historical diffing** — Compare risk scores across consecutive deployments to surface regressions before they become blockers
-- **Multi-component correlation** — Identify when a failure in one class cascades coverage loss across dependent classes
-- **Slack / Teams alerts** — Push P0 recommendations to engineering channels immediately on detection
+- **Apex stack trace parsing** — accept raw `sf project deploy --json` exception stacks for line-level diagnosis
+- **Historical diffing** — compare risk scores across deployments to surface regressions early
+- **Native CI/CD integration** — GitHub Actions, Jenkins, and Copado pipeline hooks
+- **Slack / Teams alerts** — push P0 recommendations to engineering channels on detection
 
 ---