aws-solutions-library-samples
diff --git a/CHANGELOG.md b/CHANGELOG.md
Lines changed: 23 additions & 0 deletions
diff --git a/Makefile b/Makefile
Lines changed: 84 additions & 5 deletions
diff --git a/README.md b/README.md
Lines changed: 1 addition & 0 deletions
diff --git a/VERSION b/VERSION
Lines changed: 1 addition & 1 deletion
diff --git a/config_library/unified/ds11-passport-application/README.md b/config_library/unified/ds11-passport-application/README.md
Lines changed: 140 additions & 0 deletions
@@ -4,7 +4,30 @@ SPDX-License-Identifier: MIT-0
 # Changelog
 
 ## [Unreleased]
+  
+## [0.5.8]
+
+### Added
+
+- **Excluded-class feature — skip static instruction / legal / boilerplate pages** — Government forms and similar packages often bundle static informational pages (legal warnings, fee instructions, tax notices, oaths) alongside the pages that carry applicant data. Mark a document class with `x-aws-idp-exclude-from-processing: true` and all downstream stages (extraction, assessment, summarization, rule validation, evaluation) skip sections classified as that class — making **zero LLM calls** on boilerplate pages.
+  - Optional `x-aws-idp-exclusion-reason` ("instructions", "legal", "cover-page", …) surfaces as a grey **`Skipped: <reason>`** badge in the UI Sections panel and as an **"Excluded Sections (Not Evaluated)"** table in the evaluation markdown report.
+  - Configurable via the **UI Configuration Editor** → Document Schema → select a document-type class → "Exclude from Processing" checkbox + "Exclusion Reason" input.
+  - New end-to-end sample config at `config_library/unified/ds11-passport-application/` with a matching DS-11 U.S. Passport Application PDF fixture and a standalone demo notebook (`notebooks/usecase-specific-examples/ds11-passport-application/`).
+  - Additive: classes without the new flag behave exactly as before.
+  - See `docs/classification.md#excluding-static-pages-eg-instructions-legal-boilerplate`.
+
+### Changed
 
+- **UI dependency cleanup — eliminated 11 of 12 npm deprecation warnings** — Replaced deprecated `@aws-sdk/*` packages with `@smithy/*` equivalents, removed unused Babel plugins, migrated ESLint 8→9 (flat config), upgraded Prettier 2→3, and upgraded jsdom 26→29. Added `"type": "module"` to `package.json`. Also added `caughtErrors: 'none'` to ESLint config to stop flagging unused catch clause variables. Added `FORCE=1` arg to `make ui-lint` to force re-run despite checksum match.
+
+- **Headless deployment documentation generalized** — headless mode is no longer documented as a GovCloud-only capability. New `docs/headless-deployment.md` is the canonical guide to headless deployment in both Commercial and GovCloud regions. It covers the main use cases: API-only / pipeline integrations, organizational restrictions on UI-layer services, and cost optimization. Headless mode remains required for GovCloud.
+
+## Templates
+   - us-west-2: `https://s3.us-west-2.amazonaws.com/aws-ml-blog-us-west-2/artifacts/genai-idp/idp-main_0.5.8.yaml`
+   - us-east-1: `https://s3.us-east-1.amazonaws.com/aws-ml-blog-us-east-1/artifacts/genai-idp/idp-main_0.5.8.yaml`
+   - eu-central-1: `https://s3.eu-central-1.amazonaws.com/aws-ml-blog-eu-central-1/artifacts/genai-idp/idp-main_0.5.8.yaml`
+  
+  
 ## [0.5.7]
 
 ### Added
 
@@ -21,6 +21,10 @@ else
   PIP := $(CURDIR)/$(VENV_DIR)/bin/pip
 endif
 
+# idp-cli invocation — uses `python -m idp_cli.cli` so it works whether or not
+# the virtualenv is activated (picks up $(PYTHON) which prefers .venv).
+IDP_CLI := $(PYTHON) -m idp_cli.cli
+
 ##@ General
 .PHONY: help
 help: ## Show this help message
@@ -101,7 +105,7 @@ setup-venv: ## Create .venv and install all packages into it
 	@echo -e "$(YELLOW)   To activate manually: source $(VENV_DIR)/bin/activate$(NC)"
 
 ##@ Code Quality
-lint: ruff-lint format check-arn-partitions validate-buildspec ui-lint codegen-check ## Run all linting (ruff, format, ARN checks, buildspec, UI, codegen)
+lint: ruff-lint format check-arn-partitions validate-buildspec ui-lint codegen-check ## Run all linting (ruff, format, ARN checks, buildspec, UI, codegen). Use FORCE=1 to force UI lint re-run despite checksum match.
 fastlint: ruff-lint format check-arn-partitions validate-buildspec ## Quick lint without UI checks
 
 ruff-lint: ## Run ruff linting with auto-fix
@@ -251,17 +255,21 @@ endif
 	@echo "Starting UI development server..."
 	cd src/ui && npm run start
 
-ui-lint: ## Run UI linting with checksum caching (skips if unchanged)
+ui-lint: ## Run UI linting with checksum caching (skips if unchanged). Use FORCE=1 to force re-run.
 	@echo "Checking if UI lint is needed..."
 	@CURRENT_HASH=$$($(PYTHON) -c "from publish import IDPPublisher; p = IDPPublisher(); print(p.get_directory_checksum('src/ui'))"); \
 	STORED_HASH=$$(test -f src/ui/.checksum && cat src/ui/.checksum || echo ""); \
-	if [ "$$CURRENT_HASH" != "$$STORED_HASH" ]; then \
-		echo "UI code checksum changed - running lint..."; \
+	if [ -n "$(FORCE)" ] || [ "$$CURRENT_HASH" != "$$STORED_HASH" ]; then \
+		if [ -n "$(FORCE)" ]; then \
+			echo "FORCE=1 set - running lint..."; \
+		else \
+			echo "UI code checksum changed - running lint..."; \
+		fi; \
 		cd src/ui && npm ci --prefer-offline --no-audit && npm run lint -- --fix && npm run typecheck || exit 1; \
 		echo "$$CURRENT_HASH" > .checksum; \
 		echo -e "$(GREEN)✅ UI lint and typecheck completed and checksum updated$(NC)"; \
 	else \
-		echo -e "$(GREEN)✅ UI code checksum unchanged - skipping lint$(NC)"; \
+		echo -e "$(GREEN)✅ UI code checksum unchanged - skipping lint (use FORCE=1 to force re-run)$(NC)"; \
 	fi
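
The gate in the `ui-lint` recipe above can be sketched in Python as follows. This is only an illustration of the decision logic: `directory_checksum` is a hypothetical stand-in for `IDPPublisher.get_directory_checksum` (whose real hashing and ignore rules are not shown in this diff), and the ignore set is an assumption.

```python
import hashlib
from pathlib import Path

# Assumed ignore set; the real checksum helper may exclude different paths.
IGNORED = frozenset({".checksum", "node_modules"})


def directory_checksum(root: Path) -> str:
    """Hash the relative path and contents of every file under root."""
    digest = hashlib.sha256()
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        rel = path.relative_to(root)
        if any(part in IGNORED for part in rel.parts):
            continue
        digest.update(rel.as_posix().encode())
        digest.update(path.read_bytes())
    return digest.hexdigest()


def lint_needed(root: Path, force: str = "") -> bool:
    """Mirror the shell test: FORCE is non-empty, or the hashes differ."""
    stored_file = root / ".checksum"
    stored = stored_file.read_text().strip() if stored_file.exists() else ""
    return bool(force) or directory_checksum(root) != stored
```

Note that the recipe tests `[ -n "$(FORCE)" ]`, so any non-empty value forces a re-run, including `FORCE=0`.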
 
 ui-build: ## Build UI for production
@@ -393,3 +401,74 @@ dsr-scan: ## Run DSR security scan
 dsr-fix: ## Run DSR interactive fix
 	@echo "Running DSR interactive fix..."
 	$(PYTHON) scripts/dsr/fix.py
+
+##@ Deploy
+# Thin wrappers around `idp-cli publish` / `deploy` / `delete` for the common
+# 80% case. Uncommon flags can still be passed via EXTRA_ARGS="--foo --bar".
+# See 'docs/idp-cli.md' (or 'idp-cli <cmd> --help') for the full option list.
+
+.PHONY: publish deploy delete-stack
+
+# Usage examples:
+#   make publish REGION=us-east-1
+#   make publish REGION=us-east-1 BUCKET_BASENAME=my-idp-artifacts PREFIX=v1
+#   make publish REGION=us-gov-west-1 HEADLESS=1
+#   make publish REGION=us-east-1 PUBLIC=1 EXTRA_ARGS="--clean-build --verbose"
+publish: ## Build & publish IDP artifacts to S3 (Usage: make publish REGION=... [BUCKET_BASENAME=...] [PREFIX=...] [HEADLESS=1] [PUBLIC=1] [EXTRA_ARGS=...])
+ifndef REGION
+	$(error REGION is not set. Usage: make publish REGION=us-east-1 [BUCKET_BASENAME=...] [PREFIX=...] [HEADLESS=1] [PUBLIC=1] [EXTRA_ARGS=...])
+endif
+	@echo -e "$(CYAN)Running idp-cli publish (region=$(REGION))...$(NC)"
+	$(IDP_CLI) publish \
+		--source-dir . \
+		--region $(REGION) \
+		$(if $(BUCKET_BASENAME),--bucket-basename $(BUCKET_BASENAME)) \
+		$(if $(PREFIX),--prefix $(PREFIX)) \
+		$(if $(HEADLESS),--headless) \
+		$(if $(PUBLIC),--public) \
+		$(EXTRA_ARGS)
+
+# Usage examples:
+#   make deploy STACK_NAME=my-idp ADMIN_EMAIL=me@example.com                 # create new stack
+#   make deploy STACK_NAME=my-idp                                             # update existing stack
+#   make deploy STACK_NAME=my-idp-dev ADMIN_EMAIL=me@example.com FROM_CODE=1  # build & deploy from local source
+#   make deploy STACK_NAME=my-idp ADMIN_EMAIL=me@example.com HEADLESS=1       # headless (no UI)
+#   make deploy STACK_NAME=my-idp CUSTOM_CONFIG=./my-config.yaml              # update config on existing stack
+#   make deploy STACK_NAME=my-idp NO_WAIT=1                                   # fire-and-forget (default is --wait)
+#   make deploy STACK_NAME=my-idp EXTRA_ARGS="--max-concurrent 200 --log-level DEBUG"
+deploy: ## Deploy/update IDP CloudFormation stack (Usage: make deploy STACK_NAME=... [ADMIN_EMAIL=...] [REGION=...] [FROM_CODE=1] [HEADLESS=1] [CUSTOM_CONFIG=...] [TEMPLATE_URL=...] [TEMPLATE_FILE=...] [NO_WAIT=1] [EXTRA_ARGS=...])
+ifndef STACK_NAME
+	$(error STACK_NAME is not set. Usage: make deploy STACK_NAME=my-stack [ADMIN_EMAIL=...] [REGION=...] [FROM_CODE=1] [HEADLESS=1] [CUSTOM_CONFIG=...] [NO_WAIT=1] [EXTRA_ARGS=...])
+endif
+	@echo -e "$(CYAN)Running idp-cli deploy (stack=$(STACK_NAME))...$(NC)"
+	$(IDP_CLI) deploy \
+		--stack-name $(STACK_NAME) \
+		$(if $(ADMIN_EMAIL),--admin-email $(ADMIN_EMAIL)) \
+		$(if $(REGION),--region $(REGION)) \
+		$(if $(FROM_CODE),--from-code .) \
+		$(if $(HEADLESS),--headless) \
+		$(if $(CUSTOM_CONFIG),--custom-config $(CUSTOM_CONFIG)) \
+		$(if $(TEMPLATE_URL),--template-url $(TEMPLATE_URL)) \
+		$(if $(TEMPLATE_FILE),--template-file $(TEMPLATE_FILE)) \
+		$(if $(NO_WAIT),,--wait) \
+		$(EXTRA_ARGS)
+
+# Usage examples:
+#   make delete-stack STACK_NAME=test-stack                                   # interactive
+#   make delete-stack STACK_NAME=test-stack FORCE=1                            # skip confirmation
+#   make delete-stack STACK_NAME=test-stack FORCE=1 EMPTY_BUCKETS=1            # empty buckets first
+#   make delete-stack STACK_NAME=test-stack FORCE=1 FORCE_DELETE_ALL=1         # comprehensive cleanup
+delete-stack: ## Delete an IDP CloudFormation stack (Usage: make delete-stack STACK_NAME=... [FORCE=1] [EMPTY_BUCKETS=1] [FORCE_DELETE_ALL=1] [REGION=...] [NO_WAIT=1] [EXTRA_ARGS=...])
+ifndef STACK_NAME
+	$(error STACK_NAME is not set. Usage: make delete-stack STACK_NAME=my-stack [FORCE=1] [EMPTY_BUCKETS=1] [FORCE_DELETE_ALL=1])
+endif
+	@echo -e "$(YELLOW)Running idp-cli delete (stack=$(STACK_NAME))...$(NC)"
+	$(IDP_CLI) delete \
+		--stack-name $(STACK_NAME) \
+		$(if $(FORCE),--force) \
+		$(if $(EMPTY_BUCKETS),--empty-buckets) \
+		$(if $(FORCE_DELETE_ALL),--force-delete-all) \
+		$(if $(REGION),--region $(REGION)) \
+		$(if $(NO_WAIT),,--wait) \
+		$(EXTRA_ARGS)
+
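
The `$(if ...)` wrappers in the `deploy` target translate Make variables into `idp-cli` flags. A minimal Python sketch of that mapping, for illustration only: the function `deploy_args` is hypothetical and not part of the repository, and the flag names are copied from the recipe above.

```python
def deploy_args(stack_name, admin_email="", region="", from_code="", headless="",
                custom_config="", template_url="", template_file="",
                no_wait="", extra_args=()):
    """Return the idp-cli argument list the `deploy` recipe would produce.

    Hypothetical helper; each parameter stands for the Make variable of the
    same name, and any non-empty string counts as "set".
    """
    args = ["deploy", "--stack-name", stack_name]
    if admin_email:
        args += ["--admin-email", admin_email]
    if region:
        args += ["--region", region]
    if from_code:
        args += ["--from-code", "."]  # the value of FROM_CODE itself is ignored
    if headless:
        args.append("--headless")
    if custom_config:
        args += ["--custom-config", custom_config]
    if template_url:
        args += ["--template-url", template_url]
    if template_file:
        args += ["--template-file", template_file]
    if not no_wait:
        args.append("--wait")  # $(if $(NO_WAIT),,--wait): waiting is the default
    args += list(extra_args)
    return args


print(deploy_args("my-idp", admin_email="me@example.com", headless="1"))
```

Because the Make conditionals only test for a non-empty value, `HEADLESS=0` still passes `--headless` and `NO_WAIT=0` still drops `--wait`; leave the variable unset to get the default behaviour.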
@@ -174,6 +174,7 @@ For detailed deployment and testing instructions, see the [Deployment Guide](./d
 - [Architecture](./docs/architecture.md) - Detailed component architecture and data flow
 - [Demo Videos](./docs/demo-videos.md) - Comprehensive collection of feature demonstration videos
 - [Deployment](./docs/deployment.md) - Build, publish, deploy, and test instructions
+- [Headless Deployment](./docs/headless-deployment.md) - Backend-only deployment (no UI/AppSync/Cognito/WAF) for API-only use cases; required for GovCloud
 - [IDP CLI](./docs/idp-cli.md) - Command line interface for batch processing, evaluation workflows, and interactive Agent Chat
 - [Web UI](./docs/web-ui.md) - Web interface features and usage
 - [Agent Analysis](./docs/agent-analysis.md) - Natural language analytics and data visualization feature
 
@@ -1 +1 @@
-0.5.7
+0.5.8
@@ -0,0 +1,140 @@
+# DS-11 U.S. Passport Application Sample
+
+This sample configuration demonstrates the **excluded-class feature** — a way
+to tell the IDP pipeline that a particular document class contains only
+static/boilerplate pages (instructions, legal warnings, cover pages, tax
+notices, etc.) and should be skipped during extraction, assessment,
+summarization, rule validation, and evaluation.
+
+## What it demonstrates
+
+`samples/DS11-USPassportApplication.pdf` is a 6-page U.S. State Department
+passport application form with the following pages:
+
+| Page | Content | Nature |
+|------|------------------------------------------------------|------------------|
+| 1 | WARNING: False statements… legal warning | Static legal |
+| 2 | Passport fee and payment instructions | Static instructions |
+| 3 | DS-11 FEDERAL TAX LAW (Section 6039E) notice | Static legal |
+| 4 | DS-11 ACTS OR CONDITIONS affidavit | Static oath |
+| 5 | APPLICATION FOR A U.S. PASSPORT (form front) | Dynamic form |
+| 6 | Travel Plans / Permanent Address (form back) | Dynamic form |
+
+This config is a **minimal override config** — it only declares `notes` +
+`classes`. All other settings (`classification:`, `extraction:`,
+`assessment:`, `summarization:`, `ocr:`, `evaluation:`) are inherited
+from the bundled system defaults via `merge_config_with_defaults()` at
+deploy time (production) or at notebook-load time (demos). You only
+need to declare the classes you care about.
+
+With this config:
+
+1. The classifier sees **two** classes, `PassportApplicationInstructions`
+   and `PassportApplication`.
+2. The **primary classification mechanism** is the LLM multimodal
+   page-level classifier: each page is sent (image + OCR text) to
+   Bedrock and the best-matching class is chosen using the class
+   `description` field. This is robust to form revisions, OCR quirks,
+   and wording differences.
+3. The **optional regex fast-path** on the excluded class
+   (`x-aws-idp-document-page-content-regex`) short-circuits pages whose
+   OCR text matches a known stable boilerplate phrase. If the regex
+   misses, the LLM still catches the page via the description. The
+   regex is narrowly scoped to a single conservative anchor; see the
+   comment in `config.yaml` for details.
+4. The document is segmented into two sections via the existing BIO-like
+   section-boundary logic. The classification service propagates the
+   `excluded` flag from the class config onto the `Section`.
+5. Downstream services (extraction, assessment, summarization, rule
+   validation) see `section.excluded == True` and **skip** those
+   sections. They still write a small `result.json` stub so the UI and
+   reporting database have something to show:
+
+   ```json
+   {
+     "status": "skipped_excluded_class",
+     "stage": "extraction",
+     "section_id": "1",
+     "classification": "PassportApplicationInstructions",
+     "excluded": true,
+     "exclusion_reason": "instructions",
+     "page_ids": ["1", "2", "3", "4"],
+     "message": "Section 1 classified as 'PassportApplicationInstructions' …"
+   }
+   ```
+
+6. The evaluation service filters excluded sections out of the
+   precision/recall/F1 calculation and appends an **Excluded Sections**
+   table to the markdown report so nothing is silently dropped.
+
+7. The UI renders excluded sections in the Sections panel with a grey
+   `Skipped: instructions` badge next to the class name.
+
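
Steps 4 to 7 can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline's implementation: `Section`, `run_stage`, and `split_for_evaluation` are hypothetical stand-ins, and only the stub fields mirror the JSON shown in step 5.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Section:
    """Hypothetical stand-in for the pipeline's Section model."""
    section_id: str
    classification: str
    page_ids: List[str]
    excluded: bool = False
    exclusion_reason: Optional[str] = None


def run_stage(section: Section, stage: str, process: Callable[[Section], dict]) -> dict:
    """Run `process` on the section, or return a stub without calling it."""
    if section.excluded:
        return {
            "status": "skipped_excluded_class",
            "stage": stage,
            "section_id": section.section_id,
            "classification": section.classification,
            "excluded": True,
            "exclusion_reason": section.exclusion_reason,
            "page_ids": section.page_ids,
        }
    return process(section)


def split_for_evaluation(sections: List[Section]) -> Tuple[List[Section], List[Section]]:
    """Separate sections that are scored from those only listed as excluded."""
    scored = [s for s in sections if not s.excluded]
    skipped = [s for s in sections if s.excluded]
    return scored, skipped
```

The point of the early return is that `process` (the step that would call the LLM) is never invoked for an excluded section, while the stub keeps the section visible to the UI and reports.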
+## How to try it
+
+### 1. As a library / test fixture
+
+```bash
+# From the repo root
+python -c "
+from idp_common.models import Document, Section
+from idp_common.section_exclusion import is_section_excluded, build_skipped_stub_result
+
+doc = Document(id='ds11-demo')
+sec = Section(
+    section_id='1',
+    classification='PassportApplicationInstructions',
+    page_ids=['1','2','3','4'],
+    excluded=True,
+    exclusion_reason='instructions',
+)
+assert is_section_excluded(sec)
+print(build_skipped_stub_result(doc, sec, stage='extraction'))
+"
+```
+
+### 2. In a live deployment
+
+1. Load this config into your stack:
+
+   ```bash
+   idp-cli configuration create \
+     --stack-name <your-stack> \
+     --version-name ds11 \\
+     --path config_library/unified/ds11-passport-application/config.yaml
+   idp-cli configuration activate --stack-name <your-stack> --version-name ds11
+   ```
+
+2. Upload `samples/DS11-USPassportApplication.pdf` through the web UI or
+   CLI, and inspect the resulting sections in the Sections panel — the
+   first section (pages 1–4) will display a **Skipped: instructions**
+   badge and the extraction/summary panels for that section will show
+   the skipped-stub message. Only the second section (pages 5–6) will
+   be extracted.
+
+## Key schema extensions
+
+Two new class-level extensions power the feature:
+
+| Key | Type | Meaning |
+|-----|------|---------|
+| `x-aws-idp-exclude-from-processing` | boolean | When `true`, downstream services skip sections classified as this class. |
+| `x-aws-idp-exclusion-reason` | string | Optional short reason (`"instructions"`, `"legal"`, `"cover-page"`) shown in UI badges and evaluation reports. |
+
+The existing
+`x-aws-idp-document-page-content-regex` extension is used as a fast path
+so the LLM doesn't have to classify boilerplate pages that clearly
+contain anchor phrases from the form template.
+
+## Notes & caveats
+
+- The regex fast path relies on OCR text being available. When OCR is
+  disabled (e.g. image-only mode), the LLM still recognizes
+  `PassportApplicationInstructions` visually thanks to the detailed
+  class `description`.
+- The `properties: {}` on the excluded class is intentional — there's
+  nothing to extract from boilerplate pages. The classifier doesn't
+  require properties.
+- Regex patterns can be tuned to match additional State Department
+  revisions of DS-11. The `(?is)` flags make matching case-insensitive
+  and tolerant of OCR line-break artefacts (`.` also matches newlines).
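
To make the effect of the `(?is)` inline flags concrete, here is a small sketch using Python's `re` module. The anchor pattern and the OCR text are made up for illustration; the sample's real pattern lives in `config.yaml` and is not reproduced here.

```python
import re

# Hypothetical anchor; not the pattern from config.yaml.
WITH_FLAGS = re.compile(r"(?is)warning:.{0,20}?false\s+statements")
# Same pattern with only (?i), to show what the `s` flag contributes.
WITHOUT_DOTALL = re.compile(r"(?i)warning:.{0,20}?false\s+statements")

# Made-up OCR output in which a line break landed right after the colon.
ocr_text = "WARNING:\nFalse statements made knowingly ..."


def matches(pattern: re.Pattern, text: str) -> bool:
    """Return True if the pattern occurs anywhere in the text."""
    return pattern.search(text) is not None


print(matches(WITH_FLAGS, ocr_text))      # True: `i` ignores case, `s` lets `.` cross the newline
print(matches(WITHOUT_DOTALL, ocr_text))  # False: `.` stops at the newline
```

Keeping the bounded `.{0,20}?` gap short is a deliberate choice in this sketch: with the `s` flag an unbounded `.*` could span the whole page and match unrelated text.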