Skip to content

Commit 784f6e8

Browse files
committed
Merge branch 'feature/standard-classes' into 'develop'
Standard Class Catalog See merge request genaiic-reusable-assets/engagement-artifacts/genaiic-idp-accelerator!573
2 parents f76e939 + b2df5d0 commit 784f6e8

16 files changed

Lines changed: 5576 additions & 38 deletions

File tree

CHANGELOG.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,8 @@ SPDX-License-Identifier: MIT-0
77

88
### Added
99

10+
- **Standard Class Catalog** — When adding a new document class in the Schema Builder, users can now choose between **Custom Class** (define from scratch) and **Standard Class** (import from a catalog of 35 pre-built document types). Standard classes are derived from AWS BDA standard blueprints and include common document types like Invoice, Receipt, W-2, Bank Statement, Payslip, US Driver License, US Passport, various tax forms (1040, 941, 940, W-9, 1098, 1099), insurance cards, birth/death/marriage certificates, and more. Each standard class comes with a complete extraction schema including attributes, descriptions, and nested types. Imported classes are fully editable. Run `make classes-from-bda` to refresh the catalog from the BDA API.
11+
1012
- **Documentation Site** — Added a hosted documentation site built with [Astro Starlight](https://starlight.astro.build/), auto-deployed to GitHub Pages. Provides full-text search (Pagefind), sidebar navigation organized by topic, dark/light mode, and a professional landing page — all sourced directly from the existing `docs/` markdown files with zero content duplication. Browse at [aws-solutions-library-samples.github.io/accelerated-intelligent-document-processing-on-aws](https://aws-solutions-library-samples.github.io/accelerated-intelligent-document-processing-on-aws/).
1113

1214
- **Discovery accessible from CLI and SDK** — Discovery can now be run programmatically via the IDP SDK (`client.discovery.run()`) and CLI (`idp-cli discover`), enabling users with many document classes to automate schema generation without the Web UI. Supports both modes: without ground truth (exploratory) and with ground truth (optimized). ([#228](https://github.com/aws-solutions-library-samples/accelerated-intelligent-document-processing-on-aws/issues/228))

Makefile

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -286,6 +286,13 @@ docs-deploy: docs-build
286286
cd docs-site && npx gh-pages -d dist --dotfiles
287287
@echo -e "$(GREEN)✅ Docs deployed to GitHub Pages!$(NC)"
288288

289+
# Generate standard class catalog from BDA standard blueprints
290+
# Fetches all AWS standard blueprints and converts them to IDP class schemas
291+
classes-from-bda:
292+
@echo "Generating standard class catalog from BDA standard blueprints..."
293+
python3 scripts/generate_standard_classes.py --region us-east-1 --output src/ui/src/data/standard-classes.json
294+
@echo -e "$(GREEN)✅ Standard class catalog updated! Review changes in src/ui/src/data/standard-classes.json$(NC)"
295+
289296
# DSR (Deliverable Security Review) targets
290297
dsr-setup:
291298
@echo "Setting up DSR tool..."

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.5.2-wip4
1+
0.5.2.4

docs/configuration.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ The GenAIIDP solution provides multiple configuration approaches to customize do
1515

1616
The web interface allows real-time configuration updates without stack redeployment:
1717

18-
- **Document Classes**: Define and modify document categories and their descriptions (using JSON Schema format)
18+
- **Document Classes**: Define and modify document categories and their descriptions (using JSON Schema format). Choose from **35+ pre-built standard classes** (Invoice, Receipt, W-2, Bank Statement, etc.) or create custom classes from scratch.
1919
- **Extraction Attributes**: Configure fields to extract for each document class (defined as JSON Schema properties)
2020
- **Few Shot Examples**: Upload and configure example documents to improve accuracy (supported in Pattern 2)
2121
- **Model Selection**: Choose between available Bedrock models for classification and extraction

docs/json-schema-migration.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -193,6 +193,9 @@ The web UI provides two ways to create/edit document schemas:
193193
1. **Schema Builder** - Visual editor with drag-and-drop interface
194194
- Navigate to Configuration → Document Schema tab
195195
- Click "Schema Builder" view
196+
- Click "Add Class" to choose between:
197+
- **Custom Class** — define your own class with custom fields
198+
- **Standard Class** — import from 35+ pre-built document types (Invoice, Receipt, W-2, Bank Statement, Payslip, Driver License, Passport, tax forms, insurance cards, certificates, and more) derived from AWS BDA standard blueprints. Imported classes are fully editable.
196199
- Add/edit document types and properties visually
197200

198201
2. **JSON View** - Direct JSON editing with validation

lib/idp_cli_pkg/idp_cli/cli.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -209,7 +209,7 @@ def _display_deployment_failure(deployer, stack_name: str, result: dict):
209209

210210

211211
@click.group()
212-
@click.version_option(version="0.5.2-wip4")
212+
@click.version_option(version="0.5.2.4")
213213
def cli():
214214
"""
215215
IDP CLI - Batch document processing for IDP Accelerator

lib/idp_cli_pkg/pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ build-backend = "setuptools.build_meta"
77

88
[project]
99
name = "idp-cli"
10-
version = "0.5.2-wip4"
10+
version = "0.5.2.4"
1111
description = "Command-line interface for IDP Accelerator batch document processing"
1212
authors = [{name = "AWS"}]
1313
license = {text = "MIT-0"}

lib/idp_common_pkg/pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ exclude = [
2020

2121
[project]
2222
name = "idp_common"
23-
version = "0.5.2-wip4"
23+
version = "0.5.2.4"
2424
description = "Common utilities for GenAI IDP Accelerator patterns"
2525
authors = [{ name = "AWS", email = "noreply@amazon.com" }]
2626
requires-python = ">=3.10,<3.14"

lib/idp_common_pkg/setup.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -117,7 +117,7 @@
117117

118118
setup(
119119
name="idp_common",
120-
version="0.5.2-wip4",
120+
version="0.5.2.4",
121121
packages=find_packages(
122122
exclude=[
123123
"build",

lib/idp_sdk/idp_sdk/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -91,7 +91,7 @@
9191
StopWorkflowsResult,
9292
)
9393

94-
__version__ = "0.5.2-wip4"
94+
__version__ = "0.5.2.4"
9595

9696
__all__ = [
9797
# Client

0 commit comments

Comments
 (0)