@@ -91,13 +91,15 @@ is exactly what it does and does not do:
9191 Tool executions ** inside** AWS action groups are opaque — the adapter
9292 cannot see them, let alone block them. Use ` guardToolUse() ` to enforce
9393 at the tool level manually, or push tool calls onto the host side.
94- - ** Multi-modal content is not scanned by default.** Image, PDF, and audio
95- blocks on Anthropic/Vercel AI/Genkit/LlamaIndex/Bedrock pass through
96- without injection detection in the current release — a vision-enabled
97- agent bypasses every input scan unless you wire your own scanner.
98- Opt-in per-modality scanning (image OCR, PDF text extract, Whisper for
99- audio) is on the near-term roadmap; cost, latency, and data-egress
100- considerations mean it will ship as opt-in, not on-by-default.
94+ - ** Multi-modal scanning is opt-in.** Image, PDF, and audio blocks pass
95+ through without injection detection by default. Register a per-modality
96+ extractor with ` registerModalityScanner() ` and call ` scanMultiModal() `
97+ from ` governance-sdk/scan/multi-modal ` before ` enforce() ` ; the result's
98+ concatenated text feeds the existing cascade. The SDK ships the
99+ orchestration only — the actual OCR / PDF parser / ASR is caller-
100+ supplied so the zero-dep promise stands. Defaults to text-only;
101+ per-block timeouts and fail-closed semantics (` onMissingScanner ` ,
102+ ` onExtractError ` ) are configurable.
101103
102104## Packages
103105
0 commit comments