fix: CoPilot review pass on PR #44

chris-colinsky · chris-colinsky · commit fb84521c6b5a · 2026-05-15T18:07:55.000-07:00
- audio/video symmetry in the substring fallback of
  _looks_like_content_rejection
- explicit isinstance(block, ImageBlock) guard in _block_to_wire to
  surface added union variants as a TypeError instead of an
  AttributeError on .source
- clarify ImageBlock.media_type docstring: permitted but redundant on
  URL sources (the URL payload carries content-type), provider
  implementations MAY consume it as a hint
- reword CHANGELOG qualifier '(proposal X, spec vY.Z)' →
  '(proposal X, introduced in spec vY.Z)' on the 0015 and 0016 entries
  so it doesn't read like a per-entry submodule pin change
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -8,10 +8,10 @@ The format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/). The
 
 ### Added
 
-- **Image content blocks for user messages (proposal 0015, spec v0.13.0).** `UserMessage.content` now accepts `str | list[ContentBlock]`. The block surface introduces `TextBlock`, `ImageBlock`, `ImageSourceURL`, `ImageSourceInline`, and the `ContentBlock` / `ImageSource` discriminated unions over the block / source `type` field. `ImageBlock` carries a `media_type` (required for inline sources; ignored for URL sources; typed as `str | None` so callers MAY pass any `image/*` type the bound model supports) and an optional `detail` hint (`"auto"` / `"low"` / `"high"`; `None` default omits the field from the wire so providers apply their own default). System, assistant, and tool messages stay text-string-only; image inputs are user-only in v1.
+- **Image content blocks for user messages (proposal 0015, introduced in spec v0.13.0).** `UserMessage.content` now accepts `str | list[ContentBlock]`. The block surface introduces `TextBlock`, `ImageBlock`, `ImageSourceURL`, `ImageSourceInline`, and the `ContentBlock` / `ImageSource` discriminated unions over the block / source `type` field. `ImageBlock` carries a `media_type` (required for inline sources; ignored for URL sources; typed as `str | None` so callers MAY pass any `image/*` type the bound model supports) and an optional `detail` hint (`"auto"` / `"low"` / `"high"`; `None` default omits the field from the wire so providers apply their own default). System, assistant, and tool messages stay text-string-only; image inputs are user-only in v1.
 - **`OpenAIProvider` content-array wire mapping.** When `UserMessage.content` is a content-block sequence, the wire body uses OpenAI's `content` array per §8.1.1. `TextBlock → {type: "text", text}`. `ImageBlock` with a URL source maps to `{type: "image_url", image_url: {url, detail?}}`. `ImageBlock` with an inline source constructs an RFC 2397 `data:<media_type>;base64,<base64_data>` URI and goes through the same `image_url` entry shape. Inline bytes pass through unchanged — no inspection, transcoding, or re-encoding.
 - **New error category `ProviderUnsupportedContentBlock` (non-transient).** Raised when the bound model rejects a content block type / media variant. Distinct from `ProviderInvalidRequest` (which covers spec-shape malformation): this category surfaces a *capability* mismatch, letting callers route differently (e.g., fall back to a multimodal-capable provider) without overloading the malformed-request category. Carries `block_type` ("image" / "audio" / "video") and `reason` (provider's human-readable message) when those are recoverable from the rejection. `OpenAIProvider` detects content rejection via HTTP 400 bodies — heuristic on `error.code` (known set: `image_content_not_supported`, `unsupported_image_media_type`, `audio_content_not_supported`, etc.), `error.type` (`image_parse_error`), and `error.message` ("does not support" + image/audio/video).
-- **Structured output (proposal 0016, spec v0.14.0).** `Provider.complete()` now accepts an optional `response_schema` parameter — either a JSON Schema dict or a Pydantic `BaseModel` subclass. When supplied, the provider constrains the model's output to the schema and populates `Response.parsed` with the validated value (`dict` for dict-schema input, a `BaseModel` instance for class input). New `StructuredOutputInvalid` error category (non-transient by default) raises on JSON parse failure or schema validation failure; carries the requested schema, the raw response content, and a failure description.
+- **Structured output (proposal 0016, introduced in spec v0.14.0).** `Provider.complete()` now accepts an optional `response_schema` parameter — either a JSON Schema dict or a Pydantic `BaseModel` subclass. When supplied, the provider constrains the model's output to the schema and populates `Response.parsed` with the validated value (`dict` for dict-schema input, a `BaseModel` instance for class input). New `StructuredOutputInvalid` error category (non-transient by default) raises on JSON parse failure or schema validation failure; carries the requested schema, the raw response content, and a failure description.
 - **`OpenAIProvider` native response_format wire path.** When `response_schema` is supplied, the chat-completions request body carries `response_format: { type: "json_schema", json_schema: { name, schema, strict } }`. The `strict` flag is determined by a deep recursive walk over the schema (object-property required-coverage rule across `anyOf` / `oneOf` / `allOf` and `$ref` targets, with cycle protection); unresolvable refs fall through to `strict: false`. The `name` field uses `schema.title` when present, otherwise a deterministic sha256-prefix hash.
 - **`OpenAIProvider` prompt-augmentation fallback.** Constructor flag `force_prompt_augmentation_fallback: bool` (default `False`) and read-only inspect property `uses_prompt_augmentation_fallback: bool`. When the flag is on, structured-output calls build a fresh message list with a system directive containing the serialized schema, omit `response_format` from the wire, and validate the response post-receive. The caller's original `messages` list is never mutated. Use for OpenAI-compatible servers (older vLLM, some LM Studio releases, llama.cpp variants) that reject or silently ignore `response_format`.
 - **Provider-agnostic schema helpers.** `openarmature.llm.validate_response_schema(schema)` (raises `ProviderInvalidRequest` when the schema is not a dict with a top-level `type: "object"`) and `openarmature.llm.strict_mode_supported(schema)` (the deep-tree strict-mode constraint check) are exported for reuse by future Anthropic/Gemini providers.
diff --git a/src/openarmature/llm/messages.py b/src/openarmature/llm/messages.py
@@ -174,11 +174,14 @@ class ImageBlock(BaseModel):
     Attributes:
         type: The discriminator literal ``"image"``.
         source: One of ``ImageSourceURL`` or ``ImageSourceInline``.
-        media_type: IANA media type. Required when source is inline;
-            ignored when source is a URL. Providers MUST accept
-            ``image/png``, ``image/jpeg``, ``image/webp`` at minimum
-            and MAY accept additional ``image/*`` types they document
-            support for.
+        media_type: IANA media type. Required when source is inline.
+            Permitted but redundant when source is a URL (the URL
+            payload carries the content-type); the OpenAI wire path
+            currently does not surface it for URL sources, but
+            provider implementations MAY consume it as a hint.
+            Providers MUST accept ``image/png``, ``image/jpeg``,
+            ``image/webp`` at minimum and MAY accept additional
+            ``image/*`` types they document support for.
         detail: Image-processing fidelity hint. One of ``"auto"``,
             ``"low"``, ``"high"``. ``None`` (the default) omits the
             field from the wire.
diff --git a/src/openarmature/llm/providers/openai.py b/src/openarmature/llm/providers/openai.py
@@ -75,6 +75,7 @@
 from ..messages import (
     AssistantMessage,
     ContentBlock,
+    ImageBlock,
     ImageSourceInline,
     Message,
     SystemMessage,
@@ -694,7 +695,8 @@ def _message_to_wire(msg: Message) -> dict[str, Any]:
 def _block_to_wire(block: ContentBlock) -> dict[str, Any]:
     if isinstance(block, TextBlock):
         return {"type": "text", "text": block.text}
-    # ImageBlock
+    if not isinstance(block, ImageBlock):  # pyright: ignore[reportUnnecessaryIsInstance]
+        raise TypeError(f"unhandled content block type: {type(block).__name__}")
     if isinstance(block.source, ImageSourceInline):
         url = f"data:{block.media_type};base64,{block.source.base64_data}"
     else:
@@ -854,8 +856,9 @@ def _looks_like_content_rejection(
         if error_code in _CONTENT_REJECTION_ERROR_CODES:
             return True
         lower_code = error_code.lower()
-        if "image" in lower_code and ("not_supported" in lower_code or "unsupported" in lower_code):
-            return True
+        for block_type in ("image", "audio", "video"):
+            if block_type in lower_code and ("not_supported" in lower_code or "unsupported" in lower_code):
+                return True
     if isinstance(error_type, str) and error_type.lower() in {
         "image_parse_error",
         "image_content_not_supported",