AGENTS.md — Eyrie

Universal LLM provider runtime. One interface for every model. Authentication, routing, streaming, retries, caching — handled.

Design Principles

Model-agnostic — single interface for 75+ LLM providers
Zero opinions — consumers control routing, caching, and retry strategies
Streaming-first — all responses are streamed; blocking is opt-in

Observability

See hawk/docs/OTEL-CONVENTIONS.md for the shared OpenTelemetry attribute vocabulary (gen_ai.*, cost.usd, etc.) used across all GrayCodeAI repos.

Build & Test

go test ./...                    # Run all tests
go test -race ./...              # Race detector
go test -coverprofile=c.out ./... # Coverage
go vet ./...                     # Static analysis
gofumpt -w .                     # Format
make ci                          # Full CI suite

Architecture

provider.go — Provider interface and registry
routing.go — Model routing and fallback chains
streaming.go — SSE streaming with backpressure
auth.go — API key management and rotation
cache.go — Response caching (optional)
retry.go — Retry with exponential backoff + Retry-After
catalog.go — Model catalog and capability discovery

Conventions

Go 1.26+, pure Go, no CGO
Table-driven tests
Conventional Commits: feat:, fix:, docs:, refactor:, test:
No Co-authored-by: trailers (auto-stripped by githook)
gofumpt formatting enforced in CI
Credential setup flow documented in docs/guides/CREDENTIAL-SETUP-FLOW.md

Common Pitfalls

Provider interface is the boundary — keep it stable
Streaming tests need careful goroutine management
go.work here should stay minimal; hawk's own go.work adds an external/eyrie replace so hawk can develop against a local eyrie checkout. Do not add extra local replace directives here without coordinating with hawk's workspace.

Naming Conventions

Provider interface: client.Provider with Chat(), StreamChat(), Ping(), Name() — implemented per LLM vendor
Client types: EyrieClient, EyrieMessage, EyrieResponse, EyrieTool, EyrieUsage — Eyrie prefix for public types
Config struct: EyrieConfig with Provider, APIKey, BaseURL, Model, MaxRetries fields
Provider implementations: AnthropicClient, OpenAIClient, GeminiClient, BedrockClient, etc. — in client/ package
Compatibility configs: OpenAICompat, GrokCompat, OpenRouterCompat — Compat suffix for provider quirks
Error type: EyrieError with Provider, Op, StatusCode, RequestID, Message, Err fields
Stream types: StreamResult, SSEEvent, StreamEvent — streaming is SSE-based
Retry config: RetryConfig embeds types.RetryConfig + adds RetryOn []int for HTTP status codes
Version wiring: client.Version set via SetVersion() from root package — avoids circular import

API Patterns

Provider auto-detection: DetectProvider() checks env vars in priority order (ANTHROPIC_API_KEY, OPENAI_API_KEY, etc.)
Client creation: client.NewEyrieClient(&EyrieConfig{...}) or client.Client(&EyrieConfig{...}) — both work
Chat method: c.Chat(ctx, messages, opts) — non-streaming, returns *EyrieResponse
Stream method: c.StreamChat(ctx, messages, opts) — returns *StreamResult, caller must defer sr.Close()
Auto-continuation: StreamChatContinue() transparently retries when stop_reason == max_tokens
Provider fallback: fallback.go implements fallback chains across providers
Rate limiting: ratelimit.go implements token bucket per provider — prevents hitting API limits
Semantic caching: semantic_cache.go caches similar prompts — optional, configurable TTL
Retry with backoff: retry.go — exponential backoff + jitter, respects Retry-After header, retries on 429/500/502/503/529
Error hierarchy: EyrieError has IsRetriable(), IsAuthError(), IsRateLimited() methods for programmatic handling
SSE parsing: parseSSEStream() reads bufio.Scanner with 2MB buffer, emits SSEEvent to channel

Testing Patterns

httptest.NewServer for provider tests: mock LLM API responses with realistic JSON
Provider-specific test files: anthropic_test.go, openai_test.go, gemini_test.go — each provider tested independently
Env var manipulation: os.Setenv/os.Unsetenv in tests with defer cleanup — test provider detection
Credentials store: credentials.MapStore{} for testing — credentials.SetDefaultStore(store) + t.Cleanup(func() { ... })
Client test structure: create mock server, create client with server URL, call Chat/StreamChat, assert response
Header assertions: verify X-Api-Key, Anthropic-Version, User-Agent headers are sent correctly
Stream tests: parse SSE events from mock server, verify content/tool_call/done event types
Retry tests: mock server returning 429/500, verify retry count and backoff behavior
Cache tests: call same prompt twice, verify second call hits cache (no second HTTP request)
Fuzz tests: fuzz_test.go for input parsing robustness

Refactoring Guidelines

Safe to refactor: retry.go, ratelimit.go, cache.go — internal infrastructure, no public API changes
Safe to refactor: stream.go SSE parsing — internal implementation detail
Safe to refactor: fallback.go, weighted.go — routing strategies, extend with new strategies
Safe to refactor: cost_estimator.go, cache_analytics.go — metrics and tracking
Do not touch: Provider interface (Chat, StreamChat, Ping, Name) — breaking change for all implementations
Do not touch: EyrieMessage, EyrieResponse, ChatOptions struct field names — serialization contract
Do not touch: EyrieError struct — used by consumers for error type assertions
Do not touch: client.EyrieConfig — constructor contract for all consumers
Safe to extend: add new provider implementations, new SSE event types, new cache strategies
When adding a provider: create client/<provider>.go, implement Provider interface, register in provider_registry.go

Key File Locations

What	Where
Provider interface	`client/client.go` (`Provider`, `EyrieConfig`, `EyrieMessage`, `ContentPart`)
Chat implementation	`client/chat.go` (`Chat()`, `StreamChat()`, `StreamChatContinue()`)
Anthropic provider	`client/anthropic.go`
OpenAI provider	`client/openai.go`
Gemini provider	`client/gemini.go`
Bedrock provider	`client/bedrock.go`
Vertex provider	`client/vertex.go`
Azure provider	`client/azure.go`
Provider registry	`client/provider_registry.go`
Provider compatibility	`client/compat.go` (`OpenAICompat`, `GrokCompat`, etc.)
SSE streaming	`client/stream.go` (`parseSSEStream()`, `SSEEvent`)
Retry logic	`client/retry.go` (`RetryConfig`, `backoffDelay()`, `shouldRetry()`)
Rate limiting	`client/ratelimit.go`, `client/adaptive_ratelimit.go`
Caching	`client/cache.go`, `client/semantic_cache.go`, `client/cache_analytics.go`
Fallback chains	`client/fallback.go`
Auto-continuation	`client/continuation.go`
Error types	`client/errors.go` (`EyrieError`, `IsRetriable()`, `IsAuthError()`)
Error constants	`errors/errors.go` (API error messages, prompt-too-long parsing)
Model catalog	`catalog/` (pricing, context windows, capabilities per provider)
Credentials	`credentials/` (key storage, env detection, scrubbing) — `HasSecret` is silent on miss (boolean predicate); `LookupSecret` logs `Debug` on `ErrNotFound` and `Warn` on real backend errors
Mock provider	`client/mock.go`
Main test file	`client/client_test.go` (httptest servers, provider detection)
Linter config	`.golangci.yml` (govet, ineffassign, misspell — minimal)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

AGENTS.md — Eyrie

Design Principles

Observability

Build & Test

Architecture

Conventions

Common Pitfalls

Naming Conventions

API Patterns

Testing Patterns

Refactoring Guidelines

Key File Locations

Uh oh!

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

AGENTS.md — Eyrie

Design Principles

Observability

Build & Test

Architecture

Conventions

Common Pitfalls

Naming Conventions

API Patterns

Testing Patterns

Refactoring Guidelines

Key File Locations