
Commit 443dc4f

apartsin and claude committed
Add AI agent integration guide, Claude skills, fix TS build errors, and polish install paths
- Add docs/ForAIAgent.md: comprehensive guide for AI coding agents to install, configure, and integrate ModelMesh (Python, TypeScript, Docker)
- Add ClaudeSkill/ folder with 5 skill definitions: install, configure, integrate, deploy-proxy, test
- Add py.typed marker (PEP 561) for Python type checking support
- Add exports field to TypeScript package.json for browser subpath (@modelmesh/core/browser)
- Replace eslint lint script with tsc --noEmit (no eslint config needed)
- Fix 3 TypeScript build errors: widen RUNTIME static property type on base classes to allow subclass overrides (RuntimeEnvironment enum)
- Update README and docs/index.md with Docker install instructions
- Update CoverageMatrix.md with ForAIAgent.md traceability

All 1,366 tests pass (855 Python + 511 TypeScript).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent e31241b commit 443dc4f

20 files changed: +1383 -13 lines

ClaudeSkill/README.md

Lines changed: 29 additions & 0 deletions
# ModelMesh Claude Code Skills

This folder contains Claude Code skill definitions for working with ModelMesh. Each `.md` file is a self-contained skill that Claude Code can load and execute.

## Available Skills

| Skill | File | Description |
|---|---|---|
| **Install** | `install.md` | Install ModelMesh into any project (Python, TypeScript, or Docker) |
| **Configure** | `configure.md` | Generate a `modelmesh.yaml` config with providers, models, and pools |
| **Integrate** | `integrate.md` | Replace existing AI SDK calls with ModelMesh routing |
| **Deploy Proxy** | `deploy-proxy.md` | Set up and deploy the Docker OpenAI proxy |
| **Test** | `test.md` | Run the ModelMesh test suite and verify the integration |

## Usage with Claude Code

These skills can be loaded as custom commands in Claude Code. To use them:

1. Copy the desired skill `.md` file into your project's `.claude/` directory, or
2. Reference them directly when asking Claude Code for help.

## How Skills Work

Each skill file contains:
- **Context**: what ModelMesh is and how it works
- **Decision tree**: questions to ask the user to determine the right approach
- **Implementation steps**: exact commands and code to execute
- **Verification**: how to confirm the integration works

ClaudeSkill/configure.md

Lines changed: 141 additions & 0 deletions
# ModelMesh Configure Skill

## Purpose

Generate a `modelmesh.yaml` configuration file tailored to the user's needs.

## Decision Steps

1. **Which providers?** Ask which API keys the user has available.
2. **Which capabilities?** What does their app need?
   - Chat completion (text generation)
   - Text embeddings
   - Text-to-speech
   - Speech-to-text
   - Image generation
   - Code generation
3. **Which rotation strategy?**
   - `stick-until-failure` (default, recommended) — stay with the working model
   - `cost-first` — minimize spending
   - `round-robin` — spread load evenly
   - `latency-first` — fastest response
   - `rate-limit-aware` — avoid rate limits
4. **Budget controls?** Does the user want daily spend limits per provider?
5. **Secret store?** How are API keys managed?
   - `modelmesh.env.v1` (default) — environment variables
   - `modelmesh.dotenv.v1` — `.env` file
   - Cloud options: AWS Secrets Manager, Google Secret Manager, Azure Key Vault

## Configuration Template

```yaml
# modelmesh.yaml — Generated configuration
secrets:
  store: modelmesh.env.v1

providers:
  # Add providers based on user's available API keys
  openai.llm.v1:
    api_key: ${secrets:OPENAI_API_KEY}
    budget:
      daily_limit: 10.00  # optional

  anthropic.claude.v1:
    api_key: ${secrets:ANTHROPIC_API_KEY}

  groq.api.v1:
    api_key: ${secrets:GROQ_API_KEY}

models:
  # Add models for each provider
  gpt-4o-mini:
    provider: openai.llm.v1
    capabilities:
      - generation.text-generation.chat-completion
    features:
      tool_calling: true
      structured_output: true
      json_mode: true
      system_prompt: true
    constraints:
      context_window: 128000
      max_output_tokens: 16384

  claude-3-5-haiku:
    provider: anthropic.claude.v1
    capabilities:
      - generation.text-generation.chat-completion
    features:
      tool_calling: true
      system_prompt: true
    constraints:
      context_window: 200000
      max_output_tokens: 8192

  llama-3.3-70b:
    provider: groq.api.v1
    capabilities:
      - generation.text-generation.chat-completion
    features:
      tool_calling: true
      system_prompt: true
    constraints:
      context_window: 131072
      max_output_tokens: 32768

pools:
  text-generation:
    strategy: modelmesh.stick-until-failure.v1
    capability: generation.text-generation
```

## Provider Reference

| Provider ID | Env Var | Models |
|---|---|---|
| `openai.llm.v1` | `OPENAI_API_KEY` | gpt-4o, gpt-4o-mini, gpt-4-turbo |
| `anthropic.claude.v1` | `ANTHROPIC_API_KEY` | claude-sonnet-4, claude-3-5-haiku |
| `google.gemini.v1` | `GOOGLE_API_KEY` | gemini-2.0-flash, gemini-1.5-pro |
| `groq.api.v1` | `GROQ_API_KEY` | llama-3.3-70b, mixtral-8x7b |
| `deepseek.api.v1` | `DEEPSEEK_API_KEY` | deepseek-chat, deepseek-coder |
| `mistral.api.v1` | `MISTRAL_API_KEY` | mistral-large, mistral-medium |
| `together.api.v1` | `TOGETHER_API_KEY` | Various open-source models |
| `openrouter.gateway.v1` | `OPENROUTER_API_KEY` | Multi-provider gateway |
| `xai.grok.v1` | `XAI_API_KEY` | grok-2, grok-2-mini |
| `cohere.nlp.v1` | `COHERE_API_KEY` | command-r, command-r-plus |

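Before generating the config, it can help to check which of the environment variables above are actually set. A minimal standard-library sketch (the mapping is copied from the table; `available_providers` is a hypothetical helper, not part of the ModelMesh API):

```python
import os

# Provider IDs mapped to the env vars from the table above (subset shown)
PROVIDER_ENV_VARS = {
    "openai.llm.v1": "OPENAI_API_KEY",
    "anthropic.claude.v1": "ANTHROPIC_API_KEY",
    "google.gemini.v1": "GOOGLE_API_KEY",
    "groq.api.v1": "GROQ_API_KEY",
}

def available_providers(env=os.environ):
    """Return provider IDs whose API key variable is set and non-empty."""
    return [pid for pid, var in PROVIDER_ENV_VARS.items() if env.get(var)]
```

Only providers returned by this check need entries in the `providers:` section.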
## Capability Paths

| Short Name | Full Path | Use For |
|---|---|---|
| `chat-completion` | `generation.text-generation.chat-completion` | Chat, Q&A, text generation |
| `text-generation` | `generation.text-generation` | Broader text-generation pool |
| `text-embeddings` | `representation.embeddings.text-embeddings` | Semantic search, RAG |
| `text-to-speech` | `generation.audio.text-to-speech` | Voice synthesis |
| `speech-to-text` | `understanding.audio.speech-to-text` | Transcription |
| `text-to-image` | `generation.image.text-to-image` | Image generation |
| `code-generation` | `generation.text-generation.code-generation` | Code completion |

## Strategy Reference

| Strategy ID | Best For |
|---|---|
| `modelmesh.stick-until-failure.v1` | General use (default) |
| `modelmesh.round-robin.v1` | Even load distribution |
| `modelmesh.cost-first.v1` | Budget-sensitive apps |
| `modelmesh.latency-first.v1` | Real-time applications |
| `modelmesh.priority-selection.v1` | Preferred model with fallback |
| `modelmesh.rate-limit-aware.v1` | High-volume apps |
| `modelmesh.load-balanced.v1` | Weighted traffic splitting |
| `modelmesh.session-stickiness.v1` | Conversation continuity |

## Verification

After creating the config, verify that it loads:

```python
from modelmesh.config.mesh_config import MeshConfig

config = MeshConfig.from_yaml("modelmesh.yaml")
print(f"Providers: {len(config.providers)}")
print(f"Models: {len(config.models)}")
print(f"Pools: {len(config.pools)}")
```

ClaudeSkill/deploy-proxy.md

Lines changed: 146 additions & 0 deletions
# ModelMesh Deploy Proxy Skill

## Purpose

Set up and deploy the ModelMesh Docker proxy as an OpenAI-compatible REST API server.

## Prerequisites

- Docker and Docker Compose installed
- At least one AI provider API key

## Quick Deploy

### Step 1: Get the Project

```bash
git clone https://github.com/ApartsinProjects/ModelMesh.git
cd ModelMesh
```

### Step 2: Configure API Keys

```bash
cp .env.example .env
```

Edit `.env` and add your API keys:

```env
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GROQ_API_KEY=gsk_...
```

### Step 3: Configure Models and Pools

Edit `modelmesh.yaml` to define:
- Which providers to use
- Which models to expose
- Which pools to create (capability groupings)
- The rotation strategy

See the `configure.md` skill for detailed configuration options.

### Step 4: Build and Start

```bash
docker compose up --build
```

Or use the automation script:

```bash
./scripts/proxy-up.sh
```

For detached (background) mode:

```bash
./scripts/proxy-up.sh --detach
```

### Step 5: Verify

```bash
# Health check
curl http://localhost:8080/health

# List models and pools
curl http://localhost:8080/v1/models

# Test chat completion
curl -X POST http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model":"text-generation","messages":[{"role":"user","content":"Hello!"}]}'
```

Or use the smoke test script:

```bash
./scripts/proxy-test.sh
```
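Because the proxy speaks the OpenAI wire format, the same check can be scripted from Python with only the standard library. A minimal sketch (the pool name `text-generation` and port 8080 match the curl example above; `build_chat_request` is a hypothetical helper, and the live call is left commented out since it requires a running proxy):

```python
import json
import urllib.request

def build_chat_request(base_url, model, prompt):
    """Build an OpenAI-style chat completion request for the proxy."""
    payload = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Against a proxy started in Step 4:
# req = build_chat_request("http://localhost:8080", "text-generation", "Hello!")
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```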

## Authentication

To require a bearer token, append `--token` to the proxy command in `docker-compose.yaml`:

```yaml
command: ["--config", "/app/modelmesh.yaml", "--host", "0.0.0.0", "--port", "8080", "--token", "my-secret-token"]
```

Then rebuild and restart:

```bash
docker compose up --build
```

Clients must then include the header `Authorization: Bearer my-secret-token`.

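From Python, the token is attached as a standard `Authorization` header. A minimal standard-library sketch (`authed_request` is a hypothetical helper; the token matches the example above):

```python
import urllib.request

def authed_request(base_url, token, path):
    """Build a request carrying the bearer token the proxy was started with."""
    return urllib.request.Request(
        f"{base_url}{path}",
        headers={"Authorization": f"Bearer {token}"},
    )

# e.g. list models against a running proxy:
# urllib.request.urlopen(authed_request("http://localhost:8080", "my-secret-token", "/v1/models"))
```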
## Proxy CLI Reference

```
python -m modelmesh.proxy [OPTIONS]

Options:
  --config PATH       YAML configuration file (default: auto-detect)
  --host HOST         Bind address (default: 0.0.0.0)
  --port PORT         Listen port (default: 8080)
  --token TOKEN       Optional bearer token for authentication
  --log-level LEVEL   Logging level: DEBUG, INFO, WARNING, ERROR (default: INFO)
```
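When launching the proxy without Docker, the flags above can be assembled programmatically. A sketch (`proxy_argv` is a hypothetical helper; the actual launch is commented out because it requires ModelMesh to be installed in the current environment):

```python
import sys

def proxy_argv(config="modelmesh.yaml", host="0.0.0.0", port=8080,
               token=None, log_level="INFO"):
    """Assemble the `python -m modelmesh.proxy` command line from the options above."""
    argv = [
        sys.executable, "-m", "modelmesh.proxy",
        "--config", config,
        "--host", host,
        "--port", str(port),
        "--log-level", log_level,
    ]
    if token:
        argv += ["--token", token]
    return argv

# To launch:
# import subprocess; subprocess.run(proxy_argv(token="my-secret-token"))
```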

## API Endpoints

| Method | Path | Description |
|---|---|---|
| `GET` | `/health` | Health check (returns status, uptime, model count) |
| `GET` | `/v1/models` | List available models and pools |
| `POST` | `/v1/chat/completions` | Chat completion (streaming and non-streaming) |
| `POST` | `/v1/embeddings` | Text embeddings |
| `POST` | `/v1/audio/speech` | Text-to-speech |
| `POST` | `/v1/audio/transcriptions` | Speech-to-text |

## Docker Compose Configuration

```yaml
services:
  modelmesh-proxy:
    build: .
    ports:
      - "8080:8080"
    env_file: .env
    volumes:
      - ./modelmesh.yaml:/app/modelmesh.yaml:ro
    command: ["--config", "/app/modelmesh.yaml", "--host", "0.0.0.0", "--port", "8080"]
```

## Stopping

```bash
docker compose down
# Or:
./scripts/proxy-down.sh
# Clean up volumes and images:
./scripts/proxy-down.sh --clean
```

## Production Considerations

1. **Use `--token` for authentication** in production environments
2. **Set daily budget limits** per provider in `modelmesh.yaml`
3. **Use cloud secret stores** (AWS Secrets Manager, etc.) instead of env vars
4. **Monitor** via the `/health` endpoint for uptime checks
5. **Reverse proxy**: place behind nginx/caddy for TLS and rate limiting
6. **Logging**: use `--log-level DEBUG` during setup, `INFO` in production
