CVAT Remote Model Orchestrator

Enterprise-level CVAT/Nuclio integration platform with FastAPI. Run AI models on powerful remote hosts with specialized GPUs and connect them directly to your CVAT instance via Nuclio with ultra-high performance and zero-config automation.

Features

Direct Remote Connection: Models run on a dedicated remote host (Inference Server) and connect to CVAT via automatically generated Nuclio functions.
Multi-Model Type Support: Handle Detectors, Trackers, and Interactors through a unified interface.
Multi-Instance Support: Run the same model implementation with different weights or configurations simultaneously.
Generic Configuration: Pass any custom parameters to models via the config block in models.yaml.
Auto-Detection & Generation: New models are detected automatically. If implementation files or classes.txt are missing, they are generated using intelligent templates.
Auto-Configuration: Model types and ports are automatically managed in config/models.yaml.
Nuclio Generator: Dynamic deployment files for CVAT are automatically generated with type-specific annotations and specs, pre-configured to point to your remote host.
Lazy Initialization: Models load only when requested, with no overhead during server startup.
Smart Resource Management: Automatic memory unloading for idle models.
Enterprise Architecture: Fully typed, PEP8 compliant, and designed for production extensibility.

Software Flow

1. Initialization & Orchestration

The Orchestrator (src/services/orchestrator.py) is the centralized controller. It monitors the environment and ensures the system state matches the configuration.

graph TD
    A[Orchestrator Loop] -->|Scan| B(File System src/models/*)
    B -->|Detect New Model| C{Is Configured?}
    C -->|No| D[Auto-Register in models.yaml]
    D -->|Set Type| E[Generate Implementation & Classes]
    E -->|Assign Port| F[Generate Nuclio Files]
    F -->|Start Server| G[FastAPI Subprocess]
    C -->|Yes| G

2. Request Flow (CVAT -> Nuclio -> Remote Host)

Data flows through the generated Nuclio function directly to the dedicated Remote Inference Server.

sequenceDiagram
    participant CVAT
    participant Nuclio as Nuclio Function (Proxy)
    participant API as FastAPI Server (Remote Host)
    participant Model as AI Model (Detector/Tracker/...)

    CVAT->>Nuclio: Send Image (Base64)
    Nuclio->>API: POST /infer (Forward to Remote Host)
    
    rect rgb(200, 255, 200)
    Note over API, Model: Lazy Initialization
    API->>API: Check if Model Loaded
    alt Model Not Loaded
        API->>Model: Load Weights (First Run)
    end
    end

    API->>Model: Run Inference (infer)
    Model-->>API: Return Results
    API-->>Nuclio: Return JSON (CVAT Format)
    Nuclio-->>CVAT: Return Annotations

Project Structure

yolov12-backend-fastapi/
├── src/                    # Main source directory
│   ├── core/              # interfaces, config, exceptions
│   ├── services/          # orchestrator, model loader, runner
│   ├── models/            # Model implementations (Detector/Tracker/Interactor)
│   └── api/               # FastAPI application logic
├── config/                # Configuration
│   ├── models.yaml        # Central model registry (Auto-updated)
│   └── cvat/             # Generic Nuclio template (template.yaml)
├── nuclio_functions/      # Generated Nuclio deployment packages
├── scripts/              # Utility scripts
├── main.py              # System Entry Point
├── pyproject.toml       # Project configuration and dependencies
└── uv.lock              # Lockfile for reproducible environments

Management Script

For convenience, a docker-compose style management script is provided:

# Start in foreground
./orchestrator.sh up

# Start in background (detached)
./orchestrator.sh up -d

# Stop the system
./orchestrator.sh down

# Check status
./orchestrator.sh status

# View logs
./orchestrator.sh logs -f

Quick Start (Alternative)

If you prefer running directly:

uv sync
uv run python main.py

Configuration & Multi-Instance

Settings are managed in config/models.yaml. The orchestrator automatically registers new models found in src/models/ and assigns them a unique port.

Multi-Instance & Custom Weights

You can define multiple instances of the same model implementation with different configurations:

models:
  yolov12n_v1:
    implementation: "yolov12n"
    port: 5001
    config:
      weights: "path/to/weights1.pt"
  yolov12n_v2:
    implementation: "yolov12n"
    port: 5002
    config:
      weights: "path/to/weights2.pt"
      confidence_threshold: 0.5
      device: 'cuda:0'
    classes: ["person", "car", "truck"]
    interpreter_path: "/path/to/specific/venv/bin/python"

Any field inside the config dictionary is passed as a keyword argument to the model's __init__ method. The interpreter_path field allows running the model instance in a different Python environment (e.g., with specific CUDA or library versions).

Development & Testing

Installation

# Clone the repository and install dependencies
uv sync

Running Tests

The project uses pytest for automated testing.

uv run pytest tests/

See tests/README.md for more details.

Code Style

This project follows PEP8 standards. Comprehensive docstrings are provided for all core modules.

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CVAT Remote Model Orchestrator

Features

Software Flow

1. Initialization & Orchestration

2. Request Flow (CVAT -> Nuclio -> Remote Host)

Project Structure

Management Script

Quick Start (Alternative)

Configuration & Multi-Instance

Multi-Instance & Custom Weights

Development & Testing

Installation

Running Tests

Code Style

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
scripts		scripts
serverless		serverless
src		src
tests		tests
LICENSE		LICENSE
README.md		README.md
main.py		main.py
orchestrator.log		orchestrator.log
orchestrator.sh		orchestrator.sh
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

CVAT Remote Model Orchestrator

Features

Software Flow

1. Initialization & Orchestration

2. Request Flow (CVAT -> Nuclio -> Remote Host)

Project Structure

Management Script

Quick Start (Alternative)

Configuration & Multi-Instance

Multi-Instance & Custom Weights

Development & Testing

Installation

Running Tests

Code Style

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages