Prediction Guard Microservice

Prediction Guard gives you access to hosted open-access LLMs, LVMs, and embedding models with integrated safeguards. In addition to providing scalable access to open models, it lets you configure factual consistency checks, toxicity filters, PII filters, and prompt injection blocking. To get started, join the Prediction Guard Discord channel and request an API key.

Table of Contents

Start Microservice
Consume Microservice

Start Microservice

You can build and run the Prediction Guard microservice using Docker Compose.

Run Docker with Docker Compose

export service_name="textgen-predictionguard"

cd comps/llms/deployment/docker_compose/
docker compose -f compose_text-generation.yaml up ${service_name} -d
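
Since the service depends on the API key mentioned in the introduction, it can help to fail early when the key is missing from the environment. The sketch below is one way to do that. The helper `require_env` is hypothetical, and the variable name `PREDICTIONGUARD_API_KEY` is an assumption; check the compose file for the name it actually reads.

```shell
# Hypothetical helper: return non-zero if the named variable is unset or empty.
require_env() {
    var_name="$1"
    # Indirect variable lookup that also works in plain POSIX sh.
    eval "value=\${$var_name:-}"
    if [ -z "$value" ]; then
        echo "error: $var_name is not set" >&2
        return 1
    fi
}

# Possible usage (the variable name is an assumption, not confirmed by this README):
#   require_env PREDICTIONGUARD_API_KEY && \
#       docker compose -f compose_text-generation.yaml up ${service_name} -d
```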

Consume Microservice

See the Prediction Guard docs for available model options.

Without stream

curl -X POST http://localhost:9000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "Hermes-3-Llama-3.1-8B",
        "messages": "Tell me a joke.",
        "max_tokens": 100,
        "temperature": 0.7,
        "top_p": 0.9,
        "top_k": 50,
        "stream": false
    }'
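
When the prompt comes from a shell variable rather than a literal, the JSON body needs its quotes and backslashes escaped. The following is a minimal sketch of building the same request body from a prompt string. The helpers `json_escape` and `build_payload` and the `MODEL` variable are hypothetical names introduced here, and the escaping covers only backslashes and double quotes, not newlines or control characters.

```shell
# Hypothetical helper: escape backslashes first, then double quotes.
json_escape() {
    printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Hypothetical helper: print a request body with the same fields as the curl example.
# $1 = prompt text, $2 = "true" or "false" for streaming (defaults to false).
build_payload() {
    prompt=$(json_escape "$1")
    stream="${2:-false}"
    printf '{"model": "%s", "messages": "%s", "max_tokens": 100, "temperature": 0.7, "top_p": 0.9, "top_k": 50, "stream": %s}\n' \
        "${MODEL:-Hermes-3-Llama-3.1-8B}" "$prompt" "$stream"
}

# Possible usage:
#   curl -X POST http://localhost:9000/v1/chat/completions \
#       -H "Content-Type: application/json" \
#       -d "$(build_payload 'Tell me a joke.')"
```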

With stream

curl -N -X POST http://localhost:9000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "Hermes-3-Llama-3.1-8B",
        "messages": "Tell me a joke.",
        "max_tokens": 100,
        "temperature": 0.7,
        "top_p": 0.9,
        "top_k": 50,
        "stream": true
    }'
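
A streamed response can be post-processed line by line as it arrives. The sketch below assumes the service emits server-sent-event style lines prefixed with `data: ` and ends the stream with `data: [DONE]`. This README does not confirm that framing, so verify it against the actual output first. The function name `sse_data` is hypothetical.

```shell
# Hypothetical helper: read a stream on stdin, print the payload of each
# "data: " line, and stop at the assumed "data: [DONE]" terminator.
sse_data() {
    while IFS= read -r line; do
        case "$line" in
            "data: [DONE]") break ;;
            "data: "*) printf '%s\n' "${line#data: }" ;;
        esac
    done
}

# Possible usage: append "| sse_data" to the streaming curl command above.
```

Blank lines and any lines without the `data: ` prefix are skipped, so only the chunk payloads reach the next stage of the pipeline.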
