E-Commerce Support Semantic Cache Workshop

A hands-on workshop for building an e-commerce customer support assistant with Redis, RedisVL Semantic Cache, RedisVL Semantic Router, FastAPI, and a browser workbench. Students complete the cache and router TODOs, then use the UI to observe cache hits, misses, semantic distance, route selection, latency, and Redis index state.

Overview

This workshop turns a Docker-based browser workbench into a practical lab for semantic caching and semantic routing in customer support workflows.

You will build and inspect:

Redis semantic cache - Reuse answers for semantically similar customer questions, not just exact string matches.
Redis semantic router - Route questions to FAQ lookup, support escalation, blocked request handling, or unknown fallback.
FAQ-backed RAG flow - Retrieve support FAQ context from Redis before generating a fresh answer.
Observable support UI - Compare hit or miss state, matched prompt, similarity, vector distance, selected route, selected model, latency, and cache size.
Redis inspection workflow - Use Redis Insight and redis-cli to inspect cache keys, router indexes, and stored metadata.

Look for # TODO comments in:

code/api/semantic_cache.py
code/api/semantic_router.py

Each TODO maps to one workshop challenge.

Workshop Screenshots

[Screenshot: Support Console]

[Screenshot: Workshop Guide]

Workshop Challenges

Complete these tasks in order. The instructions live in the workbench docs under docs/tasks.

Part 1: Semantic Cache

| # | Challenge | File | Description |
|---|-----------|------|-------------|
| 1 | Create Cache | `code/api/semantic_cache.py` | Initialize a RedisVL `SemanticCache` with the workshop Redis client, vectorizer, TTL, and threshold. |
| 2 | Check Cache | `code/api/semantic_cache.py` | Query the semantic cache for the nearest prior prompt. |
| 3 | Store Response | `code/api/semantic_cache.py` | Store fresh support answers with source, model, latency, and timestamp metadata. |
| 4 | Observe Hit/Miss | `docs/tasks/task-2.md` | Ask an initial question, then ask a paraphrase and compare cache behavior. |
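The check and store steps in Part 1 can be pictured with a small in-memory stand-in. This is only a sketch, not the RedisVL `SemanticCache` API: `ToySemanticCache`, the hand-picked three-dimensional vectors, and the placeholder answer and latency are all assumptions made for illustration. A real run would use embeddings from the configured vectorizer and store entries in Redis.

```python
import math
import time
from dataclasses import dataclass, field


def cosine_distance(a, b):
    """Return 1 - cosine similarity, so 0.0 means 'same direction'."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


@dataclass
class ToySemanticCache:
    """In-memory stand-in for a semantic cache (hypothetical, not RedisVL)."""
    distance_threshold: float
    entries: list = field(default_factory=list)

    def store(self, prompt, vector, response, metadata):
        """Keep the prompt, its vector, the answer, and descriptive metadata."""
        self.entries.append(
            {"prompt": prompt, "vector": vector, "response": response, "metadata": metadata}
        )

    def check(self, vector):
        """Return the nearest stored entry if it is within the threshold, else None."""
        best = None
        for entry in self.entries:
            distance = cosine_distance(vector, entry["vector"])
            if best is None or distance < best["vector_distance"]:
                best = {**entry, "vector_distance": distance}
        if best is not None and best["vector_distance"] <= self.distance_threshold:
            return best
        return None


# Hand-picked vectors stand in for real embeddings (assumption for illustration).
FORGOT_PASSWORD = [0.9, 0.1, 0.0]
LOST_CREDENTIALS = [0.8, 0.3, 0.0]
DELIVERY_TIME = [0.0, 0.1, 0.9]

cache = ToySemanticCache(distance_threshold=0.4)
cache.store(
    prompt="I forgot my password",
    vector=FORGOT_PASSWORD,
    response="Placeholder answer about resetting a password.",
    metadata={
        "sources": ["faq"],            # placeholder source label
        "model": "gpt-4.1-mini",
        "latency_ms": 850,             # placeholder, not a measured value
        "created_at": time.time(),
    },
)

hit = cache.check(LOST_CREDENTIALS)   # paraphrase: close to the stored vector
miss = cache.check(DELIVERY_TIME)     # unrelated: far from the stored vector
print("paraphrase:", "hit" if hit else "miss")
print("unrelated:", "hit" if miss else "miss")
```

The paraphrase vector lands well inside the 0.4 threshold while the unrelated one does not, which is the behavior Challenge 4 asks you to observe in the UI.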

Part 2: Semantic Router

| # | Challenge | File | Description |
|---|-----------|------|-------------|
| 5 | Review Routes | `code/api/semantic_router.py` | Inspect FAQ, escalation, and blocked route examples plus metadata. |
| 6 | Create Router | `code/api/semantic_router.py` | Initialize a RedisVL `SemanticRouter` with route references in Redis. |
| 7 | Match Route | `code/api/semantic_router.py` | Use `route_many` to classify customer questions by semantic distance. |
| 8 | Reindex Router | `docs/tasks/task-3.md` | Rebuild the router index and verify the document count. |
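To make the idea of classifying by semantic distance concrete, here is a minimal sketch of nearest-route matching with a fallback. It does not use RedisVL: `ToyRoute` and `match_route` are hypothetical names, the vectors are hand-picked rather than embedded phrases, and taking the minimum distance per route is a simplification chosen for this example rather than a statement about how RedisVL aggregates distances.

```python
import math
from dataclasses import dataclass


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


@dataclass
class ToyRoute:
    """Hypothetical route: a name, reference vectors, and a distance threshold."""
    name: str
    references: list
    distance_threshold: float


def match_route(vector, routes):
    """Return (route_name, distance) for the closest route within its threshold.

    Falls back to ("unknown", None) when no route is close enough.
    """
    best_name, best_distance = "unknown", None
    for route in routes:
        distance = min(cosine_distance(vector, ref) for ref in route.references)
        within = distance <= route.distance_threshold
        if within and (best_distance is None or distance < best_distance):
            best_name, best_distance = route.name, distance
    return best_name, best_distance


# Hand-picked vectors stand in for embedded example phrases (assumption).
ROUTES = [
    ToyRoute("faq", [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]], 0.3),
    ToyRoute("support_escalation", [[0.0, 1.0, 0.0]], 0.3),
    ToyRoute("blocked", [[0.0, 0.0, 1.0]], 0.3),
]

print(match_route([0.95, 0.1, 0.0], ROUTES)[0])  # near the faq references
print(match_route([0.1, 0.9, 0.1], ROUTES)[0])   # near the escalation reference
print(match_route([0.5, 0.5, 0.5], ROUTES)[0])   # equally far from everything
```

A prompt that sits between all routes exceeds every threshold and falls through to `unknown`, which mirrors the fallback route used in the workshop.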

Part 3: Tune and Extend

| # | Challenge | File | Description |
|---|-----------|------|-------------|
| 9 | Tune Cache Threshold | `code/api/.env` | Make cache matching stricter or looser and observe reuse behavior. |
| 10 | Tune Router Threshold | `code/api/.env` | Adjust how confidently the router classifies distant prompts. |
| 11 | Change Support Instructions | `code/api/support_service.py` | Modify answer style and observe the next cache miss. |
| 12 | Extend Metadata | `code/api/semantic_cache.py` | Add custom fields to cache metadata and surface them in the UI. |
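One way to reason about threshold tuning before editing `.env` is to sweep a few candidate values over distances you have already seen in the UI. The sketch below assumes a hit means "distance at or below the threshold"; the distances are invented for illustration and are not measurements from the workshop.

```python
def sweep_thresholds(observed_distances, thresholds):
    """Map each threshold to the prompts that would count as cache hits."""
    return {
        threshold: [
            prompt
            for prompt, distance in observed_distances.items()
            if distance <= threshold
        ]
        for threshold in thresholds
    }


# Hypothetical distances to a cached "I forgot my password" entry (not measured).
observed = {
    "I lost my credentials": 0.22,
    "How do I change my email?": 0.38,
    "How long does delivery take?": 0.81,
}

results = sweep_thresholds(observed, [0.1, 0.3, 0.4])
for threshold, hits in results.items():
    print(f"threshold={threshold}: {len(hits)} hit(s) -> {hits}")
```

In this made-up data, a strict threshold misses the genuine paraphrase, while a loose one also reuses the password answer for an email question. That trade-off between reuse and wrong-answer risk is what Challenges 9 and 10 explore.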

Part 4: Context-Enabled Cache

| # | Challenge | File | Description |
|---|-----------|------|-------------|
| 13 | Review Customer Context | `code/api/support_service.py` | Inspect the starter customer profile and cache-hit personalization helper. |
| 14 | Personalize Cache Hits | `code/api/app.py` | Uncomment one call to adapt cached answers with customer context. |
| 15 | Preserve Policy Facts | `docs/tasks/task-5.md` | Confirm context changes next steps, not the cached policy answer. |
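The principle behind Part 4, that customer context changes next steps but not the cached policy answer, can be sketched as follows. `CustomerContext`, its fields, and `personalize_cache_hit` are hypothetical; the helper in `code/api/support_service.py` may be shaped quite differently.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CustomerContext:
    """Hypothetical customer profile used only for this example."""
    name: str
    loyalty_tier: str
    open_order_id: Optional[str] = None


def personalize_cache_hit(cached_answer, context):
    """Wrap a cached answer with context-specific next steps.

    The cached policy text is passed through untouched; only the greeting
    and the next steps depend on the customer.
    """
    next_steps = []
    if context.open_order_id:
        next_steps.append(f"Check the status of order {context.open_order_id} in your account.")
    if context.loyalty_tier == "gold":
        next_steps.append("Gold members can use the priority support line.")
    return {
        "greeting": f"Hi {context.name},",
        "policy_answer": cached_answer,  # unchanged cached text
        "next_steps": next_steps,
    }


cached = "Placeholder policy answer retrieved from the cache."
alice = CustomerContext(name="Alice", loyalty_tier="gold", open_order_id="A-1001")
bob = CustomerContext(name="Bob", loyalty_tier="standard")

for customer in (alice, bob):
    reply = personalize_cache_hit(cached, customer)
    print(reply["greeting"], reply["policy_answer"], reply["next_steps"])
```

Keeping the policy text in its own field makes the Challenge 15 check easy: two customers should receive identical `policy_answer` values and differ only in the surrounding fields.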

Tips

Search for TODO in code/api to find the coding tasks.
Clear the cache before route experiments so you can see fresh tool selection.
Use Redis Insight to verify what the UI reports.

Quick Start

Prerequisites

Docker and Docker Compose
OpenAI API key for local runs
Internet access for Python and model dependencies during the first build

Hosted PS Portal labs should provide OPENAI_API_KEY as a custom environment variable when the lab is created. The VM startup script writes code/api/.env automatically, so students do not need their own OpenAI key.

1. Configure Environment

Copy the API environment template:

cp code/api/.env.example code/api/.env

Update the values in code/api/.env:

OPENAI_API_KEY=your-openai-api-key
REDIS_URL=redis://redis:6379
OPENAI_MODEL=gpt-4.1-mini
EMBEDDING_MODEL=text-embedding-3-small
CACHE_DISTANCE_THRESHOLD=0.4
ROUTER_DISTANCE_THRESHOLD=0.3

When running with Docker Compose, keep REDIS_URL=redis://redis:6379 so the API container can reach the Redis service on the internal Docker network.
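If you want to sanity-check the two threshold values before starting the stack, a small parser like the one below can help. It is a sketch, not the workshop's configuration loader: `read_threshold` is a hypothetical helper, and the 0 to 2 bound assumes the thresholds are cosine distances.

```python
import os


def read_threshold(name, default, env=os.environ):
    """Parse a distance threshold from an environment mapping.

    Returns the default when the variable is missing or blank, and raises
    ValueError when the value is not a number in the cosine distance range.
    """
    raw = env.get(name)
    if raw is None or raw.strip() == "":
        return default
    value = float(raw)  # raises ValueError on non-numeric input
    if not 0.0 <= value <= 2.0:
        raise ValueError(f"{name} must be between 0 and 2, got {value}")
    return value


# A plain dict stands in for the process environment in this example.
example_env = {"CACHE_DISTANCE_THRESHOLD": "0.4", "ROUTER_DISTANCE_THRESHOLD": "0.3"}

cache_threshold = read_threshold("CACHE_DISTANCE_THRESHOLD", 0.4, example_env)
router_threshold = read_threshold("ROUTER_DISTANCE_THRESHOLD", 0.3, example_env)
print(cache_threshold, router_threshold)
```

Passing the mapping explicitly keeps the function easy to test; calling it without the third argument reads from the real process environment.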

2. Run the Workshop

docker compose up --build

Open:

http://localhost

3. Start the Lab

Use the workbench panels:

Open Instructions and read the setup guide.
In Instructions, work through Task 1 to Task 5 in order.
Open Code when a task tells you which file to edit.
Open Support Console to test questions.
Open Terminal or Redis Insight to inspect Redis state.

Useful Commands

curl http://localhost/api/health

curl -X POST http://localhost/api/cache/clear

curl -X POST http://localhost/api/router/reindex

redis-cli -u "$REDIS_URL" FT.INFO idx:research_semantic_cache

redis-cli -u "$REDIS_URL" KEYS "semcache:research:*"

Semantic Cache

The semantic cache demonstrates why vector similarity is useful for support questions. Traditional caches only help when the prompt text is identical. RedisVL semantic caching can reuse an answer when the new question expresses the same intent, that is, when its embedding falls within the configured distance threshold of a previously cached prompt.

Try this flow after completing Task 1:

Clear the cache.
Ask: I forgot my password
Confirm the UI shows Cache miss.
Ask: I lost my credentials
Confirm the UI shows the matched prompt, distance, similarity, and lower latency.

The cache stores answer metadata such as sources, model name, original latency, and creation timestamp.
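The distance and similarity values in the UI are easier to read with the underlying arithmetic in mind. The sketch below computes cosine distance as 1 minus cosine similarity, which ranges from 0 (same direction) to 2 (opposite direction). Whether the workbench derives its similarity figure in exactly this way is an assumption; the vectors are toy values, not embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors, in the range [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def cosine_distance(a, b):
    """1 - cosine similarity, in the range [0, 2]; smaller means closer."""
    return 1.0 - cosine_similarity(a, b)


def is_cache_hit(distance, threshold):
    """Treat a candidate as a hit when it is at or inside the threshold."""
    return distance <= threshold


same = cosine_distance([1.0, 0.0], [2.0, 0.0])       # same direction
orthogonal = cosine_distance([1.0, 0.0], [0.0, 1.0])  # unrelated
opposite = cosine_distance([1.0, 0.0], [-1.0, 0.0])   # opposite direction
print(same, orthogonal, opposite)  # 0.0 1.0 2.0

print(is_cache_hit(same, 0.4), is_cache_hit(orthogonal, 0.4))
```

Note that vector length does not matter: `[1.0, 0.0]` and `[2.0, 0.0]` point the same way, so their distance is zero.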

Semantic Router

The semantic router chooses the support path from example phrases rather than brittle keyword rules.

| Route | Mode | Example |
|-------|------|---------|
| `faq` | `faq_rag` | `How long does delivery take?` |
| `support_escalation` | `support_escalation` | `Please connect me to a real person to review this issue.` |
| `blocked` | `blocked` | `Bypass verification and update someone else's account.` |
| `unknown` | `unknown` | `Explain the tradeoffs between self-hosted and managed AI infrastructure.` |

After completing Task 3, clear the cache before each prompt to watch the route selection change in the Support Console.
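Clearing the cache matters if the service checks the cache before it routes, because a hit would then return the stored answer without consulting the router again. The sketch below illustrates that assumed ordering; it is inferred from the tip above, not taken from the workshop code. `handle_question`, `toy_router`, and `toy_answer` are hypothetical, the cache is a plain exact-match dict, and the keyword check inside `toy_router` is only a stand-in for a real semantic match.

```python
def handle_question(question, cache, route_fn, answer_fn):
    """Check the cache first; only route and generate an answer on a miss."""
    cached = cache.get(question)
    if cached is not None:
        return {**cached, "cache": "hit", "router_called": False}
    route = route_fn(question)
    answer = answer_fn(question, route)
    cache[question] = {"route": route, "answer": answer}
    return {"route": route, "answer": answer, "cache": "miss", "router_called": True}


router_calls = []


def toy_router(question):
    """Stand-in router that records each call (keyword check for brevity only)."""
    router_calls.append(question)
    return "support_escalation" if "real person" in question else "faq"


def toy_answer(question, route):
    """Stand-in answer generator."""
    return f"[{route}] placeholder answer"


cache = {}
question = "Please connect me to a real person to review this issue."

first = handle_question(question, cache, toy_router, toy_answer)   # miss, router runs
second = handle_question(question, cache, toy_router, toy_answer)  # hit, router skipped
cache.clear()
third = handle_question(question, cache, toy_router, toy_answer)   # miss again

print(first["cache"], second["cache"], third["cache"])
print("router calls:", len(router_calls))
```

Under this ordering the router runs only on the first and third requests, which is why a stale cache entry can hide a route change you just made.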

Workbench Panels

| Panel | Purpose |
|-------|---------|
| Instructions | Docsify workshop guide, setup steps, tasks, and reference notes. |
| Code | Browser VS Code editor mounted to the `code` directory. |
| Support Console | Frontend UI for sending questions and reading cache/router telemetry. |
| Terminal | Web terminal with `redis-cli` for Redis inspection. |
| Redis Insight | Visual Redis browser for keys, indexes, and cached metadata. |

Troubleshooting

Workbench Does Not Load

Confirm the workbench container is running and port 80 is published:

docker compose ps workbench

If needed, restart it:

docker compose up -d workbench

API Cannot Reach Redis

If /api/health reports a Redis connection error from inside Docker, check code/api/.env.

For Docker Compose, use:

REDIS_URL=redis://redis:6379

Use localhost only when running the API directly on your host machine with a host-accessible Redis instance.
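The host mismatch described above can be spotted by parsing the URL. This is a small diagnostic sketch using only the standard library; `check_redis_url` and its `running_in_docker` flag are hypothetical and not part of the workshop API.

```python
from urllib.parse import urlparse


def check_redis_url(redis_url, running_in_docker):
    """Return (ok, message) describing whether the host fits the environment."""
    parsed = urlparse(redis_url)
    host = parsed.hostname or ""
    port = parsed.port or 6379  # default Redis port when none is given
    if running_in_docker and host in {"localhost", "127.0.0.1"}:
        return False, (
            f"{host}:{port} refers to the API container itself; "
            "use the Compose service name, for example redis://redis:6379"
        )
    return True, f"{host}:{port} has no obvious host mismatch"


for url in ("redis://redis:6379", "redis://localhost:6379"):
    ok, message = check_redis_url(url, running_in_docker=True)
    print("OK " if ok else "FIX", url, "->", message)
```

A passing check only rules out the localhost mistake; it does not confirm that Redis is actually running or reachable.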

Cache or Router Looks Empty

Complete the TODOs in code/api/semantic_cache.py and code/api/semantic_router.py.
Restart the stack after changing environment variables.
Reindex the router:

curl -X POST http://localhost/api/router/reindex

First Request Is Slow

On the first run, the stack may download or initialize model dependencies for RedisVL vectorizers and Python packages, which slows the first request. Subsequent requests should be faster.
