Add Docker + Kubernetes deployment stack with autoscaling and worker isolation #78
Conversation
Review skipped: auto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI.
Merged commit 9a5e283 into codex/fix-remaining-issues-and-raise-pr
```python
while RUNNING:
    # Placeholder for queue-based execution workers.
    # This keeps the worker pool isolated from API pods.
    print("worker-heartbeat")
    time.sleep(15)
```
🟡 Graceful shutdown delayed up to 15 seconds because time.sleep() auto-retries after signal (PEP 475)
The worker's graceful shutdown mechanism doesn't work promptly. When SIGTERM/SIGINT is received during time.sleep(15), the signal handler sets RUNNING = False, but due to PEP 475 (Python 3.5+), time.sleep() automatically retries for the remaining duration after the signal handler returns. The while RUNNING condition is not re-checked until the full 15-second sleep completes.
Root Cause and Verification
PEP 475 modified the standard library to automatically retry system calls that are interrupted by signals (EINTR). This means time.sleep(15) will resume sleeping for the remaining time after the _shutdown_handler sets RUNNING = False.
Verified empirically: a time.sleep(5) interrupted by a signal after 1 second still sleeps the full 5 seconds, even though the signal handler ran at the 1-second mark.
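A minimal standalone repro of that verification (assumed, not part of the PR; signal.alarm is POSIX-only):

```python
# SIGALRM fires after 1 second and the handler runs, yet time.sleep(5)
# still blocks for the full 5 seconds: PEP 475 retries the interrupted
# sleep for the remaining duration once the handler returns.
import signal
import time

start = time.monotonic()

def handler(signum, frame):
    print(f"handler ran at t={time.monotonic() - start:.1f}s")   # ~1.0s

signal.signal(signal.SIGALRM, handler)
signal.alarm(1)  # deliver SIGALRM one second from now

time.sleep(5)    # resumes after the handler returns
print(f"sleep returned at t={time.monotonic() - start:.1f}s")   # ~5.0s
```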
Actual behavior: Worker takes up to 15 seconds to shut down after receiving SIGTERM, because time.sleep(15) at backend/worker.py:25 resumes after the signal handler completes.
Expected behavior: Worker should exit promptly (within milliseconds) after receiving SIGTERM.
Impact: In Kubernetes, this means pod termination is delayed by up to 15 seconds on every rolling update or scale-down. While this is within the default 30-second terminationGracePeriodSeconds, it unnecessarily slows deployments and wastes resources. If the sleep interval were increased (e.g., to 60 seconds), it could exceed the grace period and cause forced kills (SIGKILL).
Fix: Use threading.Event.wait() instead of time.sleep(); Event.wait() returns immediately once the event is set:
```python
import threading

_stop_event = threading.Event()

def _shutdown_handler(signum, frame):
    _stop_event.set()

while not _stop_event.is_set():
    print("worker-heartbeat")
    _stop_event.wait(15)
```
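For context, a fuller sketch of how this would wire together in backend/worker.py; the handler name and the 15-second heartbeat interval come from the diff above, while the signal registration lines are assumptions about the surrounding file:

```python
import signal
import threading

_stop_event = threading.Event()

def _shutdown_handler(signum, frame):
    # Setting the event wakes any pending _stop_event.wait() immediately.
    _stop_event.set()

# Assumed registration; the PR's worker.py presumably does something similar.
signal.signal(signal.SIGTERM, _shutdown_handler)
signal.signal(signal.SIGINT, _shutdown_handler)

while not _stop_event.is_set():
    print("worker-heartbeat")
    # Unlike time.sleep(15), wait() returns as soon as the event is set.
    _stop_event.wait(15)
```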
Prompt for agents

In backend/worker.py, replace the time.sleep-based loop with a threading.Event-based approach for prompt graceful shutdown. Specifically:

1. At the top of the file (around lines 7-8), replace `RUNNING = True` with:

```python
import threading

_stop_event = threading.Event()
```

2. Change the _shutdown_handler function (lines 11-13) to:

```python
def _shutdown_handler(signum, frame):
    _stop_event.set()
```

3. Change the main loop (lines 21-25) from:

```python
while RUNNING:
    print("worker-heartbeat")
    time.sleep(15)
```

to:

```python
while not _stop_event.is_set():
    print("worker-heartbeat")
    _stop_event.wait(15)
```

4. Remove the `import time` if it is no longer needed, and remove the `RUNNING` global variable.

The threading.Event.wait() method returns immediately when the event is set, unlike time.sleep(), which auto-retries after signal interruption due to PEP 475.
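A counterpart to the earlier repro (again assumed and standalone, POSIX-only): the same 1-second SIGALRM now interrupts the wait immediately, because the handler sets the event and wait() returns as soon as the event becomes set.

```python
import signal
import threading
import time

stop = threading.Event()
signal.signal(signal.SIGALRM, lambda signum, frame: stop.set())
signal.alarm(1)  # deliver SIGALRM one second from now

start = time.monotonic()
stop.wait(5)     # returns as soon as the handler sets the event
print(f"wait returned at t={time.monotonic() - start:.1f}s")  # ~1.0s
```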
Motivation
Description
- Added Dockerfile.backend, Dockerfile.frontend, and .dockerignore to build the backend and frontend images and reduce build contexts.
- Added docker-compose.yml, which runs backend, frontend, and an isolated worker service using the backend.worker entrypoint.
- Added backend/worker.py with graceful shutdown handling intended for Kubernetes worker pools.
- Added manifests under deploy/k8s (namespace.yaml, backend.yaml, frontend.yaml, worker.yaml, autoscaling.yaml, kustomization.yaml) that include a rolling update strategy, readiness/liveness probes, resource requests/limits, nodeSelector/tolerations for worker isolation, and HPAs for autoscaling.
- Added deploy/README.md with usage instructions and managed-cluster guidance for AWS EKS, Google GKE, and Azure AKS, and linked a short deployment note into the repo README.md.

Testing
- Ran python -m py_compile backend/worker.py and it completed successfully.
- Parsed the deploy/k8s manifests with yaml.safe_load_all (script run via python - <<'PY' ... PY) and confirmed no YAML parsing errors; a sketch of such a script follows.
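A minimal sketch of that kind of validation script, assuming the manifests live under deploy/k8s as described above (the exact script run during testing was not included in the PR):

```python
# Hypothetical reconstruction of the YAML validation step: parse every
# manifest under deploy/k8s with yaml.safe_load_all and report any errors.
import pathlib
import sys

import yaml  # PyYAML

failed = False
for manifest in sorted(pathlib.Path("deploy/k8s").glob("*.yaml")):
    try:
        docs = list(yaml.safe_load_all(manifest.read_text()))
        print(f"{manifest}: {len(docs)} document(s) parsed OK")
    except yaml.YAMLError as exc:
        print(f"{manifest}: parse error: {exc}", file=sys.stderr)
        failed = True

sys.exit(1 if failed else 0)
```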