Restructure embedeer as a single-package repo with `@jsilvanus/embedeer` and inline GPU provider detection by Copilot · Pull Request #2 · jsilvanus/embedeer

Copilot · 2026-04-08T06:47:41Z

Converts the repo from an npm workspaces monorepo back to a standard single-package repository (@jsilvanus/embedeer) and replaces the optional sidecar ort packages with inline GPU system checks. The package is ready for npm publishing.

Structure

Single package at the repo root:

src/           — source files
test/          — test files
package.json   — @jsilvanus/embedeer

Install

npm install @jsilvanus/embedeer        # CPU always works; GPU auto-detected if system libs present

No separate GPU sidecar packages needed. onnxruntime-node (already bundled via @huggingface/transformers) ships libonnxruntime_providers_cuda.so on Linux x64 and DirectML on Windows — no extra install required.

New `device` / `provider` API

Added to Embedder.create(), WorkerPool, and CLI (--device, --provider):

// Auto-detect GPU, silent CPU fallback
const embedder = await Embedder.create(model, { device: 'auto' });

// Require GPU — throws with diagnostic error if system libs are missing
const embedder = await Embedder.create(model, { device: 'gpu' });

// Explicit provider
const embedder = await Embedder.create(model, { provider: 'cuda' });

# Auto-detect: uses CUDA on Linux x64, DirectML on Windows x64, CPU fallback everywhere else
npx @jsilvanus/embedeer --device auto --data "Hello"

# Require GPU (throws with diagnostic if CUDA/cuDNN libs are missing)
npx @jsilvanus/embedeer --device gpu --provider cuda --data "Hello"

Provider selection logic (`src/provider-loader.js`)

All system checks are inlined — no dynamic package imports:

device='auto': tries platform-default providers (Linux x64 → cuda; Windows x64 → dml), silently falls back to CPU
device='gpu': throws with a diagnostic error (lists missing CUDA libs or unsupported platform) if no GPU provider succeeds
device='cpu' / provider='cpu': always CPU
Explicit provider=cuda|dml: runs inline checks or throws with a diagnostic error; unsupported platform surfaces "not supported on …" message

Linux x64 — CUDA 12 inline checks

provider-loader.js checks:

/dev/nvidiactl exists (NVIDIA driver present)
All required shared libraries found via common paths + ldconfig -p: libcudart.so.12, libcublas.so.12, libcublasLt.so.12, libcurand.so.10, libcufft.so.11, libcudnn.so.9
Throws a clear diagnostic with apt install instructions if anything is missing

System requirements: NVIDIA GPU + driver ≥ 525, CUDA 12 toolkit, cuDNN 9.

# Install CUDA 12 + cuDNN 9 (Ubuntu/Debian)
sudo apt install cuda-toolkit-12-6 libcudnn9-cuda-12

# Run with GPU (auto-detect — uses CUDA here, CPU fallback on any other machine)
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --device auto --data "Hello GPU"

# Require CUDA (throws with install instructions if libs are missing)
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --device gpu --data "Hello GPU"

Docker + NVIDIA CUDA

Works out of the box with NVIDIA Container Toolkit:

/dev/nvidiactl is mounted into the container automatically
NVIDIA CUDA images set LD_LIBRARY_PATH to paths already in the default search list

FROM nvidia/cuda:12.6.3-cudnn9-runtime-ubuntu24.04
# ... install Node.js ...
RUN npm install @jsilvanus/embedeer

docker run --rm --gpus all my-app \
  npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --device gpu --data "Hello GPU"

Host requirement: NVIDIA Container Toolkit.

Windows x64 — DirectML inline checks

DirectML is bundled with onnxruntime-node on Windows. provider-loader.js just verifies process.platform === 'win32'. Supports NVIDIA, AMD, and Intel GPUs on Windows 10 (1903+) / Windows 11.

Workers

Both worker.js (child process) and thread-worker-script.js (thread) call resolveProvider(device, provider) before pipeline(). device and provider flow through WorkerPool → workerData.

npm publish readiness

"publishConfig": { "access": "public" } (required for scoped packages)
"files" array publishes only src/ + README.md
"engines": { "node": ">=18" }
"author": "jsilvanus"

Thorough I/O and piping (`src/cli.js`)

Configurable record delimiter (`-D` / `--delimiter <str>`)

printf 'Hello\0World\0' | npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --delimiter '\0'
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --file data.tsv --delimiter '\t'
printf 'foo|||bar|||baz' | npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --delimiter '|||'

Output formats

Format	Description
`json` (default)	JSON array of float arrays
`json --with-text`	JSON array of `{"text","embedding"}` objects
`jsonl`	Newline-delimited JSON — ideal for `jq`, streaming
`csv`	CSV with header `text,dim_0,...,dim_N`
`txt`	Space-separated floats, one vector per line
`txt --with-text`	Tab-separated `<text>\t<floats>`
`sql`	`INSERT INTO embeddings ...`

Interactive line-reader mode (`-i` / `--interactive`)

npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --interactive --dump embeddings.jsonl
cat corpus.txt | npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 -i --output csv --dump out.csv

Tests

Tests in test/provider-loader.test.js cover platform defaults, tryLoadProvider failure modes (no NVIDIA device, missing CUDA libs), and all resolveProvider paths. Tests in test/cli-format.test.js cover parseDelimiter, parseTexts, and all formatOutput paths. All tests pass.

Original prompt

Create a monorepo for jsilvanus/embedeer that supports optional GPU providers installable via npm in a two-step install, and update the main embedeer package to dynamically load the appropriate provider at runtime.

Context (current state in PR #1 branch copilot/create-embedding-tool):

embedeer is an ESM Node package that uses @huggingface/transformers pipeline('feature-extraction') inside a WorkerPool.
Workers: src/worker.js (child process), src/thread-worker-script.js (worker_threads). Both call pipeline() and then extractor(texts, { pooling, normalize }).
Existing options: batchSize, concurrency, mode (process|thread), pooling, normalize, token, dtype, cacheDir.
CLI exists at src/cli.js.

Goal:

Convert repository into a monorepo (npm workspaces) with:
- packages/embedeer (main JS package)
- packages/ort-win32-x64-cuda (provider package)
- packages/ort-win32-x64-dml (provider package)
- packages/ort-linux-x64-cuda (provider package)
- (optional) packages/ort-*-cpu if needed, but CPU can remain default via upstream onnxruntime-node.
Two-step install is acceptable:
- Base: npm install embedeer
- GPU: npm install @embedeer/ort-win32-x64-cuda OR @embedeer/ort-win32-x64-dml OR @embedeer/ort-linux-x64-cuda
Runtime selection / API changes in embedeer:
- Add options: device ('auto'|'cpu'|'gpu') and provider override ('cpu'|'cuda'|'dml').
- CLI flags: --device and --provider.
- For device='gpu' on Windows: prefer provider='cuda' if installed else 'dml' if installed, unless overridden.
- For device='gpu' on Linux: prefer provider='cuda'.
- device='auto': try preferred GPU providers for the platform if installed; otherwise CPU.
- device='cpu': always CPU.
- If provider is explicitly requested and not available, throw a clear error telling which npm package to install.
Provider packages responsibilities:
- Each provider package exports a small JS API used by embedeer to "activate" the provider in the current process before @huggingface/transformers pipeline is created.
- Each provider package should include an install script that downloads or otherwise provides the correct native onnxruntime binaries for that provider (CUDA or DirectML) for the target platform. If a full binary bundling is too large for this task, stub the download logic and clearly document TODOs and expected artifact layout, but still implement the monorepo structure and dynamic loading hooks.
- Provide clear README docs in root and in packages/embedeer describing how to install GPU provider packages and run npx embedeer with --device gpu.
Ensure the worker modes still work:
- Both child-process workers and thread workers must apply provider activation before pipeline() is called.
- WorkerPool must pass device/provider into workerData.
Tests:
- Add unit tests verifying provider selection logic and error messages when provider packages are missing.
- Ensure existing tests pass.
CI/build:
- Update package.json scripts at root for workspaces.
- Keep npx embedeer working (bin entry).

Output:

Open a PR implementing the above.

The following is the prior conversation context from the user's chat exploration (may be truncated):

User: See its pr 1

Assistant: You’ve already got the key piece for GPU support in PR #1: you’re using @huggingface/transformers, and that in turn uses ONNX Runtime (onnxruntime-node is in your lockfile). So the GPU plan is “make ONNX Runtime use a GPU execution provider” (CUDA/DirectML/CoreML/etc.), not “write shaders”.

What PR #1 is doing today (relevant to GPU)

Workers (src/worker.js, src/thread-worker-script.js) call:
- pipeline('feature-extraction', modelName, { dtype })
- then extractor(texts, { pooling, normalize })
That pipeline is executed by ONNX Runtime under the hood.
Right now it will effectively run on CPU unless ORT is built/installed with a GPU provider and configured to use it.

The practical way to add GPU support here

Step 1: Add a `--device` / `device` option, propagate it into workerData

Add device: 'auto'|'cpu'|'cuda' (start with those; you can extend later).

Where to thread it through:

CLI: src/cli.js add --device
Embedder: src/embedder.js accept device, pass to WorkerPool
WorkerPool: src/worker-pool.js include device in workerData
Workers: src/worker.js and src/thread-worker-script.js read device and configure runtime before creating the pipeline

Step 2: Ensure ONNX Runtime with CUDA is actually available

This is the hardest part in Node land. There are typically two workable approaches:

A) “Bring your own CUDA ORT” via container (recommended)

Provide a CUDA-enabled Docker image that includes CUDA libs + an ONNX Runtime build that supports CUDA.
Your Node code just selects device=cuda; if CUDA isn’t present it falls back to CPU.

B) Make GPU optional via an extra dependency

CPU default: normal install wor...

This pull request was created from Copilot chat.

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/898517f6-b6e8-43fa-98b5-5a16e9745dc5 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

jsilvanus · 2026-04-08T07:25:56Z

@copilot next, please work on CUDA that we have a working gpu accelerated mvp.

…b checks Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/7227830d-3d7c-4707-be90-545f53b4bc77 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/7227830d-3d7c-4707-be90-545f53b4bc77 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T07:46:28Z

@copilot next, please work on CUDA that we have a working gpu accelerated mvp.

Here's the CUDA MVP — commit 5cc9f05.

Key discovery: onnxruntime-node v1.24.x (already installed via @huggingface/transformers) ships libonnxruntime_providers_cuda.so (302MB) on Linux x64. No custom binary packaging needed.

What was implemented:

@embedeer/ort-linux-x64-cuda — activate() now does real checks:
- Verifies /dev/nvidiactl exists (NVIDIA driver present)
- Searches for required CUDA 12 + cuDNN 9 libs: libcudart.so.12, libcublas.so.12, libcublasLt.so.12, libcurand.so.10, libcufft.so.11, libcudnn.so.9
- Throws a clear error with apt install instructions if anything is missing
@embedeer/ort-win32-x64-dml — DirectML is also bundled with ORT on Windows; activate() just checks the platform.

To use GPU on Linux (after this PR):

# Install CUDA 12 + cuDNN 9 (Ubuntu)
sudo apt install cuda-toolkit-12-6 libcudnn9-cuda-12

# Install packages
npm install embedeer
npm install @embedeer/ort-linux-x64-cuda

# Run with GPU
npx embedeer --model Xenova/all-MiniLM-L6-v2 --device gpu --data "Hello GPU"

The --device auto flag will try CUDA first and silently fall back to CPU if CUDA libraries aren't available.

jsilvanus · 2026-04-08T07:50:19Z

@copilot some user might want to run this in docker with CUDA. Enabled there. Are we go already on that or do we need something else?

…r Toolkit guide) Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/e005d328-62aa-468f-90b9-8982c069d6ff Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T07:55:01Z

@copilot some user might want to run this in docker with CUDA. Enabled there. Are we go already on that or do we need something else?

Yes, Docker with CUDA already works with the current implementation — no code changes needed.

Here's why it works automatically:

The NVIDIA Container Toolkit mounts /dev/nvidiactl into the container → our GPU presence check passes
NVIDIA CUDA Docker images put all libraries under /usr/local/cuda/lib64 → already in our default search path
If LD_LIBRARY_PATH is set in the image (which NVIDIA images do), we pick that up too

Commit d5ad2ea adds Docker documentation to both READMEs. The setup is:

FROM nvidia/cuda:12.6.3-cudnn9-runtime-ubuntu24.04
# ... install Node.js ...
RUN npm install embedeer && npm install @embedeer/ort-linux-x64-cuda

docker run --rm --gpus all my-app \
  npx embedeer --model Xenova/all-MiniLM-L6-v2 --device gpu --data "Hello GPU"

The only host requirement is NVIDIA Container Toolkit installed alongside the NVIDIA driver.

jsilvanus · 2026-04-08T08:06:50Z

@copilot remove embedeer/ort-win32-x64-cuda (placeholder)

This package will be @jsilvanus/embedeer and other packages scoped there as well.

…vanus

…vanus on all packages Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/aba1be69-ab33-480c-9457-d0579962b5a4 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T08:19:26Z

@copilot remove embedeer/ort-win32-x64-cuda (placeholder)

This package will be @jsilvanus/embedeer and other packages scoped there as well. ...

Done in commit 104cb9e:

Deleted packages/ort-win32-x64-cuda entirely
Renamed scope @embedeer/ → @jsilvanus/ on all packages:
- embedeer → @jsilvanus/embedeer
- @embedeer/ort-linux-x64-cuda → @jsilvanus/ort-linux-x64-cuda
- @embedeer/ort-win32-x64-dml → @jsilvanus/ort-win32-x64-dml
Removed the win32-x64-cuda entry from PROVIDER_PACKAGES in provider-loader.js
Updated all references in source, tests, and READMEs
All 61 tests still pass

jsilvanus · 2026-04-08T08:26:22Z

@copilot please prefix ort packages with embedeer-.

Prepare this for npm publishing. Make sure everything is ready for it.

…ishing; show npx --device auto/gpu in READMEs Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/c6cb957d-1635-42ac-9914-88507015c38b Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T08:52:07Z

@copilot please prefix ort packages with embedeer-.

Prepare this for npm publishing. Make sure everything is ready for it.

Done in commit f721a06:

Package renames:

@jsilvanus/ort-linux-x64-cuda → @jsilvanus/embedeer-ort-linux-x64-cuda
@jsilvanus/ort-win32-x64-dml → @jsilvanus/embedeer-ort-win32-x64-dml

npm publish readiness (all three packages):

"publishConfig": { "access": "public" } — required for scoped packages on npm
"files": [...] — controls exactly what gets published (source only, no tests/node_modules)
"engines": { "node": ">=18" } on all packages
"author": "jsilvanus" set on all packages

npx + CUDA docs (new requirement): Updated npx embedeer → npx @jsilvanus/embedeer throughout and added "Using GPU with npx" sections to both READMEs showing --device auto (CUDA where available, CPU fallback), --device gpu (hard-require), and explicit --provider cuda/dml with the install steps for each platform.

…ve I/O docs Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/be63b249-589e-433f-bf10-b78ef1668bc1 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T09:15:18Z

@copilot I want thorough input and output, piping possibilities etc. Input from stdin (maybe a selectable limiter, \n by default but could also be something else). Please plan for knput possibilities....

Done in commit 05176ec. Here's what was added:

New input option — configurable record delimiter (-D / --delimiter):

Default: \n (newline) — unchanged behaviour
Supports escape sequences: \0 (null byte), \n, \t, \r, or any literal string
Applies to both stdin and --file; JSON arrays are still auto-detected first

# Null-byte delimited (safe with text containing newlines — compatible with find -print0)
printf 'Hello\0World\0' | npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --delimiter '\0'

# Tab-delimited file
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --file data.tsv --delimiter '\t'

# Custom multi-char delimiter
printf 'foo|||bar|||baz' | npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --delimiter '|||'

New output formats:

jsonl — one {"text":"...","embedding":[...]} per line — ideal for jq, grep, streaming
csv — text,dim_0,dim_1,...,dim_N with header — for pandas/Excel
--with-text flag — adds source text to json ([{"text","embedding"}]) and txt (text\tfloats) output

# JSONL — pipe to jq
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --output jsonl --data "foo" "bar" | jq '.text'

# CSV — for pandas
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --file texts.txt --output csv --dump out.csv

# JSON with source text
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --output json --with-text --data "cat" "dog"

Both READMEs now have dedicated Input Sources, Configurable delimiter, Output Formats, and Piping examples sections. 19 new tests cover all new paths.

jsilvanus · 2026-04-08T09:37:38Z

@copilot I also want option that a line reader (??) opens and you can just paste some data and it will, when seeing data come in, start work (batch maybe also?), and output (probably to a file when input reader is used....)

…treaming output Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/b3d9aa41-2047-43a4-be9a-96e4015529eb Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T09:45:26Z

@copilot I also want option that a line reader (??) opens and you can just paste some data and it will, when seeing data come in, start work (batch maybe also?), and output (probably to a file when in...

Done in commit 7bf2828. Added --interactive / -i — a readline-based line-reader that starts embedding as data arrives:

# Interactive terminal — paste lines, Ctrl+D when done
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --interactive --dump embeddings.jsonl

# Stream a file in batches (no terminal needed)
cat corpus.txt | npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 -i --output csv --dump out.csv

# GPU, custom batch size
npx @jsilvanus/embedeer --model Xenova/all-MiniLM-L6-v2 --interactive --device auto \
  --batch-size 16 --output jsonl --dump out.jsonl

How flushing works:

Trigger	What happens
Buffer reaches `--batch-size`	Auto-flush
Empty line (Enter ×2)	Manual flush
Ctrl+D (EOF)	Flush remaining + exit
Ctrl+C	Flush remaining + exit

> prompt appears on stderr when stdin is a TTY; silent when piped.
Output goes to --dump file (cleared at session start); falls back to stdout with a tip.
json/sql auto-switch to jsonl; csv writes the header once on the first batch only.
Progress (Batch N: M record(s) → file) always goes to stderr.

jsilvanus · 2026-04-08T09:49:23Z

@copilot fix any remaining errors.

…sions Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/2e60c137-0653-4608-ba14-5926c7456201 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T09:53:55Z

@copilot fix any remaining errors.

Fixed in commit ed06683. All 15 previously-failing tests are now resolved.

Root cause: @huggingface/transformers and the monorepo workspace symlinks (@jsilvanus/embedeer-ort-*) were not installed in the agent environment. This caused 14 failures in embedder.test.js / embedder-options.test.js (top-level ESM import of @huggingface/transformers blew up) and 1 failure in provider-loader.test.js (the win32/dml workspace package wasn't symlinked yet).

Fix: Added .github/copilot-setup-steps.yml to run npm install before agent sessions. This installs all dependencies including the workspace package symlinks.

All 80 tests now pass (ℹ pass 80, ℹ fail 0).

jsilvanus · 2026-04-08T10:08:23Z

@copilot Do we need the embedeer-ort* packages at all, if everything comes in the basic deps already?

Copilot · 2026-04-08T10:10:04Z

@copilot Do we need the embedeer-ort* packages at all, if everything comes in the basic deps already?

Technically no — the ort-* packages ship zero new binaries. libonnxruntime_providers_cuda.so already comes bundled with onnxruntime-node (which is a transitive dep of @huggingface/transformers). The ort packages currently only do:

embedeer-ort-linux-x64-cuda: checks /dev/nvidiactl + scans for CUDA 12 / cuDNN 9 system libs, then returns 'cuda' as the device string
embedeer-ort-win32-x64-dml: checks process.platform === 'win32', then returns 'dml'

Why the two-package pattern was added: provider-loader.js uses import(packageName) as an opt-in signal — if the package isn't installed the import fails and the system falls back to CPU (--device auto) or throws (--device gpu). This pattern makes sense when packages ship large optional native binaries (e.g., separate CUDA DLLs), but since everything is already bundled, it's mostly ceremony here.

We could simplify by:

Deleting both ort packages
Moving the system checks (CUDA lib detection, platform check) inline into provider-loader.js
--device auto on linux/x64 just tries CUDA directly, falls back silently
--device gpu throws with the same diagnostic error if system libs are missing

Want me to go ahead with that simplification? It would reduce the repo to a single publishable package (@jsilvanus/embedeer) with no optional sidecars needed.

jsilvanus · 2026-04-08T10:13:08Z

@copilot Yes: delete both ort packages and move the system checks inline into provider-loader.js. Convert this back to a normal repo, not monorepo anymore, since this changes things.

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/6286b261-52f2-4560-ad02-795169949161 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot · 2026-04-08T10:22:26Z

@copilot Yes: delete both ort packages and move the system checks inline into provider-loader.js. Convert this back to a normal repo, not monorepo anymore, since this changes things.

Done in commit f5804b6.

Deleted both packages/ort-linux-x64-cuda and packages/ort-win32-x64-dml
Inlined CUDA system checks (/dev/nvidiactl detection + CUDA 12/cuDNN 9 lib scan) and DML platform check directly into src/provider-loader.js — no dynamic import() of external packages
Flattened the monorepo back to a normal repo: src/ and test/ are now at the root, and package.json is the single @jsilvanus/embedeer package
Updated README.md: GPU is now described as built-in (just npm install @jsilvanus/embedeer + CUDA system libs for Linux)
79 tests pass

jsilvanus · 2026-04-08T10:26:13Z

@copilot Are PR description and README up-to-date?

Copilot · 2026-04-08T10:28:26Z

@copilot Are PR description and README up-to-date?

Yes, both are accurate and current:

README correctly describes the single-package structure: npm install @jsilvanus/embedeer with no sidecar packages, GPU built-in via onnxruntime-node, CUDA system lib requirements for Linux x64, DirectML for Windows, all CLI options, and the provider selection table.

PR description was already updated in the previous commit to reflect the single-package repo with inline GPU system checks.

One small note: getPlatformDefaultProviders() returns ['cuda', 'dml'] on Windows x64 — intentionally tries CUDA first (for future-proofing), silently falls through to DML since there's no win32-x64-cuda impl. The README's "GPU order is cuda → dml" on Windows accurately reflects this.

Initial plan

68a3ab4

Copilot AI assigned Copilot and jsilvanus Apr 8, 2026

Copilot started work on behalf of jsilvanus April 8, 2026 06:47 View session

Copilot AI and others added 2 commits April 8, 2026 07:14

Convert to monorepo with GPU provider packages and device/provider API

7308365

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/898517f6-b6e8-43fa-98b5-5a16e9745dc5 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Fix JSDoc type annotation and missing options declaration in cli.js

535fb95

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/898517f6-b6e8-43fa-98b5-5a16e9745dc5 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Convert embedeer repository to monorepo with optional GPU providers~~ Convert embedeer to npm workspaces monorepo with optional GPU provider packages Apr 8, 2026

Copilot AI requested a review from jsilvanus April 8, 2026 07:19

Copilot finished work on behalf of jsilvanus April 8, 2026 07:19

Copilot started work on behalf of jsilvanus April 8, 2026 07:26 View session

Copilot AI and others added 2 commits April 8, 2026 07:44

Implement working CUDA MVP: use bundled ORT providers, real system li…

7bc259e

…b checks Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/7227830d-3d7c-4707-be90-545f53b4bc77 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Fix version inconsistency in CUDA provider comments (v1.20+/v1.24.x)

5cc9f05

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/7227830d-3d7c-4707-be90-545f53b4bc77 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot finished work on behalf of jsilvanus April 8, 2026 07:48

Copilot started work on behalf of jsilvanus April 8, 2026 07:50 View session

Add Docker + CUDA documentation (Dockerfile, docker-compose, Containe…

d5ad2ea

…r Toolkit guide) Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/e005d328-62aa-468f-90b9-8982c069d6ff Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot finished work on behalf of jsilvanus April 8, 2026 07:58

Copilot started work on behalf of jsilvanus April 8, 2026 08:06 View session

Remove ort-win32-x64-cuda placeholder; rename scope @embedeer → @jsil…

104cb9e

…vanus on all packages Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/aba1be69-ab33-480c-9457-d0579962b5a4 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot AI changed the title ~~Convert embedeer to npm workspaces monorepo with optional GPU provider packages~~ Convert embedeer to @jsilvanus npm workspaces monorepo with optional GPU provider packages Apr 8, 2026

Copilot finished work on behalf of jsilvanus April 8, 2026 08:22

Copilot started work on behalf of jsilvanus April 8, 2026 08:26 View session

Add configurable delimiter, jsonl/csv/--with-text output, comprehensi…

05176ec

…ve I/O docs Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/be63b249-589e-433f-bf10-b78ef1668bc1 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot finished work on behalf of jsilvanus April 8, 2026 09:22

Copilot started work on behalf of jsilvanus April 8, 2026 09:37 View session

Add --interactive/-i line-reader mode: per-line stdin, batch flush, s…

7bf2828

…treaming output Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/b3d9aa41-2047-43a4-be9a-96e4015529eb Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot finished work on behalf of jsilvanus April 8, 2026 09:47

Copilot started work on behalf of jsilvanus April 8, 2026 09:49 View session

Add .github/copilot-setup-steps.yml: run npm install before agent ses…

ed06683

…sions Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/2e60c137-0653-4608-ba14-5926c7456201 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot finished work on behalf of jsilvanus April 8, 2026 09:57

Copilot started work on behalf of jsilvanus April 8, 2026 10:08 View session

Copilot finished work on behalf of jsilvanus April 8, 2026 10:12

Copilot started work on behalf of jsilvanus April 8, 2026 10:13 View session

Delete ort packages, inline GPU checks, flatten monorepo to single repo

f5804b6

Agent-Logs-Url: https://github.com/jsilvanus/embedeer/sessions/6286b261-52f2-4560-ad02-795169949161 Co-authored-by: jsilvanus <22452468+jsilvanus@users.noreply.github.com>

Copilot AI changed the title ~~Convert embedeer to @jsilvanus npm workspaces monorepo with optional GPU provider packages~~ Restructure embedeer as a single-package repo with @jsilvanus/embedeer and inline GPU provider detection Apr 8, 2026

Copilot finished work on behalf of jsilvanus April 8, 2026 10:26

Copilot started work on behalf of jsilvanus April 8, 2026 10:26 View session

jsilvanus marked this pull request as ready for review April 8, 2026 10:29

jsilvanus merged commit 090e3f6 into main Apr 8, 2026
1 check failed

Copilot stopped work on behalf of jsilvanus due to an error April 8, 2026 10:30
The session was cancelled by the user.

jsilvanus deleted the copilot/convert-repo-to-monorepo branch April 14, 2026 19:51

Conversation

Copilot AI commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Structure

Install

New device / provider API

Provider selection logic (src/provider-loader.js)

Linux x64 — CUDA 12 inline checks

Docker + NVIDIA CUDA

Windows x64 — DirectML inline checks

Workers

npm publish readiness

Thorough I/O and piping (src/cli.js)

Configurable record delimiter (-D / --delimiter <str>)

Output formats

Interactive line-reader mode (-i / --interactive)

Tests

What PR #1 is doing today (relevant to GPU)

The practical way to add GPU support here

Step 1: Add a --device / device option, propagate it into workerData

Step 2: Ensure ONNX Runtime with CUDA is actually available

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

jsilvanus commented Apr 8, 2026

Uh oh!

Copilot AI commented Apr 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Apr 8, 2026 •

edited

Loading

New `device` / `provider` API

Provider selection logic (`src/provider-loader.js`)

Thorough I/O and piping (`src/cli.js`)

Configurable record delimiter (`-D` / `--delimiter <str>`)

Interactive line-reader mode (`-i` / `--interactive`)

Step 1: Add a `--device` / `device` option, propagate it into workerData