Feature/ai model training #17
Introduces a new training_config.yaml file specifying model, LoRA, training, dataset, and generation settings for the Lab68Dev AI model. This configuration will be used to control training and inference parameters.
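The actual keys of `training_config.yaml` are not shown in this PR, but a config with the sections described above (model, LoRA, training, dataset, generation) might be loaded as in this sketch. All field names and values here are illustrative assumptions, not the real file:

```python
# Sketch: loading a training_config.yaml-style file with PyYAML.
# Every key and value below is an assumed example.
import yaml

EXAMPLE_CONFIG = """
model:
  base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
lora:
  r: 16
  alpha: 32
  dropout: 0.05
training:
  epochs: 3
  learning_rate: 2.0e-4
dataset:
  train_file: data/train.jsonl
  val_file: data/val.jsonl
generation:
  max_new_tokens: 256
  temperature: 0.7
"""

config = yaml.safe_load(EXAMPLE_CONFIG)
print(config["lora"]["r"])                # 16
print(config["dataset"]["train_file"])    # data/train.jsonl
```

Centralizing hyperparameters this way lets `train.py` and the inference server share a single source of truth instead of hard-coding values.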
Introduces generate_dataset.py to create synthetic training and validation datasets for the Lab68Dev AI model. The script generates structured task and Q&A examples, formats them for TinyLlama chat, and saves them as JSONL files for model training.
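As a rough illustration of what the description above implies (not the script's actual code), wrapping a Q&A pair in the TinyLlama (Zephyr-style) chat template and writing it out as JSONL could look like this; the `text` field name and the template details are assumptions:

```python
# Sketch of the formatting step described for generate_dataset.py:
# wrap a prompt/response pair in the TinyLlama chat template and
# emit one JSON object per line (JSONL).
import json

def format_tinyllama(user: str, assistant: str,
                     system: str = "You are a helpful software development assistant.") -> str:
    # Zephyr-style template used by TinyLlama-1.1B-Chat (assumed here).
    return (
        f"<|system|>\n{system}</s>\n"
        f"<|user|>\n{user}</s>\n"
        f"<|assistant|>\n{assistant}</s>"
    )

examples = [
    {"text": format_tinyllama(
        "What does LoRA stand for?",
        "Low-Rank Adaptation, a parameter-efficient fine-tuning method.")},
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")
```

JSONL is convenient here because Hugging Face `datasets` can stream it line by line during training.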
Introduces generate_dataset_backup.py for creating synthetic training data for task creation and technical Q&A. The script generates structured prompts and responses for software development tasks and technical explanations, supporting AI model training.
Implements a FastAPI server to serve the Lab68Dev AI model with endpoints for health checks, text generation, and task creation. Loads model and tokenizer on startup, supports CORS, and provides structured request/response models.
Introduces documentation for setting up, training, and running the custom NLP model, including hardware requirements and model details.
Introduces a requirements.txt file specifying core machine learning, inference server, utility, and testing dependencies for the ai-model project.
Introduces ai-model/train.py, a script to fine-tune TinyLlama using LoRA and 4-bit quantization for task creation and tech Q&A. The script loads configuration from YAML, sets up model and tokenizer, loads datasets, configures training arguments, and saves the trained model and tokenizer.
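The LoRA technique the script applies can be illustrated numerically: instead of updating a d x k weight matrix W directly, LoRA trains two small matrices A (r x k) and B (d x r) and uses W + (alpha / r) * (B @ A). A toy pure-Python sketch of that arithmetic (not the training script itself, which delegates this to the `peft` library):

```python
# Toy illustration of the LoRA update: W_eff = W + (alpha/r) * B @ A.
# All numbers here are made up for demonstration.
def matmul(X, Y):
    return [[sum(X[i][t] * Y[t][j] for t in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

d, k, r, alpha = 4, 4, 2, 4
W = [[1.0 if i == j else 0.0 for j in range(k)] for i in range(d)]  # frozen base weights
A = [[0.1] * k for _ in range(r)]                                   # trainable, r x k
B = [[0.5] * r for _ in range(d)]                                   # trainable, d x r

scale = alpha / r
delta = matmul(B, A)  # d x k matrix of rank <= r
W_eff = [[W[i][j] + scale * delta[i][j] for j in range(k)] for i in range(d)]

# Only A and B are trained: r*k + d*r parameters instead of d*k.
# For d = k = 4096 and r = 16 that is ~131K vs ~16.8M per matrix.
print(W_eff[0][0])  # 1.0 + 2 * (0.5*0.1 + 0.5*0.1) = 1.2
```

The 4-bit quantization mentioned in the summary is complementary: the frozen W is stored in 4-bit precision while only the small A and B matrices are kept in full precision for training.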
Eliminated the retrieval and injection of RAG context in the chat API endpoint. The endpoint now directly forwards user messages to the Ollama model without attempting to augment them with RAG context.
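The simplified flow amounts to building the Ollama chat payload directly from the user's messages. The actual route is TypeScript; this Python sketch (the model name and host are assumptions) shows the shape of the request after RAG removal:

```python
# Sketch: the chat endpoint now builds an Ollama /api/chat payload
# straight from the incoming messages, with no RAG-context injection.
import json

def build_ollama_payload(messages, model="llama3"):
    # Previously, retrieved RAG context would have been prepended to
    # the messages here; now they are forwarded as-is.
    return {"model": model, "messages": messages, "stream": False}

payload = build_ollama_payload([{"role": "user", "content": "Explain LoRA."}])
body = json.dumps(payload)  # POSTed to e.g. http://localhost:11434/api/chat
```

Dropping the retrieval step removes a network/embedding round-trip per request, at the cost of the model no longer seeing project-specific documents.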
Deleted lib/services/rag-service.ts, which contained the RAG (Retrieval-Augmented Generation) service for document embedding, storage, and retrieval. This removes all related logic for managing and searching knowledge base documents.
Deleted the 'index-knowledge' and 'index-knowledge:clear' scripts, and removed the '@xenova/transformers', 'ai', and 'chromadb' dependencies from package.json as they are no longer needed.
Deleted scripts/index-knowledge.js, which handled indexing documentation and platform features into the RAG system. This may indicate a change in how knowledge indexing is managed or a migration to a different approach.
Cleaned up the scripts section by removing an unnecessary trailing comma after the 'start:next' script.
Pull request overview
This pull request introduces a complete AI model training and inference pipeline while removing the existing RAG (Retrieval-Augmented Generation) system. The PR replaces browser/server-based RAG embeddings with a standalone Python-based training pipeline for fine-tuning TinyLlama for software development tasks.
Changes:
- Removed RAG-based knowledge base system including embeddings service, indexing scripts, and related dependencies
- Added Python-based AI model training pipeline with LoRA fine-tuning for TinyLlama
- Introduced FastAPI inference server for serving the fine-tuned model
Reviewed changes
Copilot reviewed 11 out of 11 changed files in this pull request and generated 15 comments.
| File | Description |
|---|---|
| scripts/index-knowledge.js | Removed knowledge base indexing script (RAG removal) |
| lib/services/rag-service.ts | Removed RAG embeddings and document search service |
| app/api/chat/route.ts | Removed RAG context retrieval from chat API |
| package.json | Removed RAG-related dependencies and indexing scripts; version bump to 0.1.1 |
| ai-model/train.py | New training script with LoRA configuration and 4-bit quantization |
| ai-model/requirements.txt | Python dependencies for training and inference |
| ai-model/inference/server.py | FastAPI server for model inference with generation endpoints |
| ai-model/data/generate_dataset.py | Synthetic dataset generator for training examples |
| ai-model/data/generate_dataset_backup.py | Incomplete backup dataset generator (truncated) |
| ai-model/config/training_config.yaml | Centralized training hyperparameters and model configuration |
| ai-model/README.md | Setup and usage documentation for the AI training pipeline |
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@DongDuong2001 I've opened a new pull request, #18, to work on those changes. Once the pull request is ready, I'll request review from you.
Co-authored-by: DongDuong2001 <64120873+DongDuong2001@users.noreply.github.com>
…ssary lock
[WIP] Address feedback on AI model training feature implementation
Bump pnpm version from 8 to 10 across all jobs in the GitHub Actions CI workflow to ensure compatibility with the latest features and improvements.
Cleaned up pnpm-lock.yaml by removing several unused packages and their dependencies, including ai, chromadb, @xenova/transformers, and related libraries. This reduces lockfile size and helps maintain a leaner dependency tree.
This pull request introduces an end-to-end training and inference pipeline for a custom AI assistant model tailored for software development tasks and technical Q&A. It includes scripts for synthetic dataset generation, a configurable training setup, a FastAPI-based inference server, and all necessary dependencies. The most important changes are grouped below:
1. Dataset Generation and Training Pipeline
- `generate_dataset.py` and a more advanced `generate_dataset_backup.py` synthesize ~4,000 examples (task creation and tech Q&A) in TinyLlama chat format for model fine-tuning. These scripts use templates and randomization to create diverse, structured prompts and responses. [1] [2]
- `training_config.yaml` enables reproducible training runs, specifying model, LoRA, dataset, and generation hyperparameters.

2. Inference Server

- `inference/server.py` is a FastAPI app for serving the fine-tuned model with endpoints for text generation and structured task creation. The server loads LoRA adapters if available, applies chat formatting, and enables CORS for integration.

3. Documentation and Dependencies

- `README.md` covers setup, training, inference, and hardware requirements to guide users through the pipeline.
- `requirements.txt` lists all dependencies for training, inference, and utilities, ensuring reproducibility.