feat: add OpenAI-compatible HTTP server for mellea backends by markstur · Pull Request #746 · generative-computing/mellea

markstur · 2026-03-25T23:51:33Z

Misc PR

Type of PR

Bug Fix
New Feature
Documentation
Other

Description

Link to Issue: Fixes Create OpenAI API-compatible HTTP interface for mellea #521

feat: add OpenAI-compatible HTTP server for mellea backends

Implement FastAPI server with /v1/chat/completions endpoint
Add streaming support via Server-Sent Events
Support all mellea backends (Ollama, OpenAI, HF, Watsonx, LiteLLM)
Include tool calling and token usage tracking
Add comprehensive test suite (9 tests)
Provide documentation and usage examples
Enable deployment as standalone service

Testing

Tests added to the respective file if code was changed
New code has 100% coverage if code as added
Ensure existing tests and github automation passes (a maintainer will kick off the github automation when the rest of the PR is populated)

- Implement FastAPI server with /v1/chat/completions endpoint - Add streaming support via Server-Sent Events - Support all mellea backends (Ollama, OpenAI, HF, Watsonx, LiteLLM) - Include tool calling and token usage tracking - Add comprehensive test suite (9 tests) - Provide documentation and usage examples - Enable deployment as standalone service Closes generative-computing#521 Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

github-actions · 2026-03-25T23:51:43Z

The PR description has been updated. Please fill out the template for your PR to be reviewed.

mergify · 2026-03-25T23:52:08Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert|release)(?:\(.+\))?:

markstur · 2026-03-26T00:06:03Z

First pass at implementing #521 and need to see if I interpreted it right.

This server has a chat/completions endpoint that acts as a proxy to backends used by mellea. It is not a server that wraps mellea (e.g. an IVR loop). Creating mellea-as-a-backend might be more interesting but I think there is other work going on that might do that. There are some words in the issue that made me not sure which was the goal.

The models list is pretty limited, but I'm thinking I would add a config (like litellm) listing all the provider/model possibilities that should show up in the models list. Otherwise it is just showing the one right now.

Need to confirm this is even going the right direction. It's a bunch of nice code -- mostly generated -- but not sure it hits the objective.

Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

- Add tool calling tests (streaming and non-streaming) - Add multi-model session management tests - Add backend configuration tests (base_url, kwargs) - Add error handling and edge case tests - Remove dead code: convert_messages_to_context() All tests use granite4:micro and pass successfully. Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

planetf1 · 2026-03-26T16:48:11Z

Added comments to issue #521

Thanks for the contribution - but I think whilst it does do what the issue suggests, that isn't what was intended, and what was intended I think is implemented via m serve - so maybe we could think how to improve that? (check for other open issues on serve?)

ajbozarth · 2026-03-26T17:33:26Z

Had this on my plate to review this afternoon, but @planetf1 has pretty much already given my feedback. I'd recommend checking his Issue comment, I agree with his train of thought

markstur · 2026-03-27T14:29:47Z

Let's close this. It's not going the right direction. See the issue #521 for more details related to m serve features.

Thanks for the reviews/feedback.

markstur requested a review from a team as a code owner March 25, 2026 23:51

github-actions Bot added the enhancement New feature or request label Mar 25, 2026

tests: add tool calling test to openai_compat server, use granite

f885402

Signed-off-by: Mark Sturdevant <mark.sturdevant@ibm.com>

markstur mentioned this pull request Mar 26, 2026

Create OpenAI API-compatible HTTP interface for mellea #521

Open

29 tasks

markstur closed this Mar 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add OpenAI-compatible HTTP server for mellea backends#746

feat: add OpenAI-compatible HTTP server for mellea backends#746
markstur wants to merge 3 commits into
generative-computing:mainfrom
markstur:issue521

markstur commented Mar 25, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Mar 25, 2026

Uh oh!

mergify Bot commented Mar 25, 2026

Uh oh!

markstur commented Mar 26, 2026

Uh oh!

planetf1 commented Mar 26, 2026

Uh oh!

ajbozarth commented Mar 26, 2026

Uh oh!

markstur commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

markstur commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Misc PR

Type of PR

Description

Testing

Uh oh!

github-actions Bot commented Mar 25, 2026

Uh oh!

mergify Bot commented Mar 25, 2026

Merge Protections

🟢 Enforce conventional commit

Uh oh!

markstur commented Mar 26, 2026

Uh oh!

planetf1 commented Mar 26, 2026

Uh oh!

ajbozarth commented Mar 26, 2026

Uh oh!

markstur commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

markstur commented Mar 25, 2026 •

edited

Loading