feat(ai): add audio generation support by hwbrzzl · Pull Request #1467 · goravel/framework

hwbrzzl · 2026-05-08T03:42:26Z

Summary

Add a fluent facades.AI().Audio(...) API for text-to-speech generation with provider, model, voice, instructions, timeout, and storage support.
Add OpenAI audio generation support, including default audio model resolution, placeholder voice mapping, and audio response MIME-aware storage behavior.
Simplify AI.Image(...) to use request-level configuration so image and audio requests follow the same fluent setup pattern.

Why

Goravel's AI package already supports text, files, and image generation, but it did not have a first-class text-to-speech flow. This change adds an audio request/response pipeline that matches the existing fluent AI patterns, so applications can generate speech directly through the facade and store the result without needing provider-specific wiring in user code.

package controllers

import (
	"time"

	"github.com/goravel/framework/facades"
)

type AudioController struct{}

func (r *AudioController) Welcome() (string, error) {
	return facades.AI().
		Audio("Welcome to Goravel").
		Provider("openai").
		Model("gpt-4o-mini-tts").
		Female().
		Instructions("Speak clearly and warmly").
		Timeout(30 * time.Second).
		StoreAs("audio/welcome.mp3")
}

This also keeps the public surface more consistent by moving image selection to the fluent request instead of mixing constructor options with request methods. That avoids duplicate ways to set provider and model, while the OpenAI implementation supplies sensible defaults for the initial runtime slice.

codecov · 2026-05-08T03:45:33Z

Codecov Report

❌ Patch coverage is 77.43363% with 51 lines in your changes missing coverage. Please review.
✅ Project coverage is 69.26%. Comparing base (950dc14) to head (dda0b4b).
⚠️ Report is 1 commits behind head on master.

Files with missing lines	Patch %	Lines
ai/openai/provider.go	69.69%	16 Missing and 4 partials ⚠️
ai/response.go	74.41%	10 Missing and 1 partial ⚠️
ai/audio_request.go	80.00%	9 Missing and 1 partial ⚠️
ai/media_storage.go	77.77%	4 Missing and 4 partials ⚠️
ai/application.go	85.71%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1467      +/-   ##
==========================================
+ Coverage   69.19%   69.26%   +0.07%     
==========================================
  Files         370      373       +3     
  Lines       29338    29523     +185     
==========================================
+ Hits        20300    20450     +150     
- Misses       8106     8135      +29     
- Partials      932      938       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copilot

Pull request overview

This PR adds first-class audio (text-to-speech) generation to the AI facade/API, extends the OpenAI provider to implement audio generation with sensible defaults, and aligns the image generation API to the same fluent request pattern.

Changes:

Introduces fluent AI.Audio(prompt) / AudioRequest / AudioResponse contracts plus framework implementations for generating and storing audio.
Adds OpenAI audio generation support (default audio model config, default voice mapping, response MIME-type handling).
Simplifies AI.Image(...) to remove constructor options and rely on request-level configuration (.Provider(...), .Model(...), etc.).

Reviewed changes

Copilot reviewed 21 out of 21 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
mocks/ai/AudioResponse.go	Adds mock for the new `contracts/ai.AudioResponse` interface.
mocks/ai/AudioRequest.go	Adds mock for the new `contracts/ai.AudioRequest` fluent request interface.
mocks/ai/AudioProvider.go	Adds mock for the new `contracts/ai.AudioProvider` interface.
mocks/ai/AI.go	Updates AI mock to include `Audio(prompt)` and changes `Image(prompt)` signature (removes options).
errors/list.go	Adds centralized error definitions for audio flow (prompt, store path/name, provider capability).
contracts/ai/response.go	Introduces `AudioResponse` contract (content, MIME type, storage, usage, callbacks).
contracts/ai/provider.go	Adds `AudioPrompt` and `AudioProvider` contract for provider implementations.
contracts/ai/config.go	Extends model config with `Models.Audio.Default`.
contracts/ai/audio.go	Adds `AudioRequest` contract defining the fluent audio request API.
contracts/ai/ai.go	Adds `Audio(prompt)` to the AI facade contract and updates `Image(prompt)` signature.
ai/response.go	Implements `audioResponse` (content cloning, MIME-aware filename extension, storage helpers).
ai/openai/provider.go	Implements OpenAI audio generation, default audio model resolution, voice mapping, and response parsing.
ai/openai/provider_test.go	Updates provider config expectations to include the audio default model.
ai/image/image.go	Updates image helper `Of(prompt)` to match the new `AI.Image(prompt)` signature.
ai/image/image_test.go	Adjusts image helper tests to reflect the new `Image(prompt)` signature.
ai/image_request.go	Removes image constructor options parsing; relies on fluent request configuration instead.
ai/audio_voice.go	Defines internal constants for default male/female voice selectors.
ai/audio_storage.go	Adds audio storage implementation (Store/StoreAs) similar to image storage behavior.
ai/audio_request.go	Adds fluent `audioRequest` implementation (provider/model/voice/instructions/timeout + Store/Generate).
ai/application.go	Adds `Application.Audio(prompt)` and provider dispatch for audio (`audio(...)`).
ai/application_test.go	Updates image request test to use fluent request configuration (no constructor options).

hwbrzzl

Addressed the latest annotation in commit 8264e3a by refactoring the shared media storage path normalization and write helpers.

Copilot

Pull request overview

Copilot reviewed 25 out of 25 changed files in this pull request and generated 2 comments.

* origin/master: feat(ai): add audio generation support (goravel#1467) chore: Update non-major dependencies (goravel#1468) refactor(ai): rename agent response interfaces (goravel#1466) feat(ai): add image storage helpers (goravel#1465) chore: Update non-major dependencies (goravel#1464)

feat(ai): add audio generation support

e0e254d

Copilot AI review requested due to automatic review settings May 8, 2026 03:42

hwbrzzl requested a review from a team as a code owner May 8, 2026 03:42

Copilot started reviewing on behalf of hwbrzzl May 8, 2026 03:43 View session

Copilot AI reviewed May 8, 2026

View reviewed changes

Comment thread ai/audio_storage.go Outdated

Comment thread ai/openai/provider.go

Comment thread ai/application.go

hwbrzzl added 2 commits May 8, 2026 14:09

test(ai): cover audio generation flows

6bfc24a

refactor(ai): share media storage path helpers

8264e3a

Copilot AI review requested due to automatic review settings May 8, 2026 06:54

Copilot started reviewing on behalf of hwbrzzl May 8, 2026 06:55 View session

hwbrzzl commented May 8, 2026

View reviewed changes

Comment thread ai/audio_storage.go

Copilot AI reviewed May 8, 2026

View reviewed changes

Comment thread ai/media_storage.go

Comment thread ai/response.go Outdated

hwbrzzl commented May 11, 2026

View reviewed changes

Comment thread ai/media_storage.go

fix(ai): harden media storage path handling

dda0b4b

hwbrzzl merged commit c78a1d9 into master May 11, 2026
19 checks passed

hwbrzzl deleted the bowen/#918-4 branch May 11, 2026 03:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai): add audio generation support#1467

feat(ai): add audio generation support#1467
hwbrzzl merged 4 commits into
masterfrom
bowen/#918-4

hwbrzzl commented May 8, 2026

Uh oh!

codecov Bot commented May 8, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hwbrzzl left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hwbrzzl commented May 8, 2026

Summary

Why

Uh oh!

codecov Bot commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hwbrzzl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented May 8, 2026 •

edited

Loading