Name	Name	Last commit message	Last commit date
parent directory ..
src	src
tests	tests
.env.example	.env.example
.npmrc	.npmrc
README.md	README.md
package.json	package.json
pnpm-lock.yaml	pnpm-lock.yaml

Name

Last commit message

Last commit date

tests

Vercel AI SDK — Transcribe Audio and Generate Speech with Deepgram

Use the Vercel AI SDK's unified interface to transcribe audio and generate speech with Deepgram, using the same API patterns you'd use with any other AI provider. Swap between Deepgram, OpenAI, and others by changing one import.

What you'll build

A Node.js script that does two things: transcribes an audio file using Deepgram's nova-3 model via the AI SDK's transcribe() function, then generates speech audio from text using Deepgram's Aura 2 TTS via the AI SDK's generateSpeech() function. The transcript prints to the console; the generated audio saves to a file you can play back.

Prerequisites

Node.js 18+
Deepgram account — get a free API key

Environment variables

Copy .env.example to .env and fill in your API key:

Variable	Where to find it
`DEEPGRAM_API_KEY`	Deepgram console → API Keys

Install and run

npm install
npm start

To transcribe a different file, set the AUDIO_URL environment variable:

AUDIO_URL=https://example.com/my-audio.wav npm start

How it works

transcribe() from the ai package provides a provider-agnostic transcription interface
deepgram.transcription('nova-3') routes the request through the @ai-sdk/deepgram provider to Deepgram's pre-recorded STT API
The transcript is returned with text, segments (with timestamps), and duration metadata
generateSpeech() provides a provider-agnostic TTS interface
deepgram.speech('aura-2-helena-en') routes through Deepgram's Aura TTS API
The generated audio is saved as a raw PCM file

The key advantage of the AI SDK approach is portability: you can swap deepgram.transcription('nova-3') for openai.transcription('whisper-1') without changing any other code.

Starter templates

If you want a ready-to-run base for your own project, check the deepgram-starters org — there are starter repos for every language and every Deepgram product.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Vercel AI SDK — Transcribe Audio and Generate Speech with Deepgram

What you'll build

Prerequisites

Environment variables

Install and run

How it works

Related

Starter templates

Uh oh!

FilesExpand file tree

050-vercel-ai-sdk-node

Directory actions

More options

Directory actions

More options

Latest commit

History

050-vercel-ai-sdk-node

Folders and files

parent directory

README.md

Vercel AI SDK — Transcribe Audio and Generate Speech with Deepgram

What you'll build

Prerequisites

Environment variables

Install and run

How it works

Related

Starter templates