@@ -19,8 +19,8 @@ Construct vendors directly from `agora_agent`, then bind a client with `Agent(cl
 
 | Area | STT classes | LLM classes | TTS classes | Avatar classes |
 | --- | --- | --- | --- | --- |
-| `Area.US`, `Area.EU`, `Area.AP` | `DeepgramSTT`, `SpeechmaticsSTT`, `MicrosoftSTT`, `OpenAISTT`, `GoogleSTT`, `AmazonSTT`, `AssemblyAISTT`, `AresSTT`, `SarvamSTT` | `OpenAI`, `AzureOpenAI`, `Anthropic`, `Gemini`, `Groq`, `VertexAILLM`, `AmazonBedrock`, `Dify`, `CustomLLM` | `ElevenLabsTTS`, `MicrosoftTTS`, `OpenAITTS`, `CartesiaTTS`, `GoogleTTS`, `AmazonTTS`, `DeepgramTTS`, `HumeAITTS`, `RimeTTS`, `FishAudioTTS`, `MiniMaxTTS`, `MurfTTS`, `SarvamTTS` | `LiveAvatarAvatar`, `HeyGenAvatar`, `AkoolAvatar`, `AnamAvatar`, `GenericAvatar` |
-| `Area.CN` | `FengmingSTT`, `TencentSTT`, `MicrosoftCNSTT`, `XfyunSTT`, `XfyunBigModelSTT`, `XfyunDialectSTT` | `AliyunLLM`, `BytedanceLLM`, `DeepSeekLLM`, `TencentLLM` | `MiniMaxCNTTS`, `TencentTTS`, `BytedanceTTS`, `MicrosoftCNTTS`, `CosyVoiceTTS`, `BytedanceDuplexTTS`, `StepFunTTS` | `SenseTimeAvatar` |
+| `Area.US`, `Area.EU`, `Area.AP` | `DeepgramSTT`, `SpeechmaticsSTT`, `MicrosoftSTT`, `OpenAISTT`, `GoogleSTT`, `AmazonSTT`, `AssemblyAISTT`, `AresSTT`, `SarvamSTT`, `XaiSTT` | `OpenAI`, `AzureOpenAI`, `Anthropic`, `Gemini`, `Groq`, `VertexAILLM`, `AmazonBedrock`, `Dify`, `CustomLLM` | `ElevenLabsTTS`, `MicrosoftTTS`, `OpenAITTS`, `CartesiaTTS`, `GoogleTTS`, `AmazonTTS`, `DeepgramTTS`, `HumeAITTS`, `RimeTTS`, `FishAudioTTS`, `MiniMaxTTS`, `MurfTTS`, `SarvamTTS`, `GenericTTS`, `XaiTTS` | `LiveAvatarAvatar`, `HeyGenAvatar`, `AkoolAvatar`, `AnamAvatar`, `GenericAvatar` |
+| `Area.CN` | `FengmingSTT`, `TencentSTT`, `MicrosoftCNSTT`, `XfyunSTT`, `XfyunBigModelSTT`, `XfyunDialectSTT` | `AliyunLLM`, `BytedanceLLM`, `DeepSeekLLM`, `TencentLLM` | `MiniMaxCNTTS`, `TencentTTS`, `BytedanceTTS`, `MicrosoftCNTTS`, `CosyVoiceTTS`, `BytedanceDuplexTTS`, `StepFunTTS`, `GenericTTS` | `SenseTimeAvatar`, `SpatiusAvatar` |
 
 Global example:
 
@@ -85,6 +85,7 @@ tts = MiniMaxCNTTS(
 | `max_tokens` | `int` | No | `None` | Maximum tokens to generate |
 | `system_messages` | `List[Dict]` | No | `None` | System messages |
 | `greeting_message` | `str` | No | `None` | Greeting message |
+| `greeting_audio_url` | `str` | No | `None` | Publicly accessible greeting audio URL |
 | `failure_message` | `str` | No | `None` | Failure message |
 | `input_modalities` | `List[str]` | No | `None` | Input modalities |
 | `output_modalities` | `List[str]` | No | `None` | Output modalities |
@@ -93,6 +94,8 @@ tts = MiniMaxCNTTS(
 | `greeting_configs` | `Dict[str, Any]` | No | `None` | Greeting playback configuration |
 | `template_variables` | `Dict[str, str]` | No | `None` | Template variables for messages |
 
+`greeting_configs` may also include `audio_download_timeout_ms`, `audio_pcm_sample_rate`, and `uninterruptible_asr_policy`.
+
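As a rough illustration of how the greeting options fit together, the sketch below assembles keyword arguments that could be passed to an LLM class such as `OpenAI`. The names `greeting_audio_url`, `greeting_configs`, `audio_download_timeout_ms`, and `audio_pcm_sample_rate` come from the table and note above. The helper `greeting_kwargs`, the example URL, and all values are hypothetical placeholders, and the http(s) check is only an assumed reading of "publicly accessible". The accepted values for `uninterruptible_asr_policy` are not listed here, so that key is left out.

```python
from typing import Any, Dict, Optional
from urllib.parse import urlparse


def greeting_kwargs(
    audio_url: str,
    download_timeout_ms: Optional[int] = None,
    pcm_sample_rate: Optional[int] = None,
) -> Dict[str, Any]:
    """Build greeting-related keyword arguments for an LLM vendor class."""
    parsed = urlparse(audio_url)
    # Assumption: a publicly accessible URL is an absolute http(s) URL.
    # This checks the shape only, not whether the file can be fetched.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {audio_url!r}")

    # Only include the greeting_configs keys that were actually set.
    configs: Dict[str, Any] = {}
    if download_timeout_ms is not None:
        configs["audio_download_timeout_ms"] = download_timeout_ms
    if pcm_sample_rate is not None:
        configs["audio_pcm_sample_rate"] = pcm_sample_rate

    kwargs: Dict[str, Any] = {"greeting_audio_url": audio_url}
    if configs:
        kwargs["greeting_configs"] = configs
    return kwargs


# Placeholder values only.
kwargs = greeting_kwargs(
    "https://example.com/greeting.pcm",
    download_timeout_ms=3000,
    pcm_sample_rate=16000,
)
```

The resulting dictionary can be unpacked into the constructor alongside the other LLM parameters, for example `OpenAI(..., **kwargs)`.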
 <!-- snippet: fragment -->
 ```python
 from agora_agent import OpenAI
@@ -378,6 +381,33 @@ The SDK also includes named helpers for the remaining Agora-supported LLM provid
 | `sample_rate` | `int` | No | `None` | Audio sample rate |
 | `skip_patterns` | `List[int]` | No | `None` | Skip patterns |
 
+### `GenericTTS`
+
+| Parameter | Type | Required | Default | Description |
+| --- | --- | --- | --- | --- |
+| `url` | `str` | Yes | — | Callback address of the generic TTS service |
+| `headers` | `Dict[str, str]` | Yes | — | Custom headers to include in requests to the generic TTS service |
+| `model` | `str` | Yes | — | TTS model name |
+| `voice` | `str` | Yes | — | Voice name |
+| `api_key` | `str` | No | `None` | API key for the generic TTS service |
+| `speed` | `float` | No | `None` | Speech rate |
+| `sample_rate` | `int` | No | `None` | Output audio sample rate in Hz |
+| `response_format` | `str` | No | `None` | Output audio format; use `pcm` |
+| `instruction` | `str` | No | `None` | Additional voice style control instruction |
+| `additional_params` | `Dict[str, Any]` | No | `None` | Additional generic TTS parameters |
+| `skip_patterns` | `List[int]` | No | `None` | Skip patterns |
+
+### `XaiTTS`
+
+| Parameter | Type | Required | Default | Description |
+| --- | --- | --- | --- | --- |
+| `api_key` | `str` | Yes | — | xAI API key |
+| `language` | `str` | Yes | — | BCP-47 language code for speech synthesis |
+| `voice_id` | `str` | No | `None` | xAI voice identifier |
+| `sample_rate` | `int` | No | `None` | Audio sample rate in Hz |
+| `additional_params` | `Dict[str, Any]` | No | `None` | Additional xAI TTS parameters |
+| `skip_patterns` | `List[int]` | No | `None` | Skip patterns |
+
 ---
 
 ## STT Vendors
@@ -471,6 +501,16 @@ For `nova-2` and `nova-3`, omit `api_key` to use Agora-managed credentials. For
 | `language` | `str` | Yes | — | Language code (e.g., `en`, `hi`) |
 | `additional_params` | `Dict[str, Any]` | No | `None` | Additional parameters |
 
+### `XaiSTT`
+
+| Parameter | Type | Required | Default | Description |
+| --- | --- | --- | --- | --- |
+| `api_key` | `str` | Yes | — | xAI API key |
+| `base_url` | `str` | No | `None` | WebSocket endpoint URL for the xAI streaming STT API |
+| `sample_rate` | `int` | No | `None` | Audio sample rate in Hz |
+| `language` | `str` | No | `None` | Language code for speech recognition |
+| `additional_params` | `Dict[str, Any]` | No | `None` | Additional xAI STT parameters |
+
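The `XaiSTT` options above could be collected as in the sketch below, which leaves unset optional parameters out so that the SDK's own defaults apply. The helper `xai_stt_kwargs` is hypothetical. The `ws`/`wss` scheme check is an assumption based on the table describing `base_url` as a WebSocket endpoint, and the key, language, and sample rate are placeholders.

```python
from typing import Any, Dict, Optional
from urllib.parse import urlparse


def xai_stt_kwargs(
    api_key: str,
    base_url: Optional[str] = None,
    language: Optional[str] = None,
    sample_rate: Optional[int] = None,
) -> Dict[str, Any]:
    """Collect XaiSTT keyword arguments, leaving unset options out."""
    if not api_key:
        raise ValueError("api_key is required")
    if base_url is not None and urlparse(base_url).scheme not in ("ws", "wss"):
        # Assumption: a WebSocket endpoint uses the ws:// or wss:// scheme.
        raise ValueError(f"expected a ws:// or wss:// URL, got {base_url!r}")
    optional = {
        "base_url": base_url,
        "language": language,
        "sample_rate": sample_rate,
    }
    kwargs: Dict[str, Any] = {"api_key": api_key}
    kwargs.update({k: v for k, v in optional.items() if v is not None})
    return kwargs


# Placeholder key and settings.
kwargs = xai_stt_kwargs("YOUR_XAI_API_KEY", language="en", sample_rate=16000)
# With the SDK installed: from agora_agent import XaiSTT; stt = XaiSTT(**kwargs)
```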
 ---
 
 ## CN Vendors
@@ -658,6 +698,21 @@ No constructor parameters. Use `FengmingSTT()`.
 | `enable` | `bool` | No | `None` | Whether to enable the avatar |
 | `additional_params` | `Dict[str, Any]` | No | `None` | Additional SenseTime avatar parameters |
 
+#### `SpatiusAvatar`
+
+| Parameter | Type | Required | Default | Description |
+| --- | --- | --- | --- | --- |
+| `spatius_api_key` | `str` | Yes | — | Spatius API key |
+| `spatius_app_id` | `str` | Yes | — | Spatius application ID |
+| `spatius_avatar_id` | `str` | Yes | — | Spatius avatar ID |
+| `agora_uid` | `str` | Yes | — | Agora UID used by the avatar service |
+| `agora_token` | `str` | No | `None` | RTC token for avatar publisher; generated by AgentSession when omitted |
+| `region` | `str` | No | `None` | Spatius service region, for example `cn-beijing` |
+| `sample_rate` | `int` | No | `None` | Optional avatar-declared sample rate; TTS sample rate should match when set |
+| `session_expire_minutes` | `int` | No | `None` | Spatius session validity duration in minutes |
+| `enable` | `bool` | No | `None` | Whether to enable the avatar |
+| `additional_params` | `Dict[str, Any]` | No | `None` | Additional Spatius avatar parameters |
+
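The `sample_rate` row says the TTS sample rate should match when the avatar declares one. The sketch below shows one way to check that before constructing anything. The helper `check_sample_rates` is hypothetical, and raising an error is stricter than the table's "should", so treat it as a local safeguard, not SDK behavior. All credentials and IDs are placeholders.

```python
from typing import Any, Dict, Optional


def check_sample_rates(
    avatar_kwargs: Dict[str, Any], tts_sample_rate: Optional[int]
) -> None:
    """Raise if the avatar declares a sample rate that the TTS does not match."""
    declared = avatar_kwargs.get("sample_rate")
    # Nothing to check when the avatar does not declare a rate.
    if declared is not None and declared != tts_sample_rate:
        raise ValueError(
            f"avatar sample_rate={declared} but TTS sample_rate={tts_sample_rate}"
        )


# Placeholder credentials and IDs. `agora_token` is omitted so that, per the
# table, AgentSession generates it.
avatar_kwargs = {
    "spatius_api_key": "YOUR_SPATIUS_API_KEY",
    "spatius_app_id": "YOUR_SPATIUS_APP_ID",
    "spatius_avatar_id": "YOUR_AVATAR_ID",
    "agora_uid": "12345",
    "region": "cn-beijing",
    "sample_rate": 24000,
}
check_sample_rates(avatar_kwargs, tts_sample_rate=24000)  # rates match, no error
# With the SDK installed:
#   from agora_agent import SpatiusAvatar
#   avatar = SpatiusAvatar(**avatar_kwargs)
```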
 ## MLLM Vendors
 
 ### `OpenAIRealtime`