Commit 6ab5238

Add new plugins to README (#257)
* Update integrations table
* house keeping
1 parent ca47a7d commit 6ab5238

92 files changed

Lines changed: 387 additions & 44620 deletions

README.md

Lines changed: 11 additions & 5 deletions
@@ -106,22 +106,28 @@ Get a free API key from [Stream](https://getstream.io/). Developers receive **33
 
 | **Plugin Name** | **Description** | **Docs Link** |
 |-------------|-------------|-----------|
-| AWS | AWS (Bedrock) integration with support for standard LLM (Qwen, Claude with vision), realtime with Nova 2 Sonic, and TTS with AWS Polly | [AWS](https://visionagents.ai/integrations/aws) |
+| AWS Bedrock | Realtime speech-to-speech plugin using Amazon Nova models with automatic reconnection | [AWS](https://visionagents.ai/integrations/aws-bedrock) |
+| AWS Polly | TTS plugin using Amazon's cloud-based service with natural-sounding voices and neural engine support | [AWS Polly](https://visionagents.ai/integrations/aws-polly) |
 | Cartesia | TTS plugin for realistic voice synthesis in real-time voice applications | [Cartesia](https://visionagents.ai/integrations/cartesia) |
-| Decart | Real-time video restyling capabilities using generative AI models | [Decart](https://visionagents.ai/integrations/decart) |
+| Decart | Real-time AI video transformation service for applying artistic styles and effects to video streams | [Decart](https://visionagents.ai/integrations/decart) |
 | Deepgram | STT plugin for fast, accurate real-time transcription with speaker diarization | [Deepgram](https://visionagents.ai/integrations/deepgram) |
 | ElevenLabs | TTS plugin with highly realistic and expressive voices for conversational agents | [ElevenLabs](https://visionagents.ai/integrations/elevenlabs) |
+| Fast-Whisper | High-performance STT plugin using OpenAI's Whisper model with CTranslate2 for fast inference | [Fast-Whisper](https://visionagents.ai/integrations/fast-whisper) |
 | Fish Audio | STT and TTS plugin with automatic language detection and voice cloning capabilities | [Fish Audio](https://visionagents.ai/integrations/fish) |
 | Gemini | Realtime API for building conversational agents with support for both voice and video | [Gemini](https://visionagents.ai/integrations/gemini) |
-| HeyGen | Realtime interactive avatars powered by [HeyGen](https://heygen.com/) | [Heygen](https://visionagents.ai/integrations/heygen) |
+| HeyGen | Realtime interactive avatars powered by [HeyGen](https://heygen.com/) | [HeyGen](https://visionagents.ai/integrations/heygen) |
 | Inworld | TTS plugin with high-quality streaming voices for real-time conversational AI agents | [Inworld](https://visionagents.ai/integrations/inworld) |
 | Kokoro | Local TTS engine for offline voice synthesis with low latency | [Kokoro](https://visionagents.ai/integrations/kokoro) |
 | Moondream | Moondream provides realtime detection and VLM capabilities. Developers can choose from using the hosted API or running locally on their CUDA devices. Vision Agents supports Moondream's Detect, Caption and VQA skills out-of-the-box. | [Moondream](https://visionagents.ai/integrations/moondream) |
 | OpenAI | Realtime API for building conversational agents with out of the box support for real-time video directly over WebRTC, LLMs and Open AI TTS | [OpenAI](https://visionagents.ai/integrations/openai) |
+| OpenRouter | LLM plugin providing access to multiple providers (Anthropic, Google, OpenAI) through a unified API | [OpenRouter](https://visionagents.ai/integrations/openrouter) |
+| Qwen | Realtime audio plugin using Alibaba's Qwen3 with native audio output and built-in speech recognition | [Qwen](https://visionagents.ai/integrations/qwen) |
+| Roboflow | Object detection processor using Roboflow's hosted API or local RF-DETR models | [Roboflow](https://visionagents.ai/integrations/roboflow) |
 | Smart Turn | Advanced turn detection system combining Silero VAD, Whisper, and neural models for natural conversation flow | [Smart Turn](https://visionagents.ai/integrations/smart-turn) |
+| Ultralytics | Real-time pose detection processor using YOLO models with skeleton overlays | [Ultralytics](https://visionagents.ai/integrations/ultralytics) |
 | Vogent | Neural turn detection system for intelligent turn-taking in voice conversations | [Vogent](https://visionagents.ai/integrations/vogent) |
 | Wizper | STT plugin with real-time translation capabilities powered by Whisper v3 | [Wizper](https://visionagents.ai/integrations/wizper) |
-| xAI | xAI (Grok) integration for using powerful language models in conversational AI applications | [xAI](https://visionagents.ai/integrations/xai) |
+| xAI | LLM plugin using xAI's Grok models with advanced reasoning and real-time knowledge | [xAI](https://visionagents.ai/integrations/xai) |
 
 
 ## Processors
@@ -230,7 +236,7 @@ While building the integrations, here are the limitations we've noticed (Dec 202
 
 ## We are hiring
 
-Join the team behind this project - we’re hiring a Staff Python Engineer to architect, build, and maintain a powerful toolkit for developers integrating voice and video AI into their products.
+Join the team behind this project - we’re hiring a Staff Python Engineer to architect, build, and maintain a powerful toolkit for developers integrating voice and video AI into their products.
 
 [Apply here](https://jobs.ashbyhq.com/stream/3bea7dba-54e1-4c71-aa02-712a075842df?utm_source=Jmv9QOkznl)

examples/other_examples/07_function_calling_example/claude_example.py

Lines changed: 0 additions & 56 deletions
This file was deleted.

examples/other_examples/07_function_calling_example/gemini_example.py

Lines changed: 0 additions & 54 deletions
This file was deleted.

examples/other_examples/07_function_calling_example/openai_example.py

Lines changed: 0 additions & 56 deletions
This file was deleted.

examples/other_examples/07_function_calling_example/pyproject.toml

Lines changed: 0 additions & 28 deletions
This file was deleted.
