Commit 529ea03

Update README with API and ComfyUI integration (#48)
Added API access and ComfyUI integration sections to README.
1 parent ee050e4 commit 529ea03

1 file changed

Lines changed: 25 additions & 0 deletions

README.md

@@ -6,6 +6,7 @@
 <a href="https://github.com/OpenMOSS/MOVA"><img src="https://img.shields.io/badge/Github-Star-yellow?logo=Github&amp"></a>
 <a href="https://huggingface.co/collections/OpenMOSS-Team/mova"><img src="https://img.shields.io/badge/Huggingface-Download-orange?logo=Huggingface&amp"></a>
 <a href="https://mosi.cn/models/mova"><img src="https://img.shields.io/badge/Website-View-blue?logo=Website&amp"></a>
+<a href="https://studio.mosi.cn/docs/models/mova?src=github"><img src="https://img.shields.io/badge/API-Apply-green?logo=OpenAI&amp"></a>
 <a href="https://arxiv.org/abs/2602.08794">
 <img src="https://img.shields.io/badge/Arxiv-2602.08794-red?logo=Arxiv&amp">
 </a>
@@ -26,6 +27,8 @@ We introduce **MOVA** (**MO**SS **V**ideo and **A**udio), a foundation model des
 - **Asymmetric Dual-Tower Architecture**: Leverages the power of pre-trained video and audio towers, fused via a bidirectional cross-attention mechanism for rich modality interaction.

 ## 🔥News!!!
+- 2026/03/09: 🎉**MOVA API** is now available! Apply for your API key at [studio.mosi.cn](https://studio.mosi.cn/docs/models/mova?src=github) to start generating videos programmatically.
+- 2026/03/09: 🎉**ComfyUI support** is here! Thanks to [@richservo](https://github.com/richservo), you can now use MOVA in ComfyUI at low cost via [comfyui-mova](https://github.com/richservo/comfyui-mova).
 - 2026/02/10: 🎉We released **MOVA** [technical report](https://arxiv.org/abs/2602.08794) and update [inference workflow](https://github.com/OpenMOSS/MOVA/pull/29).
 - 2026/01/29: 🎉We released **MOVA**, an open-source foundation model for high-fidelity synchronized video–audio generation!!!

@@ -220,6 +223,26 @@ We provide an **[AI-assisted video generation workflow](./workflow)** for end-to

 See the [**workflow README**](./workflow/README.md) for setup, configuration (SGLang server, API keys), and usage.

+## API Access
+
+MOVA is available as a hosted API on our platform. You can apply for an API key and start generating synchronized video-audio content without managing any infrastructure.
+
+**Apply here**: [studio.mosi.cn/docs/models/mova](https://studio.mosi.cn/docs/models/mova?src=github)
+
+## ComfyUI Integration
+
+MOVA is supported in [ComfyUI](https://github.com/comfyanonymous/ComfyUI) through the community plugin [comfyui-mova](https://github.com/richservo/comfyui-mova) by [@richservo](https://github.com/richservo). It provides two nodes — **MOVA Model Loader** and **MOVA Generate** — for both Image-to-Video-Audio (I2VA) and Text-to-Video-Audio (T2VA) workflows.
+
+### Installation
+
+```bash
+cd ComfyUI/custom_nodes
+git clone --recursive https://github.com/richservo/comfyui-mova.git
+cd comfyui-mova
+pip install -r requirements.txt
+```
+
+Models download automatically from HuggingFace on first use. See the [comfyui-mova README](https://github.com/richservo/comfyui-mova) for full usage details, VRAM guide, and workflow examples.

 ## Training
 ### LoRA Fine-tuning
@@ -284,6 +307,8 @@ All peak usage numbers below are measured on **360p, 8-second** video training s
 - [x] SGLang Integration
 - [x] Generation Workflow
 - [x] Technical Report
+- [x] API Access
+- [x] ComfyUI Integration
 - [ ] Diffusers Integration

 ## Citation
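
The ComfyUI installation steps added in this commit should leave the plugin under `ComfyUI/custom_nodes/comfyui-mova`. As a rough sketch, the shell function below checks that this directory and its `requirements.txt` exist before ComfyUI is started. `check_mova_node` is a hypothetical helper written for illustration, not part of comfyui-mova or ComfyUI, and the directory layout is assumed from the commands shown in the diff.

```bash
# Hypothetical helper (not shipped with comfyui-mova or ComfyUI).
# Assumes the layout produced by the README's install commands:
#   <comfy_root>/custom_nodes/comfyui-mova/requirements.txt
check_mova_node() {
    # First argument: path to the ComfyUI checkout (defaults to ./ComfyUI).
    local comfy_root="${1:-ComfyUI}"
    local node_dir="$comfy_root/custom_nodes/comfyui-mova"

    # The plugin directory is created by the `git clone` step.
    if [ ! -d "$node_dir" ]; then
        echo "missing: $node_dir"
        return 1
    fi

    # requirements.txt is the file the `pip install -r` step reads.
    if [ ! -f "$node_dir/requirements.txt" ]; then
        echo "incomplete: $node_dir/requirements.txt not found"
        return 1
    fi

    echo "ok: $node_dir"
}
```

Calling `check_mova_node /path/to/ComfyUI` prints a line starting with `ok:` when both the directory and the requirements file are present, and returns a non-zero status otherwise. If the repository was cloned without `--recursive`, running `git submodule update --init --recursive` inside the plugin directory fetches any submodules afterwards.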
