You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -26,6 +27,8 @@ We introduce **MOVA** (**MO**SS **V**ideo and **A**udio), a foundation model des
26
27
-**Asymmetric Dual-Tower Architecture**: Leverages the power of pre-trained video and audio towers, fused via a bidirectional cross-attention mechanism for rich modality interaction.
27
28
28
29
## 🔥News!!!
30
+
- 2026/03/09: 🎉**MOVA API** is now available! Apply for your API key at [studio.mosi.cn](https://studio.mosi.cn/docs/models/mova?src=github) to start generating videos programmatically.
31
+
- 2026/03/09: 🎉**ComfyUI support** is here! Thanks to [@richservo](https://github.com/richservo), you can now use MOVA in ComfyUI at low cost via [comfyui-mova](https://github.com/richservo/comfyui-mova).
29
32
- 2026/02/10: 🎉We released **MOVA**[technical report](https://arxiv.org/abs/2602.08794) and update [inference workflow](https://github.com/OpenMOSS/MOVA/pull/29).
30
33
- 2026/01/29: 🎉We released **MOVA**, an open-source foundation model for high-fidelity synchronized video–audio generation!!!
31
34
@@ -220,6 +223,26 @@ We provide an **[AI-assisted video generation workflow](./workflow)** for end-to
220
223
221
224
See the [**workflow README**](./workflow/README.md) for setup, configuration (SGLang server, API keys), and usage.
222
225
226
+
## API Access
227
+
228
+
MOVA is available as a hosted API on our platform. You can apply for an API key and start generating synchronized video-audio content without managing any infrastructure.
MOVA is supported in [ComfyUI](https://github.com/comfyanonymous/ComfyUI) through the community plugin [comfyui-mova](https://github.com/richservo/comfyui-mova) by [@richservo](https://github.com/richservo). It provides two nodes — **MOVA Model Loader** and **MOVA Generate** — for both Image-to-Video-Audio (I2VA) and Text-to-Video-Audio (T2VA) workflows.
Models download automatically from HuggingFace on first use. See the [comfyui-mova README](https://github.com/richservo/comfyui-mova) for full usage details, VRAM guide, and workflow examples.
223
246
224
247
## Training
225
248
### LoRA Fine-tuning
@@ -284,6 +307,8 @@ All peak usage numbers below are measured on **360p, 8-second** video training s
0 commit comments