Text

LLM Runners

Ollama: https://github.com/ollama/ollama
GPT4All: https://gpt4all.io/
LM Studio: https://lmstudio.ai/
Jan: https://www.jan.ai/

Images

Text-to-Image

Image-to-Image

Video

Text-to-Video

Image-to-Video

Speech

Text-to-Speech

Speech Transcription

Whisper: https://github.com/openai/whisper

Voice Cloning

Chatterbox: https://github.com/resemble-ai/chatterbox
Spark-TTS: https://github.com/sparkaudio/spark-tts

Audio

Text-to-Audio

Stable Audio: https://github.com/Stability-AI/stable-audio-3

Unconditional Audio Generation

WaveGAN: https://github.com/mattjwarren/wavegan
Generate Your Own Music: https://github.com/DolicaAkelloEgwel/Generate_Your_Own_Music
RAVE: https://github.com/acids-ircam/RAVE