Skip to content

Unclear voice cloning instructions / documentation #9

@hybridherbst

Description

@hybridherbst

Hey, I tested Moss-TTS-Nano, especially the voice cloning part.

What I don't understand from the docs is

  • if the source audio transcription can be passed along somewhere (seems not?)
  • what the expected source audio length is – I tried 2s, 3s, 6s, 10s, 30s, and only the 3s-part had somewhat decent results, the others all produced garbage.
  • how to cache a voice profile for multiple generations.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions