Feature Request: Add FunASR Speech-to-Text component for audio document processing

## Feature Request

Haystack is excellent for building NLP/RAG pipelines. A Speech-to-Text component powered by FunASR would enable audio document processing in Haystack pipelines.

**Use case:** Audio/video files → FunASR transcription → text preprocessing → indexing → retrieval

**Why FunASR?**

- **OpenAI-compatible API**: `/v1/audio/transcriptions` endpoint — easy to wrap as a Haystack component
- **SenseVoice**: Fast ASR model (234M params) covering 50+ languages, reported to run 5-10x faster than Whisper
- **Complete pipeline**: VAD + ASR + punctuation + speaker diarization + timestamps
- **Self-hosted**: No API key, runs locally
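
To illustrate the first point, here is a minimal sketch of calling the `/v1/audio/transcriptions` endpoint with only the Python standard library. It assumes the server follows the OpenAI request shape (multipart form fields `file` and `model`, JSON response with a `text` key). The base URL, the model name, and both helper function names are hypothetical and would need to match the actual FunASR deployment.

```python
import json
import os
import uuid
from urllib import request


def build_transcription_request(base_url, audio_bytes, filename, model):
    """Build a multipart/form-data POST for an OpenAI-style transcription endpoint."""
    boundary = uuid.uuid4().hex
    # Form fields "model" and "file" follow the OpenAI audio transcription API.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",
        data=head + audio_bytes + tail,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(base_url, audio_path, model):
    """Send a local audio file to the server and return the transcribed text."""
    with open(audio_path, "rb") as f:
        audio_bytes = f.read()
    req = build_transcription_request(
        base_url, audio_bytes, os.path.basename(audio_path), model
    )
    with request.urlopen(req) as resp:
        # Assumption: the response is JSON with a top-level "text" field.
        return json.loads(resp.read().decode("utf-8"))["text"]
```

A Haystack component built this way would only need the server URL at init time, e.g. `transcribe("http://localhost:8000", "meeting.wav", "sensevoice")`, where the URL and model name are placeholders.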

**Potential Haystack component:**
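```python
from typing import List

from funasr import AutoModel
from haystack import Document, component


@component
class FunASRTranscriber:
    """Transcribe an audio file with FunASR and return the text as Haystack Documents."""

    def __init__(self):
        # ASR, voice activity detection, punctuation, and speaker models.
        self.model = AutoModel(
            model="paraformer-zh",
            vad_model="fsmn-vad",
            punc_model="ct-punc",
            spk_model="cam++",
        )

    @component.output_types(documents=List[Document])
    def run(self, audio_path: str):
        # generate() returns a list of result dicts, each with a "text" field.
        result = self.model.generate(input=audio_path)
        return {"documents": [Document(content=r["text"]) for r in result]}
```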
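
Since the `run` method above keeps only the `text` field, the timestamps and speaker labels mentioned earlier would be lost. The sketch below shows one possible way to preserve them as metadata. It assumes each result dict may carry a `sentence_info` list whose items have `text`, `start`, `end` (taken here to be milliseconds), and `spk` keys; that shape should be checked against the FunASR version in use. The helper name `segments_from_result` and the `meta` key names are hypothetical.

```python
def segments_from_result(result, source):
    """Turn FunASR-style output into content/meta dicts, one per sentence."""
    segments = []
    for entry in result:
        sentences = entry.get("sentence_info")
        if not sentences:
            # No per-sentence data: fall back to the whole transcript.
            segments.append(
                {"content": entry.get("text", ""), "meta": {"source": source}}
            )
            continue
        for s in sentences:
            segments.append(
                {
                    "content": s["text"],
                    "meta": {
                        "source": source,
                        "start_ms": s.get("start"),  # assumed milliseconds
                        "end_ms": s.get("end"),
                        "speaker": s.get("spk"),
                    },
                }
            )
    return segments
```

Each returned dict maps directly onto a Haystack document via `Document(content=seg["content"], meta=seg["meta"])`, which would make speaker and time range available for filtering at retrieval time.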

- GitHub: https://github.com/modelscope/FunASR (16K+ stars)

### Tasks
- [ ] The code is documented with docstrings and was merged in the `main` branch
- [ ] Docs are published at https://docs.haystack.deepset.ai/
- [ ] There is a GitHub workflow running the tests for the integration nightly and at every PR
- [ ] A new label named like `integration:<your integration name>` has been added to the list of labels for this [repository](https://github.com/deepset-ai/haystack-core-integrations/labels)
- [ ] The [labeler.yml](https://github.com/deepset-ai/haystack-core-integrations/blob/main/.github/labeler.yml) file has been updated
- [ ] The package has been released on PyPI
- [ ] An integration tile with a usage example has been added to https://github.com/deepset-ai/haystack-integrations
- [ ] The integration has been listed in the [Inventory section](https://github.com/deepset-ai/haystack-core-integrations#inventory) of this repo README
- [ ] The feature was announced through social media

