AgentDish directory
multispeaker
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
| #23 VibeVoice: Open-source voice AI from Microsoft with both long-form text-to-speech and speech recognition models. The repo highlights 90-minute multi-speaker TTS, 60-minute single-pass ASR, multilingual support, hotwording, and links to docs, Hugging Face, playground, finetuning, and papers. | Audio / Text-to-Speech / Speech Recognition | 90 | ↓ -14 | 27 days ago | Details |