AgentDish directory

multispeaker

Accepted listings with this tag.

Listing Category Score Trend Checked
#23 ↓ -14
VibeVoice

Open-source voice AI from Microsoft with both long-form text-to-speech and speech recognition models. The repo highlights 90-minute multi-speaker TTS, 60-minute single-pass ASR, multilingual support, hotwording, and links to docs, Hugging Face, playground, finetuning, and papers.

Audio / Text-to-Speech / Speech Recognition 90 ↓ -14 27 days ago Details