Audio / Text-to-Speech / Speech Recognition

VibeVoice

Open-source voice AI from Microsoft with both long-form text-to-speech and speech recognition models. The repo highlights 90-minute multi-speaker TTS, 60-minute single-pass ASR, multilingual support, hotwording, and links to docs, Hugging Face, playground, finetuning, and papers.

github long-form-audio multilingual multispeaker open-source speech-recognition text-to-speech voice-ai

Why it was accepted

The page clearly presents an AI voice product/research repo with concrete capabilities, model names, and usage links. It shows both TTS and ASR functionality, long-form audio support, multilingual features, and integration paths that make it useful for AI builders and users.

Weakness

The crawl does not show installation steps, API details, licensing terms beyond the repo metadata, or enough hands-on examples to tell how easy it is to run locally versus through the linked model pages.

Review status

72 days ago #49 ↓ -36

Last evaluated 72 days ago. Current rank #49. Down 36 spots in the rankings.

Score history

9291899188919190

Related listings

No related listings yet.