Speech to Text Open Source

News

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...

Release of the LLaSO Framework: Defining New Benchmarks for LSLM Research in Open Source Voice Models and Accelerating AI Voice Innovation

The 'ImageNet Moment' for LSLM Research? In the context of the flourishing development of large language models (LLMs), significant progress has been made in multimodal AI, particularly in the field ...

17don MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good at it

Microsoft's road to total AI domination continues with an interesting looking open-source project called VibeVoice. This text-to-speech model can generate conversational audio with multiple speakers, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

News

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Release of the LLaSO Framework: Defining New Benchmarks for LSLM Research in Open Source Voice Models and Accelerating AI Voice Innovation

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good at it

Trending now