News
Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...
The 'ImageNet Moment' for LSLM Research?
In the context of the flourishing development of large language models (LLMs), significant progress has been made in multimodal AI, particularly in the field ...
17d on MSN
Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good at it
Microsoft's road to total AI domination continues with an interesting-looking open-source project called VibeVoice. This text-to-speech model can generate conversational audio with multiple speakers, ...