下拉刷新
Repository Details
Shared bynavbar_avatar
repo_avatar
HelloGitHub Rating
0 ratings
Long-Duration Multi-Role Text-to-Speech Synthesis Framework
FreeMIT
Claim
Collect
Share
17k
Stars
No
Chinese
Python
Language
Yes
Active
4
Contributors
58
Issues
Yes
Organization
None
Latest
2k
Forks
MIT
License
More
VibeVoice image
This project is an open-source text-to-speech framework by Microsoft, designed to address the pain points of traditional TTS systems when generating long-form, multi-role dialogues (such as blogs and audiobooks). It can generate high-quality long audio up to 90 minutes in length with dialogues of 4 different roles in one go based on text, supporting 9 languages including Chinese and English.

Comments

Rating:
No comments yet