Chat with Wiki - CosyVoice

Answer

CosyVoice is a voice cloning model. Relevant information about CosyVoice follows:

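FunAudioLLM, an open-source framework from Alibaba's Tongyi Speech Team, includes CosyVoice, which generates natural speech with control over language, timbre, and emotion.
An open-source digital human tool supports CosyVoice for voice cloning. The tool ships as a one-click installer that needs no environment setup and is simple to use. It generates digital human videos, offers its interface in Chinese or English, and runs on Windows, Linux, and macOS. To use it, download the 8G+3G voice model packages and start the model. Links: GitHub: https://github.com/modstart-lib/aigcpanel ; website: https://aigcpanel.com ; https://x.com/imxiaohu/status/1872852432549765195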


References

[[Paper](https://fun-audio-llm.github.io/pdf/FunAudioLLM.pdf)] [[Code](https://github.com/FunAudioLLM)] [Modelscope: [SenseVoice](https://www.modelscope.cn/studios/iic/SenseVoice) [CosyVoice](https://www.modelscope.cn/studios/iic/CosyVoice-300M)] [HuggingFace: [SenseVoice](https://huggingface.co/FunAudioLLM/SenseVoiceSmall) CosyVoice]

Tongyi Speech Team, Alibaba Group

Abstract: This report introduces FunAudioLLM, a framework designed to enhance natural voice interactions between humans and large language models (LLMs). At its core are two innovative models: SenseVoice for high-precision multilingual speech recognition, emotion recognition, and audio event detection; and CosyVoice for natural speech generation with multi-language, timbre, and emotion control. SenseVoice delivers exceptionally low latency and supports over 50 languages, while CosyVoice excels in multi-lingual voice generation, zero-shot voice generation, cross-lingual voice cloning, and instruction-following capabilities. The models related to SenseVoice and CosyVoice have been open-sourced on Modelscope and Huggingface, along with the corresponding training, inference, and fine-tuning codes released on GitHub. By integrating these models with LLMs, FunAudioLLM enables applications such as speech translation, emotional voice chat, interactive podcasts, and expressive audiobook narration, thereby pushing the boundaries of voice interaction technology.

XiaoHu.AI Daily

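🔔 Xiaohu.AI Daily, December 29

1. 🛠️ Digital human tool recommendation: open source and suited to beginners
Features: one-click installer, no environment setup, simple to use.
Functions: generates digital human videos, supports speech synthesis and voice cloning, interface available in Chinese or English.
System compatibility: Windows, Linux, macOS.
Supported models: MuseTalk (text-to-speech), CosyVoice (voice cloning).
Usage: download the 8G+3G voice model packages, then start the model.
🔗 GitHub: [https://github.com/modstart-lib/aigcpanel](https://github.com/modstart-lib/aigcpanel)
🔗 Website: [https://aigcpanel.com](https://aigcpanel.com)
🔗 [https://x.com/imxiaohu/status/1872852432549765195](https://x.com/imxiaohu/status/1872852432549765195)

2. 🎥 Google Veo 2: AI-generated realistic vlog videos
Effect: the generated videos look close to real footage and are hard to tell apart from it.
Use: suited to creative work and content production.
🔗 [https://x.com/imxiaohu/status/1872984285634019476](https://x.com/imxiaohu/status/1872984285634019476)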
