实时语音交互数字人,支持端到端语音方案(GLM-4-Voice – THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,支持音色克隆,首包延迟低至3s。
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice – THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice – THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
项目:https://github.com/Henry-23/VideoChat
在线demo:https://www.modelscope.cn/studios/AI-ModelScope/video_chat
- 打赏
- 分享
分享到...
![数字人对话插图1 数字人对话插图1](https://xj520u.com/wp-content/plugins/wzbaibaoxiang/images/wzt_chahao.png)
请选择打赏方式
![数字人对话插图1 数字人对话插图1](https://xj520u.com/wp-content/plugins/wzbaibaoxiang/images/wzt_chahao.png)
![数字人对话插图2 数字人对话插图2](https://xj520u.com/wp-content/uploads/2024/05/1717085370-f8156a63e44a7c2.png)
![数字人对话插图3 数字人对话插图3](https://xj520u.com/wp-content/uploads/2024/05/1717085370-f8156a63e44a7c2.png)
- 微信
- 支付宝
© 版权声明
文章版权归作者所有,未经允许请勿转载。
THE END
暂无评论内容