Digital Human Conversation Demo

A real-time voice-interactive digital human that supports both an end-to-end voice pipeline (GLM-4-Voice – THG) and a cascaded pipeline (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3 s.
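As a rough illustration of the cascaded ASR-LLM-TTS-THG flow, the sketch below chains four stub stages in order. Every function name here is hypothetical and stands in for a real model; none of it is taken from the VideoChat codebase, and it assumes THG denotes a talking-head generation stage that turns speech into video frames.

```python
from typing import List

# Hypothetical stand-ins for the four stages; these are NOT VideoChat APIs.

def asr(audio: bytes) -> str:
    """Speech recognition stub: pretend the audio decodes directly to text."""
    return audio.decode("utf-8")

def llm(prompt: str) -> str:
    """Language model stub: echo the prompt back as the reply."""
    return f"echo: {prompt}"

def tts(text: str) -> bytes:
    """Speech synthesis stub: pretend the text encodes directly to audio."""
    return text.encode("utf-8")

def thg(speech: bytes) -> List[str]:
    """Talking-head stub: emit one placeholder frame per byte of speech."""
    return [f"frame_{i}" for i in range(len(speech))]

def cascade(audio: bytes) -> List[str]:
    """Run the stages in the order ASR -> LLM -> TTS -> THG."""
    return thg(tts(llm(asr(audio))))

frames = cascade(b"hi")
print(len(frames))
```

In the end-to-end variant, the first three stages would presumably collapse into a single voice model feeding THG, which is one plausible reason the two pipelines differ in latency.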

Project: https://github.com/Henry-23/VideoChat

Online demo: https://www.modelscope.cn/studios/AI-ModelScope/video_chat