Real-Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time.

Owner: Corentin Jemine
Platforms: Linux, Mac, Windows
License: Other
Categories: Python, Deep Learning
Topics: python, deep-learning, tensorflow, pytorch, tts, voice-cloning
Likes: 5


Real-Time Voice Cloning

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my thesis if you're curious or if you're looking for information I haven't documented yet (don't hesitate to open an issue for that too). I would mainly recommend taking a quick look at the figures beyond the introduction.

SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
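To make the data flow between the three stages concrete, here is a toy sketch in plain Python. The stages are usually described as a speaker encoder, a synthesizer, and a vocoder. Everything below (the function names `encode_speaker`, `synthesize`, `vocode`, `clone_voice`, the tiny embedding size, and the arithmetic inside each stage) is a hypothetical stand-in chosen for illustration. It is not this repository's API and it does not produce intelligible speech. It only shows that a fixed-size voice representation is computed once and then conditions the text-to-speech stage.

```python
import math
from typing import List

EMBED_DIM = 4  # hypothetical, tiny embedding size for illustration only


def encode_speaker(wav: List[float], dim: int = EMBED_DIM) -> List[float]:
    """Stage 1 stand-in: map audio of any length to a fixed-size unit vector."""
    chunk = max(1, len(wav) // dim)
    feats = []
    for i in range(dim):
        # Toy feature: mean absolute amplitude of the i-th chunk of audio.
        seg = wav[i * chunk:(i + 1) * chunk] or [0.0]
        feats.append(sum(abs(x) for x in seg) / len(seg))
    # L2-normalize so only the "shape" of the features matters, not loudness.
    norm = math.sqrt(sum(f * f for f in feats)) or 1.0
    return [f / norm for f in feats]


def synthesize(text: str, embed: List[float]) -> List[List[float]]:
    """Stage 2 stand-in: one fake spectrogram frame per character,
    shifted by the speaker embedding (the conditioning step)."""
    return [[(ord(ch) % 32) / 32.0 + e for e in embed] for ch in text]


def vocode(frames: List[List[float]], hop: int = 8) -> List[float]:
    """Stage 3 stand-in: expand each frame into `hop` waveform samples."""
    wav: List[float] = []
    for frame in frames:
        level = sum(frame) / len(frame)
        wav.extend(math.sin(2 * math.pi * n / hop) * level for n in range(hop))
    return wav


def clone_voice(reference_wav: List[float], text: str) -> List[float]:
    """Chain the three stages: audio -> embedding -> frames -> waveform."""
    embed = encode_speaker(reference_wav)
    frames = synthesize(text, embed)
    return vocode(frames)


# Two fake "speakers" with differently shaped reference audio.
speaker_a = [math.sin(0.1 * n) for n in range(800)]
speaker_b = [n / 800 for n in range(800)]
out_a = clone_voice(speaker_a, "hello")
out_b = clone_voice(speaker_b, "hello")
```

In this sketch the same text yields different output for the two reference clips because the embedding differs, which is the role the voice representation plays in the framework described above. In a real system each stand-in would be a trained neural network.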

Video demonstration (click the picture):

Papers implemented

Key metrics

Overview

Name and owner	CorentinJ/Real-Time-Voice-Cloning
Primary language	Python
Languages	Python (language count: 1)
Platforms	Linux, Mac, Windows
License	Other

Owner activity

Created	2019-05-26 16:56:15
Last push	2025-09-23 15:21:53
Last commit	2025-09-23 15:21:53
Releases	0

User engagement

Stars	58.7k
Watchers	0.9k
Forks	9.4k
Commits	299
Issues enabled?
Issues	1106
Open issues	162
Pull requests	49
Open pull requests	9
Closed pull requests	88

Project settings

Wiki enabled?
Archived?
Is fork?
Locked?
Is mirror?
Is private?