Real-Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time.

Owner: Corentin Jemine
Platforms: Linux, Mac, Windows
License: Other
Categories: Python, Deep learning
Topics: python, deep-learning, tensorflow, pytorch, tts, voice-cloning
Likes: 5



This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. Feel free to check my thesis if you're curious or if you're looking for information I haven't documented yet (don't hesitate to open an issue for that too). I mostly recommend taking a quick look at the figures beyond the introduction.

SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
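The three stages can be pictured as a simple data flow: reference audio to a fixed-size voice embedding, text plus embedding to a mel spectrogram, and mel spectrogram to a waveform. The sketch below only illustrates that flow. Every function name, the constants `EMBED_DIM`, `N_MELS` and `HOP`, and the arithmetic inside each stage are hypothetical stand-ins, not this repository's API or its trained models.

```python
import math
import random
from typing import List

EMBED_DIM = 256  # assumed embedding size, for illustration only
N_MELS = 80      # assumed number of mel channels
HOP = 200        # assumed waveform samples produced per mel frame


def encode_speaker(reference_wav: List[float]) -> List[float]:
    """Stage 1 stand-in: map audio of any length to a fixed-size, unit-norm vector."""
    rng = random.Random(len(reference_wav))  # placeholder for a trained encoder
    vec = [rng.gauss(0.0, 1.0) for _ in range(EMBED_DIM)]
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec]


def synthesize_mel(text: str, embedding: List[float]) -> List[List[float]]:
    """Stage 2 stand-in: produce mel frames from text, conditioned on the embedding."""
    bias = sum(embedding) / len(embedding)
    # One frame per character here; a real synthesizer learns the alignment.
    return [[(ord(ch) % 32) / 32.0 + bias for _ in range(N_MELS)] for ch in text]


def vocode(mel: List[List[float]]) -> List[float]:
    """Stage 3 stand-in: turn mel frames into waveform samples."""
    wav: List[float] = []
    for frame in mel:
        level = sum(frame) / len(frame)
        wav.extend(math.tanh(level) for _ in range(HOP))
    return wav


def clone_voice(reference_wav: List[float], text: str) -> List[float]:
    """Chain the three stages: embed the voice, synthesize a mel, vocode it."""
    embedding = encode_speaker(reference_wav)
    mel = synthesize_mel(text, embedding)
    return vocode(mel)
```

Calling `clone_voice(reference_wav, "hello")` with a few seconds of samples returns a list of `5 * HOP` placeholder samples. The point is the shape of the pipeline: the embedding is computed once per voice and can be reused for any text.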


Papers implemented

Key metrics

Overview

Name and owner	CorentinJ/Real-Time-Voice-Cloning
Primary language	Python
Languages	Python (language count: 1)
Platforms	Linux, Mac, Windows
License	Other

Owner activity

Created	2019-05-26 16:56:15
Pushed	2025-09-23 15:21:53
Last commit	2025-09-23 15:21:53
Releases	0

User engagement

Stars	58.7k
Watchers	0.9k
Forks	9.4k
Commits	299
Issues enabled?
Issues	1106
Open issues	162
Pull requests	49
Open pull requests	9
Closed pull requests	88

Project settings

Wiki enabled?
Archived?
Is fork?
Locked?
Is mirror?
Is private?