Commit Graph

109 Commits

Author SHA1 Message Date
flysmart
86ea11affd
Fre-GAN (#544)
* 替换了vocoder

* 修改了vocoder_train

* 减谱法

* 美化UI;语音增强;MFCC特征可视化

* 修复了训练fregan模型时的报错

* 增加了可以分析音频特征的独立文件

* 现已支持Fre-GAN声码器的训练

* 修复了训练fregan时保存模型的BUG

* 删除了无用的文件

* 优化了识别声码器模型的方式
2022-05-09 13:22:55 +08:00
Wings Music
875fe15069
Update readme for training encoder (#250) 2021-12-07 19:10:29 +08:00
zzxiang
4728863f9d
Fix inference on cpu device (#241) 2021-11-29 21:10:07 +08:00
hertz
a4daf42868
1k steps to save tmp hifigan model (#240) 2021-11-29 21:09:54 +08:00
harian
b50c7984ab
tacotron.py-Multi GPU with DataParallel (#231) 2021-11-27 20:53:08 +08:00
babysor00
26fe4a047d Differentiate GST token 2021-11-18 22:55:13 +08:00
babysor00
aff1b5313b Order of declared pytorch module matters 2021-11-17 00:12:27 +08:00
babysor
7dca74e032 Change default to use speaker embed for reference 2021-11-13 10:57:45 +08:00
babysor00
a37b26a89c 模型兼容问题加强 Compatibility Enhance of Pretrained Models and code base #209 2021-11-10 23:23:13 +08:00
babysor00
902e1eb537 Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese 2021-11-09 21:08:33 +08:00
babysor00
5c0e53a29a Fix #205 2021-11-09 21:08:28 +08:00
DragonDreamer
4edebdfeba
修复synthesizer/models/tacotron.Encoder注释错误 (#203)
fix Issue#202
2021-11-09 13:59:19 +08:00
babysor00
6c8f3f4515 Allow to select vocoder in web 2021-11-08 23:55:16 +08:00
babysor00
2bd323b7df Update readme 2021-11-07 21:59:03 +08:00
babysor00
3674d8b5c6 Use speaker embedding anyway even with default style 2021-11-07 21:48:15 +08:00
babysor00
80aaf32164 Add max steps control in toolbox 2021-11-06 13:27:11 +08:00
babysor00
c396792b22 Upload new models 2021-10-27 20:19:50 +08:00
babysor00
7c58fe01d1 Concat GST output instead of adding directly with original output 2021-10-23 10:28:32 +08:00
Vega
724194a4de
Add code to control finetune layers (#154) 2021-10-23 10:25:43 +08:00
babysor00
31bc6656c3 Fix bug of importing GST and add more parameters in toolbox 2021-10-21 00:40:00 +08:00
洛竹
aa35fb3139
docs: this repo -> 本代码库 (#157)
Co-authored-by: 洛竹 <youngjuning@aliyun.com>
2021-10-20 22:54:31 +08:00
babysor00
727eafc51b Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese 2021-10-20 00:27:19 +08:00
babysor00
d328ecba81 Reconstruct UI of toolbox 2021-10-20 00:27:13 +08:00
Vega
fad574118c
Update README-CN.md 2021-10-18 13:50:19 +08:00
babysor00
b0c156a537 Add new dataset support to preprocess parameter 2021-10-17 17:21:49 +08:00
Vega
724809abf4
Update README.md 2021-10-15 14:34:29 +08:00
Vega
05cd1a54ea
Add new pretrain model with gst 2021-10-14 01:26:23 +08:00
李子
245099c740
支持data_aishell(SLR33)数据集 (#141)
* 支持data_aishell(SLR33)数据集

* 更新readme
2021-10-12 23:40:27 +08:00
babysor00
6dd2af49fe Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese 2021-10-12 20:02:05 +08:00
babysor00
8b43ec9a64 Fix bug pre-processing magicdata 2021-10-12 20:01:37 +08:00
Vega
2a99f0ff05
Add gst (#137)
* Commit with working GST

* Make it backward compatible

* Add readme
2021-10-12 19:43:29 +08:00
babysor00
a824b54122 补充预处理文档 2021-10-12 09:22:10 +08:00
weida wang
81befb91b0
Update ui.py (#136)
Add minimize and maximize button of window
2021-10-11 17:17:36 +08:00
babysor00
e2017d0314 Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese into main 2021-10-05 10:48:58 +08:00
babysor00
547ac816df Update demo and training param
A
2021-10-05 10:48:54 +08:00
Ji Zhang
6b4ab39601
add alternative download source for dataset (google drive) (#112) 2021-10-03 10:10:40 +08:00
babysor00
b46e7a7866 New web with selecting wav files 2021-10-01 22:13:39 +08:00
babysor00
8a384a1191 Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese into main 2021-10-01 09:33:31 +08:00
Nemo
11154783d8
web tool box update UI (#111)
* web tool box update UI

* update img
2021-10-01 00:32:29 +08:00
AkifSaeed20
d52db0444e
Update launch.json (#109) 2021-10-01 00:22:43 +08:00
babysor00
790d11a58b Allow to train encoder 2021-10-01 00:01:33 +08:00
babysor00
cb82fcfe58 Add Zhihu link (chinese Quora) to readme 2021-09-27 21:59:44 +08:00
babysor00
26ecb7546d Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese into main 2021-09-27 21:45:55 +08:00
babysor00
f64914fca8 add new model 2021-09-27 21:45:34 +08:00
Jenkey2011
512da52775
Update README-CN.md (#102) 2021-09-27 21:22:20 +08:00
babysor00
9c219f05c2 Add file structure in readme 2021-09-26 12:27:57 +08:00
babysor00
4d9e460063 Change synthesizer read to restful style 2021-09-26 10:01:50 +08:00
Vega
0d0b55d3e9
Web server release v2 (#99)
* Init App

* init server.py (#93)

* init server.py

* Update requirements.txt

Add requirement

Co-authored-by: auau <auau@test.com>
Co-authored-by: babysor00 <babysor00@gmail.com>

* Run web.py!

Run web.py!

* Restruct readme and add instruction to use web server

* fix training preprocess of vocoder

* Init App

* init server.py (#93)

* init server.py

* Update requirements.txt

Add requirement

Co-authored-by: auau <auau@test.com>
Co-authored-by: babysor00 <babysor00@gmail.com>

* Run web.py!

Run web.py!

* fix training preprocess of vocoder

* Refactor to restful style

Co-authored-by: balala <Ozgay@users.noreply.github.com>
Co-authored-by: auau <auau@test.com>
2021-09-25 17:07:46 +08:00
hertz
4acfee2a64
Support tensorboard to trace the training of Synthesizer (#98)
* add tensorborad tracing

* add log_every params
2021-09-25 17:06:51 +08:00
babysor00
99269b2046 [Fix bug] remove n processes of embed in pre.py 2021-09-24 20:52:33 +08:00