Commit Graph

38 Commits

Author SHA1 Message Date
AyahaShirane
ad22997614
Fixed issue #372 (#379)
Fixed some problems caused by parameter passing, and replaced the deprecated torch.nn.functional.tanh() with torch.tanh()
2022-02-27 11:02:01 +08:00
babysor00
0536874dec Add config file for pretrained 2022-02-23 09:37:39 +08:00
babysor00
8ad9ba2b60 change naming logic of saving trained file for synthesizer to allow shorter interval 2022-01-15 17:56:14 +08:00
D-Blue
b56ec5ee1b
Fix a UserWarning (#273)
Fix a UserWarning in synthesizer/synthesizer_dataset.py caused by converting a list of numpy arrays to a torch tensor at line 85.
2021-12-20 20:33:12 +08:00
zzxiang
4728863f9d
Fix inference on cpu device (#241) 2021-11-29 21:10:07 +08:00
harian
b50c7984ab
tacotron.py-Multi GPU with DataParallel (#231) 2021-11-27 20:53:08 +08:00
babysor00
26fe4a047d Differentiate GST token 2021-11-18 22:55:13 +08:00
babysor00
aff1b5313b Order of declared pytorch module matters 2021-11-17 00:12:27 +08:00
babysor
7dca74e032 Change default to use speaker embed for reference 2021-11-13 10:57:45 +08:00
babysor00
a37b26a89c Compatibility enhancement of pretrained models and code base #209 2021-11-10 23:23:13 +08:00
babysor00
902e1eb537 Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese 2021-11-09 21:08:33 +08:00
babysor00
5c0e53a29a Fix #205 2021-11-09 21:08:28 +08:00
DragonDreamer
4edebdfeba
Fix comment error in synthesizer/models/tacotron.Encoder (#203)
Fix issue #202
2021-11-09 13:59:19 +08:00
babysor00
3674d8b5c6 Use speaker embedding anyway even with default style 2021-11-07 21:48:15 +08:00
babysor00
80aaf32164 Add max steps control in toolbox 2021-11-06 13:27:11 +08:00
babysor00
7c58fe01d1 Concat GST output instead of adding directly with original output 2021-10-23 10:28:32 +08:00
Vega
724194a4de
Add code to control finetune layers (#154) 2021-10-23 10:25:43 +08:00
babysor00
31bc6656c3 Fix bug of importing GST and add more parameters in toolbox 2021-10-21 00:40:00 +08:00
李子
245099c740
Support the data_aishell (SLR33) dataset (#141)
* Support the data_aishell (SLR33) dataset

* Update readme
2021-10-12 23:40:27 +08:00
babysor00
6dd2af49fe Merge branch 'main' of https://github.com/babysor/Realtime-Voice-Clone-Chinese 2021-10-12 20:02:05 +08:00
babysor00
8b43ec9a64 Fix bug pre-processing magicdata 2021-10-12 20:01:37 +08:00
Vega
2a99f0ff05
Add gst (#137)
* Commit with working GST

* Make it backward compatible

* Add readme
2021-10-12 19:43:29 +08:00
babysor00
547ac816df Update demo and training param
A
2021-10-05 10:48:54 +08:00
Vega
0d0b55d3e9
Web server release v2 (#99)
* Init App

* init server.py (#93)

* init server.py

* Update requirements.txt

Add requirement

Co-authored-by: auau <auau@test.com>
Co-authored-by: babysor00 <babysor00@gmail.com>

* Run web.py!

Run web.py!

* Restructure readme and add instructions for using the web server

* fix training preprocess of vocoder

* Init App

* init server.py (#93)

* init server.py

* Update requirements.txt

Add requirement

Co-authored-by: auau <auau@test.com>
Co-authored-by: babysor00 <babysor00@gmail.com>

* Run web.py!

Run web.py!

* fix training preprocess of vocoder

* Refactor to restful style

Co-authored-by: balala <Ozgay@users.noreply.github.com>
Co-authored-by: auau <auau@test.com>
2021-09-25 17:07:46 +08:00
hertz
4acfee2a64
Support tensorboard to trace the training of Synthesizer (#98)
* add tensorboard tracing

* add log_every params
2021-09-25 17:06:51 +08:00
Vega
28e6bce570
Web server: Add latest changes (#96)
* Init App

* init server.py (#93)

* init server.py

* Update requirements.txt

Add requirement

Co-authored-by: auau <auau@test.com>
Co-authored-by: babysor00 <babysor00@gmail.com>

* Run web.py!

Run web.py!

* Restructure readme and add instructions for using the web server

* fix training preprocess of vocoder

Co-authored-by: balala <Ozgay@users.noreply.github.com>
Co-authored-by: auau <auau@test.com>
2021-09-24 09:47:51 +08:00
Vega
ddd478c0ad
Web server (#94)
* Init App

* init server.py (#93)

* init server.py

* Update requirements.txt

Add requirement

Co-authored-by: auau <auau@test.com>
Co-authored-by: babysor00 <babysor00@gmail.com>

* Run web.py!

Run web.py!

Co-authored-by: balala <Ozgay@users.noreply.github.com>
Co-authored-by: auau <auau@test.com>
2021-09-21 16:56:12 +08:00
babysor00
4178416385 [FIX] Fix preprocessing bug for aishell3 2021-09-19 00:09:16 +08:00
babysor00
5c0cb50c3e Add quick path to preprocess audio, denoise audio when loading in toolbox 2021-08-30 22:22:06 +08:00
babysor00
5950eea895 Support training your own vocoder 2021-08-29 15:43:54 +08:00
babysor00
17d47589c1 Fix compatibility issue of symbols 2021-08-29 00:48:07 +08:00
Nemo
0cc3f569fa [dataset] Support aishell3 (tested) 2021-08-25 23:11:29 +08:00
babysor00
3c86cd5bca [bugfix] Fix bug causing nonsensical output for long texts (fixes mispronunciation of multi-paragraph text) 2021-08-22 23:44:25 +08:00
babysor00
feb1c7cb88 rename slr68 to magicdata to keep consistent naming convention
(cherry picked from commit bbdad858ebc4d0ee3b720ba22ae3e0ce9732a734)
2021-08-17 21:07:13 +08:00
unknown
aa1c345629 Add Chinese character input support 2021-08-15 22:53:40 +08:00
babysor00
4d6da5d49b Support new dataset SLR68! Try: python synthesizer_preprocess_audio.py ...\slr --dataset SLR68 2021-08-13 00:46:21 +08:00
babysor00
856793c9bd Refactor the synthesizer preprocessor to prepare for supporting more datasets 2021-08-11 23:33:43 +08:00
babysor00
e46cd609a2 Init to support Chinese Dataset. 2021-08-07 11:56:00 +08:00