From c21d2c11dd7a62c6dca319857eb404889e341b7c Mon Sep 17 00:00:00 2001 From: Weijia Chen Date: Wed, 11 Aug 2021 09:51:02 +0800 Subject: [PATCH] Update note to avoid using wrong models --- README.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 73372d7..2a74b90 100644 --- a/README.md +++ b/README.md @@ -27,9 +27,10 @@ * Install [ffmpeg](https://ffmpeg.org/download.html#get-packages). * Run `pip install -r requirements.txt` to install the remaining necessary packages. -### 2. reuse the pretrained encoder/vocoder -* Download the following models and extract to the root directory of this project. +### 2. Reuse the pretrained encoder/vocoder +* Download the following models and extract to the root directory of this project. Don't use the synthesizer https://github.com/CorentinJ/Real-Time-Voice-Cloning/wiki/Pretrained-models +> Note that we need to specify the newly trained synthesizer model, since the original model is incompatible with the Chinese sympols. It means the demo_cli is not working at this moment. ### 3. Train synthesizer with aidatatang_200zh * Download aidatatang_200zh dataset and unzip: make sure you can access all .wav in *train* folder * Preprocess with the audios and the mel spectrograms: