MMVC Server
----
# How to start
(1) Place the dataset in `trainer/dataset`
```sh
trainer/dataset/
├── 00_myvoice
│   ├── text
│   │   ├── emotion001.txt
│   │   ├── emotion002.txt
...
│   │   └── emotion100.txt
│   └── wav
│       ├── emotion001.wav
│       ├── emotion002.wav
...
│       └── emotion100.wav
├── 1205_zundamon
│   ├── text
│   │   ├── emoNormal_001.txt
│   │   ├── emoNormal_002.txt
...
│   │   └── emoNormal_100.txt
│   └── wav
│       ├── emoNormal_001.wav
│       ├── emoNormal_002.wav
...
│       └── emoNormal_100.wav
├── 344_tsumugi
│   ├── text
│   │   ├── VOICEACTRESS100_001.txt
│   │   ├── VOICEACTRESS100_002.txt
...
│   │   └── emoNormal_100.txt
│   └── wav
│       ├── VOICEACTRESS100_001.wav
│       ├── VOICEACTRESS100_002.wav
...
│       └── emoNormal_100.wav
└── multi_speaker_correspondence.txt
```
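
Before starting a long training run, it can help to confirm that the dataset matches the layout above. The following is a minimal sketch, not part of the repository: `check_dataset` is a hypothetical helper, and it assumes that every speaker directory needs both `text/` and `wav/` and that each `.txt` file is paired with a `.wav` file of the same stem, as the tree suggests.

```python
from pathlib import Path


def check_dataset(root):
    """Return a list of human-readable problems found under `root`."""
    root = Path(root)
    if not root.is_dir():
        return [f"{root}: not a directory"]

    problems = []
    if not (root / "multi_speaker_correspondence.txt").is_file():
        problems.append("missing multi_speaker_correspondence.txt")

    # Each sub-directory is treated as one speaker (e.g. 00_myvoice).
    for speaker in sorted(p for p in root.iterdir() if p.is_dir()):
        text_dir, wav_dir = speaker / "text", speaker / "wav"
        if not text_dir.is_dir() or not wav_dir.is_dir():
            problems.append(f"{speaker.name}: needs both text/ and wav/")
            continue
        texts = {p.stem for p in text_dir.glob("*.txt")}
        wavs = {p.stem for p in wav_dir.glob("*.wav")}
        # ASSUMPTION: transcripts and recordings are paired by file stem.
        for stem in sorted(texts ^ wavs):
            missing = "wav" if stem in texts else "txt"
            problems.append(f"{speaker.name}: {stem} has no matching .{missing}")
    return problems


if __name__ == "__main__":
    for problem in check_dataset("trainer/dataset"):
        print(problem)
```

An empty result means the layout passed these checks; it does not validate the audio itself or the contents of `multi_speaker_correspondence.txt`.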
(2) Copy `start_trainer.sh` to the root directory

(3) Run `bash start_trainer.sh`

(4) Run the following commands inside the Docker container.
Adjust the batch size as needed.
```sh
$ cp configs_org/baseconfig.json configs/
$ python3 normalize.py True
$ python3 create_dataset.py -f train_config -s 24000 -m dataset/multi_speaker_correspondence.txt
$ tensorboard --logdir logs --port 5000 --bind_all &
# Change the batch size
$ python3 train_ms.py -c configs/train_config.json -m 20220306_24000 -fg fine_model/G_v15_best.pth -fd fine_model/D_v15_best.pth
$ python3 train_ms.py -c configs/train_config.json -m 20220306_24000
```
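
The batch-size adjustment in step (4) can be done by hand in an editor, or scripted. The sketch below is one possible way to do it and is not part of the repository: `set_batch_size` is a hypothetical helper, and it assumes the generated config stores the value at `train.batch_size`, as VITS-style configs usually do. If the layout differs, the function raises `KeyError` instead of silently writing a new key.

```python
import json
from pathlib import Path


def set_batch_size(config_path, batch_size):
    """Rewrite the config in place with a new batch size; return the old value."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))

    # ASSUMPTION: the value lives at config["train"]["batch_size"].
    # A KeyError here means the config uses a different layout.
    old = config["train"]["batch_size"]
    config["train"]["batch_size"] = int(batch_size)

    path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return old


# Example call (run after create_dataset.py has written the config):
# set_batch_size("configs/train_config.json", 16)
```

Returning the previous value makes it easy to log what was changed or to restore it later.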
(x) Test
```
$ python3 MMVC_Client/python/conver_test.py -m logs/G_40000.pth -c configs/train_config.json -s 0 -t 101 --input dataset/00_myvoice/wav/emotion011.wav --output dataset/test.wav --f0_scale 3
```
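
The test command above hard-codes a checkpoint name. To pick the most recent generator checkpoint instead, something like the following sketch could be used. It is not part of the repository: `latest_generator_checkpoint` is a hypothetical helper, and it assumes checkpoints in `logs/` follow the `G_<step>.pth` pattern seen in this README. Steps are compared numerically because plain string sorting would rank `G_40000.pth` above `G_220000.pth`.

```python
import re
from pathlib import Path

# Matches names such as G_40000.pth; G_v15_best.pth is deliberately excluded.
_CHECKPOINT = re.compile(r"G_(\d+)\.pth")


def latest_generator_checkpoint(log_dir="logs"):
    """Return the G_<step>.pth path with the highest step, or None if none exist."""
    candidates = []
    for path in Path(log_dir).glob("G_*.pth"):
        match = _CHECKPOINT.fullmatch(path.name)
        if match:
            candidates.append((int(match.group(1)), path))
    if not candidates:
        return None
    # Compare on the integer step only.
    return max(candidates, key=lambda item: item[0])[1]


if __name__ == "__main__":
    print(latest_generator_checkpoint("logs"))
```

The printed path can then be passed to the `-m` option of the test command.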
(x) ONNX export

`python3 onnx_export.py --config_file logs/train_config.json --convert_pth logs/G_220000.pth`