mirror of https://github.com/w-okada/voice-changer.git synced 2025-01-23 13:35:12 +03:00

リアルタイムボイスチェンジャー Realtime Voice Changer

Go to file

wok 3971a5bb4c update		2024-06-30 16:15:45 +09:00
.github	Update issue.yaml	2023-10-14 04:28:43 +09:00
.vscode	initialize v2	2024-06-03 20:19:11 +09:00
docs	v.2.0.23-alpha	2024-06-13 03:21:30 +09:00
signatures/version1	@vitaliylag has signed the CLA from Pull Request #1224	2024-06-01 03:14:09 +00:00
.gitignore	initialize v2	2024-06-03 20:19:11 +09:00
LICENSE	add license	2023-08-07 10:59:39 +09:00
LICENSE-CLA	update	2023-08-07 20:17:03 +09:00
LICENSE-NOTICE	WIP: Beatrice	2023-11-04 03:29:54 +09:00
README_en.md	update	2024-06-30 16:15:45 +09:00
README_ko.md	update	2024-06-30 16:15:45 +09:00
README.md	update	2024-06-30 16:15:45 +09:00
w_okada's_Voice_Changer_version_2_x.ipynb	Colab を使用して作成されました	2024-06-30 16:12:37 +09:00

README_en.md

VC Client

Japanese Korean

What's New!

v.2.0.30-alpha Colab version released. ⇒ Here
- ngrok is no longer needed. You can use it without a ngrok account.
v.2.0.27-alpha
- Feature
  - Support for Beatrice v2 alpha2: formant changes, improved quality
- Logging enhancement
  - Added download button
- Improvements
  - Prevent double-clicking on upload
  - Display during upload
  - Fixed typo: paththrough -> passthrough
- Bug fixes
  - Added handling for when undefined is returned in the performance monitor
v.2.0.24-alpha Colab version released. ⇒ Here
v.2.0.24-alpha
- Bugfix:
  - Addressed the issue where sound stops when switching modes
- Others:
  - Enhanced logger
  - Improved error screen
v.2.0.23-alpha
- Reorganizing Editions
  - win_std: For typical Windows users. Hardware acceleration via DirectML is available for both ONNX and torch models.
  - win_cuda: For Nvidia GPU owners. Hardware acceleration via CUDA is available for both ONNX and torch models. Requires CUDA 12.4 or later.
  - mac: For Apple Silicon (e.g., M1) users.
- feature
  - Added the capability to adjust the output buffer when operating in client mode
- bugfix:
  - Fixed the issue of retaining index and icon when exporting RVC's torch model to onnx model
- Other:
  - Enhanced logger
v.2.0.20-alpha
- Support for torch-cuda. See the edition description here.
- Bugfix:
  - Unified file encoding to UTF-8
v.2.0.16-alpha
- Added support for experimental version of torch-dml. For a description of the edition, refer to here.
- Bugfix:
  - Fixed the issue where both pth and index files could not be uploaded simultaneously during rvc file upload.
v.2.0.13-alpha
- Added support for onnxruntime-gpu. Release of the CUDA edition.
- Bugfix:
  - Addressed issues with onnxcrepe
  - Fixed ID selection issue in Beatrice v2 API
- Others:
  - Enhanced logger
v.2.0.6-alpha
- New
  - Now compatible with M1 series Macs.
    - Confirmed to work on M1 MBA (Monterey) and M2 Pro MBP (Ventura).
    - Looking for reports on performance with Sonoma.
- Bugfix:
  - Fixed a bug where the pitch would revert when selecting a speaker in Beatrice.
- Others:
  - Enhanced information gathering for debugging purposes.
v.2.0.5-alpha
- VCClient has been rebooted as a second version.
- Major software structure changes have been made to improve extensibility.
- Providing REST API to facilitate client development by third parties.
- Edition system has been completely revamped.
  - The Standard Edition (win) runs on ONNX models by default regardless of the presence of a GPU. Please convert Torch models to ONNX models before use. Hardware acceleration is only effective with ONNX models for users with a GPU.
  - The CUDA Edition (win) is optimized specifically for Nvidia GPUs. It offers further speed enhancements compared to the Standard Edition. Hardware acceleration is only effective with ONNX models.
  - Torch models can also be hardware accelerated using PyTorch models.
  - The Mac Edition is for Mac users with Apple Silicon.
  - Linux users or those with knowledge of Python can clone the repository and run it.
- Currently, only the Standard Edition is available in the Alpha version.

What is VC Client

This is a client software for performing real-time voice conversion using various Voice Conversion (VC) AI. The supported AI for voice conversion are as follows.

RVC(Retrieval-based-Voice-Conversion)
Beatrice JVS Corpus Edition * experimental, (NOT MIT Licnsence see readme) * Only for Windows, CPU dependent

Distribute the load by running Voice Changer on a different PC The real-time voice changer of this application works on a server-client configuration. By running the MMVC server on a separate PC, you can run it while minimizing the impact on other resource-intensive processes such as gaming commentary.

Cross-platform compatibility Supports Windows, Mac (including Apple Silicon M1), Linux, and Google Colaboratory.
We provide a REST API.

You can operate it using HTTP clients that are built into the OS, such as curl.
This allows you to easily achieve the following:
- Users can register processes that call the REST API in shortcuts, such as in .bat files.
- Create simple clients to operate remotely.
- And more.

Download

Please download it from Hugging Face.

Manual

Software Signing

This software is not signed by the developer. A warning message will appear, but you can run the software by clicking the icon while holding down the control key. This is due to Apple's security policy. Running the software is at your own risk.

https://user-images.githubusercontent.com/48346627/212569645-e30b7f4e-079d-4504-8cf8-7816c5f40b00.mp4

Acknowledgments

This software uses the voice data of the free material character "Tsukuyomi-chan," which is provided for free by CV. Yumesaki Rei.

Tsukuyomi-chan Corpus (CV. Yumesaki Rei)

https://tyc.rei-yumesaki.net/material/corpus/

Copyright. Rei Yumesaki

Terms of Use

In accordance with the Tsukuyomi-chan Corpus Terms of Use for the Tsukuyomi-chan Real-time Voice Changer, the use of the converted voice for the following purposes is prohibited.

Criticizing or attacking individuals (the definition of "criticizing or attacking" is based on the Tsukuyomi-chan character license).
Advocating for or opposing specific political positions, religions, or ideologies.
Publicly displaying strongly stimulating expressions without proper zoning.
Publicly disclosing secondary use (use as materials) for others. (Distributing or selling as a work for viewing is not a problem.)

Regarding the Real-time Voice Changer Amitaro, we prohibit the following uses in accordance with the terms of use of the Amitaro's koe-sozai kobo.detail

Regarding the Real-time Voice Changer Kikoto Mahiro, we prohibit the following uses in accordance with the terms of use of Replica doll.detail

Disclaimer

We are not liable for any direct, indirect, consequential, incidental, or special damages arising out of or in any way connected with the use or inability to use this software.