voice-changer/Realtime_Voice_Changer_on_Colab.ipynb

189 lines
6.2 KiB
Plaintext
Raw Normal View History

2023-08-10 14:41:27 +03:00
{
"cells": [
{
"cell_type": "markdown",
"metadata": {
"id": "view-in-github",
"colab_type": "text"
},
"source": [
"<a href=\"https://colab.research.google.com/github/w-okada/voice-changer/blob/master/Realtime_Voice_Changer_on_Colab.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
]
},
2023-08-10 14:41:27 +03:00
{
"cell_type": "markdown",
"metadata": {
"id": "Lbbmx_Vjl0zo"
},
"source": [
"Realtime Voice Changer by w-okada\n",
"---\n",
"\n",
"This is a attempt to run [Realtime Voice Changer](https://github.com/w-okada/voice-changer) on Google Colab.\\\n",
"Colab File updated by [rafacasari](https://github.com/Rafacasari)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"cellView": "form",
"id": "vV1t7PBRm-o6"
},
"outputs": [],
"source": [
"# @title **Always use Colab GPU!**\n",
"# @markdown **A GPU can be used for faster processing.**\\\n",
"# @markdown You can check the Colab GPU running this cell.\\\n",
"# @markdown or use the menu **Runtime** -> **Change runtime** -> **Hardware acceleration** to select a GPU, if needed.\n",
"!nvidia-smi"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "aLypf-RLIK-w"
},
"source": [
"---"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"cellView": "form",
"id": "RhdqDSt-LfGr"
},
"outputs": [],
"source": [
"# @title **[Optional]** Setup/Start Google Drive\n",
2023-08-10 15:18:36 +03:00
"# @markdown Using Google Drive can improve load times, since you will not need to re-download every time you use this.\n",
"\n",
2023-08-10 14:41:27 +03:00
"import os\n",
"from google.colab import drive\n",
"\n",
"if not os.path.exists('/content/drive'):\n",
" drive.mount('/content/drive')\n",
"\n",
"%cd /content/drive/MyDrive"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"cellView": "form",
"id": "86wTFmqsNMnD"
},
"outputs": [],
"source": [
"# @title **[1]** Clone the repository\n",
"# @markdown Clone the repository using this cell, this process should be really fast.\n",
"\n",
"!git clone --depth 1 https://github.com/w-okada/voice-changer.git &> /dev/null\n",
"\n",
"%cd voice-changer/server/\n",
"print(\"\\033[92mSuccessfully cloned the repository, proceed to the next cell!\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"cellView": "form",
"id": "LwZAAuqxX7yY"
},
"outputs": [],
"source": [
"# @title **[2]** Install Modules\n",
"# @markdown This cell can take a few minutes to run.\n",
"\n",
"!apt-get install libportaudio2 &> /dev/null\n",
"!pip install onnxruntime-gpu uvicorn faiss-gpu fairseq jedi google-colab moviepy decorator==4.4.2 sounddevice numpy==1.23.5 pyngrok --quiet\n",
"!pip install -r requirements.txt --no-build-isolation --quiet\n",
"# Maybe install Tensor packages?\n",
"#!pip install torch-tensorrt\n",
"#!pip install TensorRT\n",
"print(\"\\033[92mSuccessfully installed all packages!\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"cellView": "form",
"id": "lQxvqDH2D1MK"
},
"outputs": [],
"source": [
"# @title **[3]** Setup ngrok account\n",
"# @markdown Setup a free account at [Ngrok](https://dashboard.ngrok.com/signup), then click [this link](https://dashboard.ngrok.com/get-started/your-authtoken) to get your auth token, copy it and place it here:\n",
"\n",
"NgrokToken = '' # @param {type:\"string\"}\n",
"\n",
"!ngrok config add-authtoken {NgrokToken}"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "lLWQuUd7WW9U",
"cellView": "form"
},
"outputs": [],
"source": [
"# @title **[4]** Start Server\n",
"# @markdown **Run this cell AFTER ngrok setup!**\\\n",
"# @markdown This cell will start the server, the first time that you run it will download the example models, so it can take a while and console may be spammed a little bit. **So be fast to open the server link.**\n",
"import portpicker\n",
"PORT = portpicker.pick_unused_port()\n",
"\n",
"from pyngrok import ngrok\n",
"public_url = ngrok.connect(PORT).public_url\n",
2023-08-10 14:46:16 +03:00
"\n",
2023-08-10 14:41:27 +03:00
"print(\"Your server will be available at:\")\n",
"print(public_url)\n",
"print(public_url)\n",
"print(public_url)\n",
"print(\"Starting the server in 3 seconds\")\n",
"\n",
"!sleep 3\n",
"!python3 MMVCServerSIO.py \\\n",
" -p {PORT} \\\n",
" --https False \\\n",
" --content_vec_500 pretrain/checkpoint_best_legacy_500.pt \\\n",
" --content_vec_500_onnx pretrain/content_vec_500.onnx \\\n",
" --content_vec_500_onnx_on true \\\n",
" --hubert_base pretrain/hubert_base.pt \\\n",
" --hubert_base_jp pretrain/rinna_hubert_base_jp.pt \\\n",
" --hubert_soft pretrain/hubert/hubert-soft-0d54a1f4.pt \\\n",
" --nsf_hifigan pretrain/nsf_hifigan/model \\\n",
" --crepe_onnx_full pretrain/crepe_onnx_full.onnx \\\n",
" --crepe_onnx_tiny pretrain/crepe_onnx_tiny.onnx \\\n",
" --rmvpe pretrain/rmvpe.pt \\\n",
" --model_dir model_dir \\\n",
" --samples samples.json \\\n",
" --colab True"
]
}
],
"metadata": {
"accelerator": "GPU",
"colab": {
"provenance": [],
"include_colab_link": true
2023-08-10 14:41:27 +03:00
},
"gpuClass": "standard",
"kernelspec": {
"display_name": "Python 3",
"name": "python3"
},
"language_info": {
"name": "python"
}
},
"nbformat": 4,
"nbformat_minor": 0
}