This repository contains neural network training scripts and trained models of guitar amplifiers and distortion pedals. The 'Results' directory contains some example recurrent neural network models trained to emulate the HT-1 amplifier and the Big Muff Pi fuzz pedal; these models are described in this conference paper
- resample feature: just set the desired sample rate for the model and the training script will adjust accordingly!
- multiple network types available for training: SimpleRNN (Recurrent Network) with LSTM or GRU cells, ConvSimpleRNN (Convolutional + Recurrent Network)...and other experimental work!
- a way to export models generated here in a format compatible with RTNeural
- a method to define portions of the dataset (train, test, val and more) via a CSV file following Reaper's region-marker CSV export format
- an upgraded loss-function pre-emphasis filter: an A-weighting FIR filter in cascade with a low-pass filter (see the paper "Perceptual Loss Function for Neural Modelling of Audio Systems"; a minimal sketch follows this list)
- support for multiple loss function types, leveraging the auraloss Python package
- a way to generate an ESR vs time audio track, useful to troubleshoot training with complex datasets
- a Jupyter notebook (.ipynb) to perform training with Google Colab
- a Docker Compose application with CUDA support to run the very same notebook locally, see docker-compose.yml
- a multi-platform desktop plugin to run the models generated here in your favourite DAW
- an LV2 plugin, a stripped-down version of the previous plugin, to be used on embedded Linux devices such as the RPi, MOD Dwarf, AIDA DSP OS, etc.
- we now perform model validation at runtime, inside the plugins. This allows us to freely develop the plugins and bring in new backend features while immediately spotting regressions, as well as improvements, in the code that runs the network
- we inject metadata, including the loss (ESR), directly into the model file, ready for integration with cloud services like Tone Hunt
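For reference, the pre-emphasised ESR loss mentioned in the list above works by filtering both the target and the network output with a perceptual pre-emphasis filter before computing the error-to-signal ratio. The following is a minimal sketch of that idea, not the repository's implementation: the FIR taps `b` are a simple first-difference placeholder standing in for the A-weighting / low-pass cascade used by the training code.

```python
import torch
import torch.nn.functional as F

def pre_emphasis(x, b):
    """Apply an FIR pre-emphasis filter with taps b to a (batch, 1, time) signal."""
    kernel = torch.flip(b, [0]).view(1, 1, -1)       # conv1d is cross-correlation, so flip the taps
    y = F.conv1d(x, kernel, padding=b.numel() - 1)   # zero-pad so the filter is causal
    return y[..., : x.shape[-1]]

def esr_loss(pred, target, b, eps=1e-8):
    """Error-to-signal ratio computed on pre-emphasised signals."""
    pred_f = pre_emphasis(pred, b)
    target_f = pre_emphasis(target, b)
    return ((target_f - pred_f) ** 2).sum() / ((target_f ** 2).sum() + eps)

# Placeholder taps; the real cascade uses an A-weighting FIR followed by a low-pass filter.
b = torch.tensor([1.0, -0.85])
pred, target = torch.randn(2, 1, 24000), torch.randn(2, 1, 24000)
print(esr_loss(pred, target, b).item())
```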
Visit the link here to perform the training online
You need to use our docker container on your machine: aidadsp/pytorch:latest
git clone https://github.com/AidaDSP/Automated-GuitarAmpModelling.git
cd Automated-GuitarAmpModelling
git checkout aidadsp_devel
git submodule update --init --recursive
From now on, we will refer to <THIS_DIR> as the path from which you launched the above commands.
docker compose up -d
The Jupyter web UI will be accessible in your browser at http://127.0.0.1:8888; simply enter the password (aidadsp)
docker compose up -d
docker exec -it aidadsp bash
Now, in the Docker container's bash shell, you can run commands. First, identify a Configs file that you want to use and tweak it to suit your needs:
python prep_wav.py -l LSTM-12-1.json -n
where -n applies normalization (advised). If you want to control which portions of the file are used for train, val and test, you can inspect the CSV file that is mentioned in the config file.
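As an illustration only (the exact column layout of the repository's CSV may differ), here is how a Reaper-style region export with 'Name', 'Start' and 'End' columns in seconds could be turned into sample ranges; the file path is hypothetical:

```python
import csv

def regions_to_sample_ranges(csv_path, samplerate=48000):
    """Map region names such as 'train', 'val' and 'test' to lists of (start, end) sample indices."""
    ranges = {}
    with open(csv_path, newline="") as f:
        for row in csv.DictReader(f):
            name = row["Name"].strip().lower()
            start = int(float(row["Start"]) * samplerate)
            end = int(float(row["End"]) * samplerate)
            ranges.setdefault(name, []).append((start, end))
    return ranges

# Hypothetical path, for illustration only.
print(regions_to_sample_ranges("Data/regions.csv"))
```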
Second, perform the training, always passing the same Configs file, where all the information is stored:
python dist_model.py -l LSTM-12-1.json -slen 24000 --seed 39 -lm 0
where -slen sets the chunk length used during training; here 24000 / 48000 = 0.5 s = 500 ms at a 48000 Hz sampling rate. For the other parameters, please open the script.
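To make the -slen parameter concrete, here is a minimal sketch (not the repository's data loader) of how a long recording can be split into fixed-length training chunks:

```python
import numpy as np

def split_into_chunks(audio, seq_len=24000):
    """Split a mono signal into non-overlapping chunks of seq_len samples; at 48 kHz,
    24000 samples correspond to 500 ms. Trailing samples that do not fill a chunk are dropped."""
    n_chunks = len(audio) // seq_len
    return audio[: n_chunks * seq_len].reshape(n_chunks, seq_len)

audio = np.random.randn(48000 * 10)      # 10 s of dummy audio at 48 kHz
print(split_into_chunks(audio).shape)    # (20, 24000)
```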
Finally, you want to convert the model, with all the weights exported from PyTorch, into a format suitable for use with the RTNeural library, which in turn is the engine used by our plugins (AIDA-X and aidadsp-lv2). You can do it as follows:
python modelToRTNeural.py -l LSTM-12-1.json
this script outputs a file named model_rtneural.json, which is then the model to be loaded into the plugins.
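Since the exported model is plain JSON, you can sanity-check it from Python before loading it in a plugin. The sketch below only prints the top-level structure and the metadata field, if present; it makes no assumptions about the rest of the schema:

```python
import json

with open("model_rtneural.json") as f:   # the file produced by modelToRTNeural.py
    model = json.load(f)

print(list(model.keys()))                          # top-level structure of the export
print(model.get("metadata", "no metadata field"))  # name, ESR, etc., if embedded
```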
Explore some hidden features
- we now store metadata inside the Config file: if you save a structure like the following inside your Config file, it will be copied into the final model file
"metadata": {
    "name": "My Awesome Device Name",
    "samplerate": "48000",
    "source": "Analog / Digital Hw / Plugin / You Name It",
    "style": "Clean / Breakup / High Gain / You Name It",
    "based": "What it is based on, aka the name of the device",
    "author": "Who made this model",
    "dataset": "Describe the dataset used to train",
    "license": "CC BY-NC-ND 4.0 / Whatever LICENSE fits your needs"
}
- we perform input / target audio track time alignment. The blip locations are provided in the CSV file
- we can express the train, val and test regions of the dataset as region markers via a CSV file. The CSV file is self-explanatory and can be imported into Reaper
- you can not only calculate the ESR on an arbitrary audio track for a given model, but also obtain an ESR vs time audio track, to be imported into your DAW, which will let you better troubleshoot your model. Use the following command:
python proc_audio.py -l LSTM-12-1.json -i /path/to/input.wav -t /path/to/target.wav -o ./proc.wav
the script will generate a file named proc_ESR.wav containing the ESR vs time audio track
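For reference, the ESR vs time track is simply the error-to-signal ratio evaluated over short windows and written back out as audio. Below is a minimal sketch of the idea (not the repository's implementation), assuming mono, time-aligned files at the same sample rate and using the soundfile package for WAV I/O; the file names are hypothetical:

```python
import numpy as np
import soundfile as sf

def esr_vs_time(pred, target, win=4800, eps=1e-8):
    """Return one ESR value per non-overlapping window of `win` samples (100 ms at 48 kHz)."""
    n = (min(len(pred), len(target)) // win) * win
    p = pred[:n].reshape(-1, win)
    t = target[:n].reshape(-1, win)
    return ((t - p) ** 2).sum(axis=1) / ((t ** 2).sum(axis=1) + eps)

pred, sr = sf.read("proc.wav")        # processed output of the model
target, _ = sf.read("target.wav")     # hypothetical target file name
esr = esr_vs_time(pred, target)
# Hold each window's ESR value for the window's duration so the curve can be viewed as audio in a DAW.
sf.write("esr_vs_time.wav", np.repeat(esr, 4800).astype(np.float32), sr)
```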
Please follow the instructions here: https://docs.docker.com/desktop/install/windows-install
Please follow the instructions here: https://docs.docker.com/desktop/install/mac-install
Please follow the instructions below
dpkg -l | grep nvidia-driver
ii nvidia-driver-510 510.47.03-0ubuntu0.20.04.1 amd64 NVIDIA driver metapackage
dpkg -S /usr/lib/i386-linux-gnu/libcuda.so
libnvidia-compute-510:i386: /usr/lib/i386-linux-gnu/libcuda.so
distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
&& distribution="ubuntu20.04" \
&& curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
&& curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
Now you can run containers with GPU support.
It is possible to use this repository to train your own models. To model a different distortion pedal or amplifier, a dataset recorded from your target device is required; example datasets recorded from the HT-1 and Big Muff Pi are contained in the 'Data' directory.
To create a working local copy of this repository, use the following command:
git clone --recurse-submodules https://github.com/Alec-Wright/NeuralGuitarAmpModelling
Using this repository requires a Python environment with the 'pytorch', 'scipy', 'tensorboard' and 'numpy' packages installed. Additionally, this repository uses the 'CoreAudioML' package, which is included as a submodule. Cloning the repo as described in 'Cloning this repository' ensures the CoreAudioML package is also downloaded.
The 'proc_audio.py' script loads a neural network model and uses it to process some audio, then saves the processed audio. This is also a good way to check that your Python environment is set up correctly. Running the script with no arguments:
python proc_audio.py
will use the default arguments: the script will load the 'model_best.json' file from the directory 'Results/ht1-ht11/' and use it to process the audio file 'Data/test/ht1-input.wav', then save the output audio as 'output.wav'. Different arguments can be used as follows:
python proc_audio.py 'path/to/input_audio.wav' 'output_filename.wav' 'Results/path/to/model_best.json'
The 'dist_model.py' script was used to train the example models in the 'Results' directory. At the top of the file, the 'argparser' contains a description of all the training script arguments, as well as their default values. To train a model using the default arguments, simply run the script from the command line as follows:
python dist_model.py
note that you must run this command from a Python environment that has the libraries described in 'Python Environment' installed. To use different arguments in the training script, you can change the default arguments directly in 'dist_model.py', or you can direct the 'dist_model.py' script to look for a config file that contains different arguments, for example by running the script with the following command:
python dist_model.py -l "ht11.json"
where in this case the script will look for the file ht11.json in the 'Configs' directory. To create custom config files, the provided ht11.json file can be edited in any text editor.
During training, the script will save some data to a folder in the Results directory: the lowest loss achieved on the validation set so far, in 'bestvloss.txt'; a copy of that model, 'model_best.json'; and the audio created by that model, 'best_val_out.wav'. The neural network at the end of the most recent training epoch is also saved, as 'model.json'. When training is complete, the test dataset is processed, and the resulting audio and the test loss are also saved to the same directory.
A trained model contained in one of the 'model.json' or 'model_best.json' files can be loaded; see the 'proc_audio.py' script for an example of how this is done.
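For orientation, a config name like 'LSTM-12-1' denotes a recurrent network with an LSTM of hidden size 12 followed by a fully connected layer producing one output sample per input sample. The sketch below is a generic PyTorch illustration of that topology, with a residual connection from the input that is commonly used in these amp models; it is not the CoreAudioML implementation loaded by the scripts.

```python
import torch
import torch.nn as nn

class SimpleRNNSketch(nn.Module):
    """Generic LSTM + linear readout operating on (batch, time, 1) audio."""
    def __init__(self, hidden_size=12, skip=True):
        super().__init__()
        self.rec = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.lin = nn.Linear(hidden_size, 1)
        self.skip = skip                    # residual connection from input to output

    def forward(self, x, state=None):
        y, state = self.rec(x, state)
        y = self.lin(y)
        return (y + x if self.skip else y), state

model = SimpleRNNSketch()
x = torch.randn(4, 24000, 1)                # four 500 ms chunks at 48 kHz
y, _ = model(x)
print(y.shape)                              # torch.Size([4, 24000, 1])
```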
If determinism is desired, dist_model.py provides an option to seed all of the random number generators used at once. However, if NVIDIA CUDA is used, you must also handle the non-deterministic behaviour of CUDA for RNN calculations, as described in the Rev8 Release Notes. The non-deterministic behaviour of the cuDNN RNN and multi-head attention APIs can be eliminated by setting a single buffer size in the CUBLAS_WORKSPACE_CONFIG environment variable, for example:
CUBLAS_WORKSPACE_CONFIG=:4096:2
or
CUBLAS_WORKSPACE_CONFIG=:16:8
Note: if you're in Google Colab, the following goes into a cell (%env sets the variable for the notebook's Python process, whereas !export would only affect a temporary subshell):
%env CUBLAS_WORKSPACE_CONFIG=:4096:2
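If you prefer to set everything up from Python (for example at the top of a notebook), something along the following lines should work. This is a generic sketch of what the seeding option in dist_model.py is meant to achieve, not its actual code; the environment variable must be set before the affected CUDA kernels are first used:

```python
import os
import random

import numpy as np

# Must be set before the affected cuBLAS/cuDNN kernels are launched.
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:2"

import torch

def seed_everything(seed=39):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    torch.use_deterministic_algorithms(True)   # error out if a non-deterministic op is hit
    torch.backends.cudnn.benchmark = False

seed_everything(39)    # same seed as the --seed example above
```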
The dist_model.py script has been implemented with PyTorch's TensorBoard hooks. To see the data, run:
tensorboard --bind_all --logdir ./TensorboardData
The TensorBoard web interface will be available at http://127.0.0.1:6006
Alternatively, in your Jupyter notebook just add a cell with:
%load_ext tensorboard
%tensorboard --bind_all --logdir TensorboardData
This repository is still a work in progress, and I welcome your feedback, either by raising an Issue or in the 'Discussions' tab.