This code implements a PyTorch-based CNN that classifies drum samples of whatever types you want from audio files. It does so by converting them to mel spectrograms and running them through a 4-block convolutional neural network. The structure is explained in detail in NeuralNetwork02.py, and I commented extensively everywhere since this was my first PyTorch project. The included model is pretrained on 12 classes (Bass, Blip, Bongo, Clap, Cymbal, HiHat, Kick, Rim, SFX, Snare, Tom, and Udu) from my own drum sample collection, but you can use any combination of sound types you want; see the retraining section below.
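To give an idea of the pipeline, here is a minimal sketch of the preprocessing and the 4-block CNN. The real architecture lives in NeuralNetwork02.py and Preprocessor01.py; the sample rate, mel settings, channel counts, and layer sizes below are illustrative assumptions, not the trained model's exact configuration.

```python
import torch
import torch.nn as nn
import torchaudio

def load_mel_spectrogram(path, sample_rate=22050, n_mels=64):
    """Load an audio file, resample it, and convert it to a log-mel spectrogram."""
    waveform, sr = torchaudio.load(path)
    waveform = waveform.mean(dim=0, keepdim=True)            # mix down to mono
    if sr != sample_rate:
        waveform = torchaudio.functional.resample(waveform, sr, sample_rate)
    mel = torchaudio.transforms.MelSpectrogram(sample_rate=sample_rate, n_mels=n_mels)(waveform)
    return torchaudio.transforms.AmplitudeToDB()(mel)        # shape: (1, n_mels, time)

def conv_block(in_ch, out_ch):
    """One convolutional block: conv -> batch norm -> ReLU -> pooling."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(),
        nn.MaxPool2d(2),
    )

class DrumCNN(nn.Module):
    """Illustrative 4-block CNN; the real layer sizes are in NeuralNetwork02.py."""
    def __init__(self, num_classes=12):
        super().__init__()
        self.features = nn.Sequential(
            conv_block(1, 16), conv_block(16, 32),
            conv_block(32, 64), conv_block(64, 128),
        )
        self.head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                  nn.Linear(128, num_classes))

    def forward(self, x):                                     # x: (batch, 1, n_mels, time)
        return self.head(self.features(x))
```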
I made it to learn PyTorch, and as a hobby musician a drum classifier was an interesting idea with a readily available dataset.
Double-click quickstart/quickstart.bat. It runs the included example.mp3 sample through the trained model and prints the results. It will ask whether you want to see the mel spectrogram and CNN feature map plots or not.
The result using the provided 808 drum kick file will look like this:
| Class | Confidence | Visualization |
|---|---|---|
| Kick | 83.5% | █████████████████████████ |
| Tom | 9.6% | ██ |
| Bass | 6.9% | ██ |
This is a pretty good result, as an 808 kick naturally has some bass and tom character.
If the quickstart works, you can copy your unsorted drum files into files_inferencesorting/unsorted_files and run InferenceMultiple05.py, which automatically sorts them into the sorted_files/ folder. Files are copied, so your originals stay safe; however, files with the same name already in sorted_files/ will be overwritten.
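Conceptually, the batch sorting step looks like the sketch below. The folder paths match the ones described above; classify() is a stand-in for the actual model call that InferenceMultiple05.py makes, so treat this as an outline rather than the script itself.

```python
import shutil
from pathlib import Path

UNSORTED = Path("files_inferencesorting/unsorted_files")
SORTED = Path("files_inferencesorting/sorted_files")

def classify(path):
    """Placeholder for the real model call; returns the predicted class name."""
    raise NotImplementedError

for sample in UNSORTED.iterdir():
    if sample.suffix.lower() not in {".wav", ".mp3", ".flac"}:
        continue
    predicted_class = classify(sample)
    destination = SORTED / predicted_class
    destination.mkdir(parents=True, exist_ok=True)
    shutil.copy2(sample, destination / sample.name)   # copy, so originals stay untouched
```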
Install the dependencies with pip install -r requirements.txt. The CUDA versions of torch/torchaudio are pinned in requirements.txt; I used CUDA version 13.0.
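If you want to verify that the CUDA-enabled builds were installed correctly before training or inference, a quick check like this works:

```python
import torch
import torchaudio

# Sanity check: confirm the installed builds and that the GPU is visible.
print("torch", torch.__version__, "| torchaudio", torchaudio.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```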
Run either python InferenceSingle04.py path/to/your/sample.wav or python InferenceSingle04.py path/to/your/sample.wav --enable-plots.
You can also drag and drop a wav/mp3/flac file into the console; the terminal usually inserts the path automatically (for example in VS Code on Windows).
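Under the hood, single-file inference boils down to roughly the following. This reuses the hypothetical DrumCNN and load_mel_spectrogram helpers from the sketch near the top; the weight filename and the hard-coded class list are assumptions, since the real script loads both from files_modeloutputs/.

```python
import sys
import torch
from safetensors.torch import load_file

# Rough outline of single-file inference; InferenceSingle04.py wraps this properly.
classes = ["Bass", "Blip", "Bongo", "Clap", "Cymbal", "HiHat",
           "Kick", "Rim", "SFX", "Snare", "Tom", "Udu"]

model = DrumCNN(num_classes=len(classes))                    # from the sketch above
model.load_state_dict(load_file("files_modeloutputs/model.safetensors"))  # filename assumed
model.eval()

mel = load_mel_spectrogram(sys.argv[1]).unsqueeze(0)         # add a batch dimension
with torch.no_grad():
    probabilities = torch.softmax(model(mel), dim=1).squeeze(0)

# Print the top three classes, mirroring the table shown in the quickstart.
for p, name in sorted(zip(probabilities.tolist(), classes), reverse=True)[:3]:
    print(f"{name}: {p:.1%}")
```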
Retraining requires your drum samples to be sorted into folders. The samples themselves can have any name and only need to be .mp3, .flac, or .wav, but they must sit in the correct subfolders, which you need to create:
Inside the files_drumtrainingdata/ directory, create one subfolder per class, each containing wav/mp3/flac samples. For example, if you have kicks and snares, make two new folders called Kick and Snare. The code autodetects these folders. Then simply run python ModelTrainer03.py, which will replace the old .safetensors. The best model is saved automatically to files_modeloutputs/ whenever validation accuracy improves.
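For reference, here is the expected layout and a minimal sketch of the class autodetection. The individual sample filenames are made up, and ModelTrainer03.py may build its dataset differently; this just illustrates the folder-name-equals-class convention.

```python
from pathlib import Path

# Expected layout (folder names become class labels):
#
#   files_drumtrainingdata/
#       Kick/    kick_01.wav, kick_02.mp3, ...
#       Snare/   snare_01.flac, ...
#
# Minimal autodetection sketch: every subfolder is a class, every
# wav/mp3/flac inside it is a training sample for that class.
DATA_DIR = Path("files_drumtrainingdata")
classes = sorted(d.name for d in DATA_DIR.iterdir() if d.is_dir())
samples = [(f, classes.index(d.name))
           for d in DATA_DIR.iterdir() if d.is_dir()
           for f in d.iterdir()
           if f.suffix.lower() in {".wav", ".mp3", ".flac"}]
print(classes)                     # e.g. ['Kick', 'Snare']
print(len(samples), "training samples found")
```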
| File / Folder | Description |
|---|---|
| files_drumtrainingdata/ | Place your sample folders in here |
| files_modeloutputs/ | Saved model weights and class list |
| quickstart/ | Example sample + launcher .bat |
| Preprocessor01.py | Audio loading, resampling, mel spectrogram conversion, dataset class |
| NeuralNetwork02.py | CNN architecture |
| ModelTrainer03.py | Training loop |
| InferenceSingle04.py | Run the model on a single file. Result is printed to console. |
| InferenceMultiple05.py | Copies audio files from the unsorted folder into the corresponding sorted class folders. |