A simple, straightforward tool for class annotation of audio files
This project came to life when I was messing around with a prototype of a neural network for audio classification. I needed a way to annotate some scrapped data that I collected, but was too lazy to open each of the hundreds of files and manually write down their respective classifications.
While there are some great audio annotation tools out there with great features such as diarization, waveform visualization, speaker identification, etc, all I needed was a quick and dirty way to separate audio files in classes.
If you need complex annotation features, checkout some projects such as Audino, audio-annotator and dynitag. If, like me, you want a simple way to classify your data with minimal setup, AudioClass is here for you :)
To get up and running with AudioClass, here's what you need to do:
AudioClass is developed in a linux environment using the Flask framework with Python 3.7, and tested to run on Firefox. It should work with other browsers, but I don't offer any guarantees.
- Clone the repo
git clone https://github.com/glefundes/audio-class.git
- Create and activate a virtual environment (Recommended)
cd audio-class/
python3 -m venv venv
source venv/bin/activate
- Install dependencies
pip install -r requirements.txt
Currently the system is limited to recognize .mp3, .wav, and .ogg files. It may work with other formats but it's not tested. You can try by adding new extensions to the accepted formats list here.
To expose your data to the app, move all the audio files you wish to annotate to the audio_data/
folder located in the project's root
To launch the local flask server and startup AudioClass just cd into the project's root and:
flask run
Now just navigate to localhost:5000/
in your browser and you're set :)
In the startup menu, click 'New Session' and the class setup prompt will appear. You need to setup at least 2 unique class labels in order to begin annotating.
In the main app screen you can navigate through the audio files in the data folder and play them. Select the class to which they belong and submit. The annotation will be recorded and the next file will be loaded automatically.
Current relevant features are:
- Speedy mouse-less annotation: navigate through classes using the right and left arrow keys and submit with the Enter key
- Autoplay checkbox: start to play next audio automatically so you don't have to click play everytime you submit
- Optional observation field: Write something about a particular file so you can remember and find it later without having to inturrupt the session
- Hide already annotated file checkbox: display only files without existing annotations in the current session. Useful to track progress or when loading a previous unfinished session.
When you are done or simply want to take a break, click the 'Download annotation' button to download a .json file with all the files you just annotated. The file has the following format:
{
"classes": [
{
"class_label": "foo",
"index": 0
},
{
"class_label": "bar",
"index": 1
}
],
"files": [
{
"class": 0,
"file": "filename.ogg",
"obs": "isn't this a a cool tool? ;)"
}
]
}
To load and resume a session, click on 'Load previous session' when you start up AudioClass and upload the .json file you downloaded. (All the files from the original session must still be present in the data folder for the session resuming to work!)
Being mainly a ML engineer involved in computer vision projects, I don't claim to be a profficient web developer in any way. I can almost guarantee this project has it's share of inneficiencies, is built with a few bad practices and probably the occasional bug. Feel free to open issues if you encounter unexpected behaviour or to fork/issue a pull request If you wish to contribute by improving the existing code or adding new features. Yay for open source software!
Distributed under the MIT License. See LICENSE
for more information.
Hit me up with any questions about the project!
Gabriel Lefundes Vieira - [email protected]