Audio-Classification-Using-Resnet

Dataset link: https://www.kaggle.com/andradaolteanu/gtzan-dataset-music-genre-classification
About the data: Only the Genre original folder is required. It is a collection of 10 genres with 100 audio files each, all 30 seconds long (the well-known GTZAN dataset, often called the MNIST of sounds).
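
To make the folder layout concrete, here is a minimal sketch that indexes the data into (file path, genre label) pairs. It assumes one subfolder per genre containing `.wav` files. The function name `index_genre_folder` is hypothetical and is not taken from `prepare_dataset.py`.

```python
from pathlib import Path


def index_genre_folder(root):
    """Return (sorted genre names, list of (wav path, label index)).

    Assumes one subfolder per genre, each holding .wav clips.
    """
    root = Path(root)
    # Each subfolder name is treated as a genre; sorting keeps labels stable.
    genres = sorted(p.name for p in root.iterdir() if p.is_dir())
    samples = []
    for label, genre in enumerate(genres):
        for wav in sorted((root / genre).glob("*.wav")):
            samples.append((wav, label))
    return genres, samples
```

With the full dataset as described above, this would be expected to yield 10 genre names and about 1000 samples.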

Implementation Guide

First, preprocess the music data and extract the features used to train the model. To do this, run:
python prepare_dataset.py
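
What `prepare_dataset.py` does internally is not described here. As an illustration only, one common preprocessing step for 30-second clips is to cut each clip into shorter, equal-length segments before extracting features. The sketch below computes the segment boundaries. The function name `segment_bounds`, the 22050 Hz sample rate, and the 3-second segment length are all assumptions, not values from this repository.

```python
def segment_bounds(num_samples, segment_len):
    """Return (start, end) sample indices of full, non-overlapping segments."""
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    n_full = num_samples // segment_len  # a trailing partial segment is dropped
    return [(i * segment_len, (i + 1) * segment_len) for i in range(n_full)]


# Hypothetical numbers: a 30 s clip at 22050 Hz split into 3 s segments.
sample_rate = 22050
bounds = segment_bounds(30 * sample_rate, 3 * sample_rate)
print(len(bounds))  # 10
```

Each (start, end) pair can then be used to slice the waveform array before computing features for that segment.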
Next, two custom models (one of them CNN-based) are built with PyTorch and trained on the preprocessed data. To train the CNN-based model, run:
python audio_cnn_pytorch.py

The reported accuracy is 44.5%.
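
For context, accuracy here is presumably the fraction of evaluated clips whose predicted genre matches the true genre. With 10 equally sized genres, uniform random guessing would score about 10%. A minimal sketch of the computation, using hypothetical label lists rather than output from this repository:

```python
def accuracy(predictions, targets):
    """Fraction of positions where the predicted label equals the true label."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        raise ValueError("cannot compute accuracy on empty input")
    correct = sum(p == t for p, t in zip(predictions, targets))
    return correct / len(targets)


# Hypothetical genre indices for four clips; three of four match.
print(accuracy([0, 1, 2, 3], [0, 1, 2, 0]))  # 0.75
```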

