GitXplorerGitXplorer
a

GCommandsPytorch

public
70 stars
21 forks
0 issues

Commits

List of commits on branch master.
Unverified
af9ccfd3d7c92e2850b1cf07214c02bf02c7bc7f

fix dims for greater length

aadiyoss committed 7 years ago
Unverified
efbd30b750d59711c14871daba94dc4fd3ad5277

beauty fix

aadiyoss committed 7 years ago
Unverified
9c55c04f26b7d4a653a05b37927fe1e211fe1f42

Update README.md

aadiyoss committed 7 years ago
Unverified
8fab54b0100b3bf8c0a132fb008ee1f32c64b21d

minor readme updates

aadiyoss committed 7 years ago
Unverified
23fec9bc6f8ea48d1f825fdc9aaf1685f05a634a

update readme

aadiyoss committed 7 years ago
Unverified
f1b39b1c7d1786324b5f6f4a611557f4225dfd6d

rm files

committed 7 years ago

README

The README file for this repository.

ConvNets for Speech Commands Recognition

Training ConvNet models using Google Commands Dataset, implemented in PyTorch.

Features

  • Training and testing ConvNets.
  • Arrange Google Commands Dataset, in an Train, Test, Valid folders for easy loading.
  • Dataset loader.

Installation

Several libraries are needed to be installed in order to extract spectrograms and train the models.

Usage

Google Commands Dataset

Download and extract the Google Commands Dataset.

To make the arrange the data run the following command:

python make_dataset.py <google-command-folder> --out_path <path to save the data the new format>

Custom Dataset

You can also use the data loader and training scripts for your own custom dataset. In order to do so the dataset should be arrange in the following way:

root/up/kazabobo.wav
root/up/asdkojv.wav
root/up/lasdsa.wav
root/right/blabla.wav
root/right/nsdf3.wav
root/right/asd932.wav

Training

Use python run.py --help for more parameters and options.

python run.py --train_path <train_data_path> --valid_path <valid_data_path> --test_path <test_data_path>

Results

Accuracy results for the train, validation and test sets using two ConvNet models (LeNet5 and VGG11).

In order to reproduce the below results just exec the run.py file with default parameters. Results may be improved using deeper models (VGG13, VGG19), or better hyper-parameters optimization.

Model Train acc. Valid acc. Test acc.
LeNet5 99% (50742/51088) 90% (6093/6798) 89% (6096/6835)
VGG11 97% (49793/51088) 94% (6361/6798) 94% (6432/6835)