Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Notifications You must be signed in to change notification settings

faustpy/spec_augment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

14 Commits

Repository files navigation

SpecAugment.py

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

SpecAugment is a SOTA-achieving data augmentation approach on speech recognition. The paper's authors did not publish code that I could find and their implementation was in TensorFlow.

To use:

  1. run install.sh (I recommend to use a unique conda env for the project)
  2. Check out SparseImageWarp.ipynb (a Jupyter notebook) for the functions.

Augmentations

  1. Time Warp (Coming Soon) This augmentation relies on a lot of functionality not yet in Pytorch, so I have to write it from scratch. I am working on it.

  2. Time Mask (DONE!)

  3. Frequency Mask (DONE!)

Let's be friends! @zachcaceres zach.dev

About

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%

AltStyle によって変換されたページ (->オリジナル) /