Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

AnyaCoder/audio-preprocess

Repository files navigation

Fish Audio Preprocessor

PyPI Version

中文文档

This repo contains some scripts for audio processing. Main features include:

  • Video/audio to wav
  • Audio vocal separation
  • Automatic audio slicing
  • Audio loudness matching
  • Audio data statistics (supports determining audio length)
  • Audio resampling
  • Audio transcribe (.lab)
  • Audio transcribe via FunASR (use --model-type funasr to enable, detailed usage can be found at code)
  • Audio transcribe via WhisperX
  • Merge .lab files (example: fap merge-lab ./dataset list.txt "{PATH}|spkname|JP|{TEXT}")

([ ] indicates not completed, [x] indicates completed)

This code has been tested on Ubuntu 22.04 / 20.04 + Python 3.10. If you encounter problems on other versions, feedback is welcome.

Getting Started:

pip install -e .
fap --help

Reference

About

Preprocess Audio for training

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.7%
  • Shell 1.3%

AltStyle によって変換されたページ (->オリジナル) /