Add audio transcription script using Faster Whisper #533

Open
Syed-56 wants to merge 3 commits into wasmerio:main from Syed-56:transcribing

@Syed-56 Syed-56 commented Jan 9, 2026

PR Title

Add audio transcription script using Faster Whisper

Summary

Added a new Python script that transcribes audio and video files (MP3, WAV, MP4, etc.) into text using the Faster Whisper library. The script automatically installs the library if missing and checks for ffmpeg.
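The two environment checks mentioned above could look roughly like the following. This is a minimal sketch and not the code from `audio_transcribe.py`: the helper names `has_ffmpeg` and `ensure_package` are hypothetical, and it assumes the library is installed from PyPI as `faster-whisper` and imported as `faster_whisper`.

```python
import importlib.util
import shutil
import subprocess
import sys


def has_ffmpeg() -> bool:
    """Return True if an ffmpeg executable is found on PATH."""
    return shutil.which("ffmpeg") is not None


def ensure_package(module_name: str, pip_name: str) -> bool:
    """Install pip_name with pip if module_name cannot be imported.

    Returns True if the module was already importable, and False if an
    install had to be run first. (Hypothetical helper, for illustration.)
    """
    if importlib.util.find_spec(module_name) is not None:
        return True
    # Use the running interpreter's pip so the package lands in the same environment.
    subprocess.check_call([sys.executable, "-m", "pip", "install", pip_name])
    # Let the import system notice the freshly installed package.
    importlib.invalidate_caches()
    return False
```

A caller would run `ensure_package("faster_whisper", "faster-whisper")` once at startup, then print installation instructions and exit if `has_ffmpeg()` returns `False`.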

Description

This script lets users transcribe audio and video files to text. It supports five model sizes (tiny, base, small, medium, large) and either English-only or multilingual transcription. The script validates the input file, prompts for the model choice, and writes the transcript to the same folder as the input file.
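The model choice described above might map onto Faster Whisper as in this sketch. It is not the PR's code: `resolve_model_name` and `transcribe_to_text` are hypothetical names, the `device="cpu"` and `compute_type="int8"` settings are assumptions picked for portability, and the `.en` suffix rule assumes English-only checkpoints exist for every size except large.

```python
from pathlib import Path

# Model sizes listed in the PR description.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large")


def resolve_model_name(size: str, english_only: bool) -> str:
    """Turn the user's answers into a model name (hypothetical helper)."""
    size = size.strip().lower()
    if size not in MODEL_SIZES:
        raise ValueError(f"unknown model size: {size!r}")
    # Assumption: ".en" checkpoints exist for every size except large.
    if english_only and size != "large":
        return f"{size}.en"
    return size


def transcribe_to_text(audio_path: Path, size: str, english_only: bool) -> str:
    """Transcribe one file and return the text, one segment per line."""
    # Imported lazily so the helper above works without the library installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(
        resolve_model_name(size, english_only),
        device="cpu",         # assumption: no GPU available
        compute_type="int8",  # assumption: quantized weights to save memory
    )
    # language=None asks the model to detect the spoken language itself.
    segments, _info = model.transcribe(
        str(audio_path), language="en" if english_only else None
    )
    # segments is a generator; decoding happens while it is consumed.
    return "\n".join(segment.text.strip() for segment in segments)
```

Keeping the name resolution separate from the model call makes the prompt logic checkable without downloading any weights.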

The changes are as follows:

  • Added audio_transcribe.py script
  • Auto-installs faster_whisper if missing
  • Checks for ffmpeg and provides installation instructions
  • Supports MP3, WAV, M4A, FLAC, AAC, and MP4 files
  • Prompts user for model size and English-only/multilingual choice
  • Saves transcription output as <audio_file>_transcript.txt
  • Includes user-friendly messages and error handling
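The file validation and output naming from the list above can be sketched as follows. This is illustrative only: the function names are hypothetical, and it assumes `<audio_file>` means the file name without its extension, which the PR text does not spell out.

```python
from pathlib import Path

# Extensions listed in the PR description.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".aac", ".mp4"}


def is_supported(path: Path) -> bool:
    """Check the extension case-insensitively against the supported set."""
    return path.suffix.lower() in SUPPORTED_EXTENSIONS


def transcript_path(audio_path: Path) -> Path:
    """Place the transcript next to the input file.

    Assumption: the name is built from the stem, so talk.mp3 becomes
    talk_transcript.txt rather than talk.mp3_transcript.txt.
    """
    return audio_path.with_name(f"{audio_path.stem}_transcript.txt")


def prompt_for_file(ask=input) -> Path:
    """Keep asking until the user names an existing, supported file."""
    while True:
        candidate = Path(ask("Path to audio file: ").strip().strip('"'))
        if not candidate.is_file():
            print(f"File not found: {candidate}")
        elif not is_supported(candidate):
            print(f"Unsupported file type: {candidate.suffix}")
        else:
            return candidate
```

Passing the prompt function in as `ask` is a design choice for this sketch only; it lets the loop be exercised with scripted answers instead of a live terminal.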

Checks

in the repository

  • Made no changes that degrade the functioning of the repository
  • Gave each commit a descriptive title

in the PR

  • Followed the format of the pull_request_template
  • Made the Pull Request concise and self-contained
  • Tested the script locally and verified transcription works

Thank You,

Syed Sultan

@Syed-56 Syed-56 left a comment

Added validation for the transcription model choice, with retry loops for both the model and file prompts. Also made a few minor improvements.

yooung1 reacted with thumbs up emoji
Reviewers

yooung1 approved these changes

2 participants
