speech-to-text

Here are 3,805 public repositories matching this topic...

Language: All

Filter by language

All 3,805 Python 1,518 JavaScript 473 Jupyter Notebook 322 TypeScript 228 Java 156 HTML 136 C# 116 C++ 69 Swift 64 CSS 56

Sort: Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

whisper.cpp

ggml-org / whisper.cpp

Star 42.9k

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Aug 24, 2025
C++

mozilla / DeepSpeech

Star 26.6k

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

Updated Jun 19, 2025
C++

SYSTRAN / faster-whisper

Star 17.9k

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated Aug 16, 2025
Python

m-bain / whisperX

Star 17.5k

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Jul 2, 2025
Python

leon

leon-ai / leon

Star 16.6k

🧠 Leon is your open-source personal assistant.

nodejs python bot text-to-speech automation privacy ai offline chatbot artificial-intelligence speech-synthesis assistant speech-recognition personal-assistant speech-to-text leon flite voice-assistant virtual-assistant ai-assistant

Updated Aug 29, 2025
TypeScript

kaldi-asr / kaldi

Star 15.1k

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Jul 22, 2025
Shell

jianchang512 / pyvideotrans

Star 14.1k

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。

text-to-speech speech-to-text video-transition

Updated Aug 31, 2025
Python

alphacep / vosk-api

Star 13.1k

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

android python raspberry-pi ios privacy deep-neural-networks deep-learning offline voice-recognition speech-recognition speech-to-text kaldi stt speaker-verification asr speech-to-text-android deepspeech speaker-identification google-speech-to-text vosk

Updated Aug 24, 2025
Jupyter Notebook

speechbrain / speechbrain

Star 10.4k

A PyTorch-based Speech Toolkit

audio deep-learning transformers pytorch voice-recognition speech-recognition speech-to-text language-model speaker-recognition speaker-verification speech-processing audio-processing asr speaker-diarization speechrecognition speech-separation speech-enhancement spoken-language-understanding huggingface speech-toolkit

Updated Aug 13, 2025
Python

Uberi / speech_recognition

Star 8.9k

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated May 18, 2025
Python

KoljaB / RealtimeSTT

Star 8.5k

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

python realtime speech-to-text

Updated Jul 11, 2025
Python

nl8590687 / ASRT_SpeechRecognition

Star 8.2k

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

python tensorflow keras cnn python3 speech-recognition speech-to-text ctc chinese-speech-recognition asrt

Updated Sep 26, 2024
Python

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 12 programming languages

android windows macos linux lazarus raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v object-pascal asr arm32 onnx vits

Updated Sep 1, 2025
C++

TalAter / annyang

Star 6.7k

💬 Speech recognition for your site

voice speech speech-recognition speech-to-text

Updated Aug 7, 2024
JavaScript

FunAudioLLM / SenseVoice

Star 6.5k

Multilingual Voice Understanding Model

multilingual python ai pytorch speech-recognition speech-to-text asr cross-lingual speech-emotion-recognition audio-event-classification aigc llm gpt-4o

Updated Aug 15, 2025
Python

snakers4 / silero-models

Star 5.5k

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

text-to-speech german speech pytorch tts speech-synthesis english speech-recognition spanish colab speech-to-text pretrained-models stt asr capitalization onnx stt-benchmark tts-models torch-hub repunctuation

Updated Oct 18, 2023
Jupyter Notebook

modelscope / FunClip

Star 4.9k

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated Jul 11, 2025
Python

MahmoudAshraf97 / whisper-diarization

Star 4.9k

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Aug 18, 2025
Jupyter Notebook

voice-pro

abus-aikorea / voice-pro

Star 4.7k

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

text-to-speech translator audiobook podcasts tts speech-synthesis subtitles speech-recognition webui speech-to-text karaoke transcription gradio whisper voice-conversion voice-cloning yt-dlp faster-whisper whisperx

Updated Jul 20, 2025
Python

sanchit-gandhi / whisper-jax

Star 4.6k

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

deep-learning speech-recognition speech-to-text whisper jax

Updated Apr 3, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."

Learn more

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly