Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@waiter1
waiter1
Follow

Block or report waiter1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 5,016 1,172 Updated Dec 19, 2025

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

C 7,169 1,549 Updated Jul 27, 2025

Fast and accurate machine learning on sparse matrices - matrix factorizations, regression, classification, top-N recommendations.

R 181 30 Updated Feb 17, 2025

本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。

Python 55 10 Updated Aug 30, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,813 1,389 Updated Dec 6, 2023

ReViT - Residual Attention Vision Transformer

Jupyter Notebook 33 2 Updated Feb 29, 2024

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Python 375 19 Updated Aug 25, 2025

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Python 940 194 Updated Dec 6, 2023

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 249 35 Updated Dec 20, 2022

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 251 32 Updated Sep 13, 2024

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 707 101 Updated Feb 1, 2026

Conformer-based Metric GAN for speech enhancement

Python 412 67 Updated May 3, 2024
Python 44 3 Updated Oct 29, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr...

C++ 10,126 1,137 Updated Feb 2, 2026

Small compression utility

C++ 38 10 Updated Jan 20, 2026

Tools for handling multimodal data in machine learning projects.

Python 1,108 257 Updated Feb 2, 2026
Python 1,352 396 Updated Nov 28, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 12,221 1,230 Updated Apr 30, 2025

End-to-End Speech Processing Toolkit

Python 9,713 2,379 Updated Jan 27, 2026

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Python 719 114 Updated Dec 17, 2025

[ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".

Python 214 15 Updated Aug 1, 2023

mall项目是一套电商系统,包括前台商城系统及后台管理系统,基于Spring Boot+MyBatis实现,采用Docker容器化部署。 前台商城系统包含首页门户、商品推荐、商品搜索、商品展示、购物车、订单流程、会员中心、客户服务、帮助中心等模块。 后台管理系统包含商品管理、订单管理、会员管理、促销管理、运营管理、内容管理、统计报表、财务管理、权限管理、设置等模块。

Java 82,844 29,659 Updated Feb 2, 2026

A list of awesome beginners-friendly projects.

82,267 7,729 Updated Dec 5, 2025

AltStyle によって変換されたページ (->オリジナル) /