Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@chenying99
chenying99
Follow

Block or report chenying99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"

Python 97 3 Updated Dec 18, 2025
Python 6,053 467 Updated Aug 29, 2025

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,838 5,268 Updated Nov 13, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,346 48 Updated Dec 16, 2025

Open-Source Frontier Voice AI

Python 19,180 2,123 Updated Dec 17, 2025

Lightning-Fast, On-Device TTS — running natively via ONNX.

JavaScript 1,907 173 Updated Dec 15, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,162 143 Updated Sep 5, 2024

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 21,029 3,051 Updated Dec 19, 2025

[ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 905 58 Updated Dec 29, 2025

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 7,537 1,185 Updated Mar 21, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,723 1,704 Updated Feb 29, 2024

A PyTorch library and evaluation platform for end-to-end compression research

Python 1,489 265 Updated Oct 23, 2025

[ACMMM 2021 Oral] Enhanced Invertible Encoding for Learned Image Compression

Jupyter Notebook 127 22 Updated Feb 9, 2023

Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

Python 3,175 522 Updated Jul 23, 2024

[TMM 2025] Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression

Python 15 3 Updated Mar 28, 2025

This is an open-source repository based on our paper, primarily applied in the field of remote sensing image compression.

Python 18 5 Updated May 15, 2024

Paper list: deep learning based video compression

272 28 Updated Sep 27, 2025

The paper list about deep learning based image compression

190 19 Updated Sep 27, 2025

Code release for DynamicTanh (DyT)

Python 1,031 86 Updated Mar 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,461 12,273 Updated Dec 29, 2025

A truly open version of gpt-oss which shows the entire pre-training from scratch

Python 81 30 Updated Sep 4, 2025

Learn the building blocks of how to build gpt-oss from scratch

Python 107 30 Updated Sep 23, 2025

Model and tool for computing audio feature representations based on VAE

Jupyter Notebook 7 1 Updated Jun 16, 2019

the official TangXu's group released codes about the Remote Sensing images classificaiton

Python 61 9 Updated Jan 1, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,696 85 Updated Feb 11, 2025

Official repo for CFG-Zero*

Python 695 24 Updated May 2, 2025

Pusa: Thousands Timesteps Video Diffusion Model

Python 669 46 Updated Sep 5, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V...

Python 36,109 5,094 Updated Dec 28, 2025

Official implementation of PVT series

Python 1,874 254 Updated Oct 27, 2022

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,131 609 Updated Oct 27, 2023
Next

AltStyle によって変換されたページ (->オリジナル) /