Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@newuserforstudy
newuserforstudy
Follow
HAUT; Control Science and Engineering; NLP
  • CG
  • China Foshan

Block or report newuserforstudy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Videodl: A lightweight video downloader written in pure python. (轻量级视频下载器,优先高清无水印,支持抖音,快手,小红书,B站,TikTok,YouTube,FIFA+,优酷,腾讯,爱奇艺,1905电影网,乐视,芒果,咪咕,PPTV,搜狐,Facebook,Twitter,新浪微博,今日头条,网易公开课,全民K歌,CCTV央视...

Python 1,084 234 Updated Feb 18, 2026

Visual tracking library based on PyTorch.

Python 3,485 611 Updated Aug 8, 2024

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 70,904 9,838 Updated Feb 16, 2026

[AAAI 2025] Event-Enhanced Blurry Video Super-Resolution

Python 451 47 Updated Nov 11, 2025

DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.

Python 611 59 Updated Apr 17, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 34,366 4,267 Updated Aug 6, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,560 5,965 Updated Aug 16, 2024

Rembg is a tool to remove images background

Python 21,901 2,228 Updated Feb 3, 2026

OpenStereo: A Comprehensive Benchmark for Stereo Matching

Python 853 106 Updated Feb 11, 2026

SOS IROS 2018 GOOGLE; StereoNet ECCV2018 GOOGLE; ActiveStereoNet ECCV2018 Oral GOOGLE; HITNET CVPR2021 GOOGLE;PLUME Uber ATG

Python 721 125 Updated Feb 5, 2022

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 788 137 Updated Apr 11, 2024

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec...

Python 1,768 159 Updated Feb 12, 2026

Spark-TTS Inference Code

Python 10,915 1,167 Updated Apr 9, 2025

Lidar Obstacle Detection

C++ 261 75 Updated Sep 25, 2019

📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.

Python 5,962 586 Updated Feb 13, 2026

A two stage lightweight and high performance license plate recognition in MTCNN and LPRNet

Jupyter Notebook 685 174 Updated Jan 22, 2024

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 1,037 213 Updated Aug 28, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,803 11,776 Updated Dec 15, 2025

DeepStream SDK Python bindings and sample applications

Jupyter Notebook 1,788 534 Updated Oct 14, 2025

[TPAMI 2022] GAN Inversion: A Survey

TeX 1,131 80 Updated Feb 7, 2025

A list of all named GANs!

Python 14,691 2,553 Updated Oct 6, 2023

PyTorch implementations of Generative Adversarial Networks.

Python 17,430 4,100 Updated Jun 18, 2024

A collection of awesome text-to-image generation studies.

TeX 747 40 Updated Dec 25, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,812 6,778 Updated Feb 19, 2026

A curated list of awesome computer vision resources

23,065 4,431 Updated May 17, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,751 3,262 Updated Feb 18, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 70,639 13,532 Updated Feb 19, 2026

PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.

Python 34 2 Updated Mar 11, 2025

中国大模型

6,385 548 Updated Nov 30, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,685 507 Updated Jul 18, 2024
Next

AltStyle によって変換されたページ (->オリジナル) /