sdh newuserforstudy
Stars
Videodl: A lightweight video downloader written in pure python. (轻量级视频下载器,优先高清无水印,支持抖音,快手,小红书,B站,TikTok,YouTube,FIFA+,优酷,腾讯,爱奇艺,1905电影网,乐视,芒果,咪咕,PPTV,搜狐,Facebook,Twitter,新浪微博,今日头条,网易公开课,全民K歌,CCTV央视...
Visual tracking library based on PyTorch.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
[AAAI 2025] Event-Enhanced Blurry Video Super-Resolution
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Rembg is a tool to remove images background
OpenStereo: A Comprehensive Benchmark for Stereo Matching
SOS IROS 2018 GOOGLE; StereoNet ECCV2018 GOOGLE; ActiveStereoNet ECCV2018 Oral GOOGLE; HITNET CVPR2021 GOOGLE;PLUME Uber ATG
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec...
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.
A two stage lightweight and high performance license plate recognition in MTCNN and LPRNet
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Robust Speech Recognition via Large-Scale Weak Supervision
DeepStream SDK Python bindings and sample applications
PyTorch implementations of Generative Adversarial Networks.
A collection of awesome text-to-image generation studies.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A curated list of awesome computer vision resources
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A high-throughput and memory-efficient inference and serving engine for LLMs
PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.
A large-scale 7B pretraining language model developed by BaiChuan-Inc.