Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@wavelet2008
wavelet2008
Follow
vllm,self-driving, 3d slam

Block or report wavelet2008

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 185,352 31,164 Updated Feb 11, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,559 1,880 Updated Jan 9, 2026

Community maintained hardware plugin for vLLM on Ascend

C++ 1,661 827 Updated Feb 11, 2026

State-of-the-art 2D and 3D Face Analysis Project

Python 27,851 5,925 Updated Feb 2, 2026

InspireFace is a cross-platform face recognition SDK developed in C/C++, supporting multiple operating systems and various backend types for inference, such as CPU, GPU, and NPU.

C++ 177 36 Updated Sep 22, 2025

Easy to use device for connecting "old" measuring units (water, power, gas, ...) to the digital world

C++ 8,053 835 Updated Jan 30, 2026

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,804 760 Updated Sep 22, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, ...

Python 12,635 1,201 Updated Feb 11, 2026

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,898 155 Updated Jan 22, 2026

https://hf.co/hexgrad/Kokoro-82M

JavaScript 5,622 638 Updated Aug 6, 2025

[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,775 193 Updated Dec 16, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,965 289 Updated May 15, 2025

RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能,基于本地LLM、embedding模型、reranker模型实现,支持GraphRAG,无须安装任何第三方agent库。

Python 837 145 Updated Apr 2, 2025

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,589 2,605 Updated Jun 26, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,223 1,812 Updated Feb 26, 2025

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,631 171 Updated Oct 15, 2025

研究GOT-OCR-项目落地加速,不限语言

Python 62 4 Updated Oct 24, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 9,341 701 Updated Jan 3, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 54,234 4,511 Updated Feb 9, 2026

real time face swap and one-click video deepfake with only a single image

Python 79,416 11,577 Updated Feb 11, 2026

A model that achieve dual detection(Infrared+RGB) with rotation

Python 3 1 Updated Aug 5, 2024

Quick exploration into fine tuning florence 2

Jupyter Notebook 339 30 Updated Sep 19, 2024

yolov10 瑞芯微 rknn 板端 C++部署,使用平台 rk3588。

C 74 11 Updated Jul 18, 2024

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,222 1,177 Updated Mar 14, 2025

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,330 86 Updated Apr 15, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 6,209 583 Updated Feb 26, 2025

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,874 140 Updated Jul 5, 2024

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

Python 402 69 Updated Feb 6, 2025
Next

AltStyle によって変換されたページ (->オリジナル) /