Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@fpshuang
fpshuang
Follow

HUANG Pengsheng fpshuang

😅
SDE VLM | HFT Dev | Ex - @SenseTime
  • China
  • 11:31 (UTC +08:00)

Block or report fpshuang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NEO Series: Native Vision-Language Models from First Principles

Python 842 29 Updated Jun 25, 2026

SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models

Python 73 7 Updated May 28, 2026

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 370 20 Updated Apr 17, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,444 305 Updated Jun 26, 2026

Open-source unified multimodal model

Python 6,045 539 Updated May 4, 2026
Python 49 4 Updated May 9, 2026

[ICLR 2026] The official implementation of "Efficient-SAM2: Accelerating SAM2 with Object-Aware Visual Encoding and Memory Retrieval"

Jupyter Notebook 38 Updated Feb 9, 2026

A lightweight inference engine supporting speculative speculative decoding (SSD).

Python 960 73 Updated May 10, 2026

Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression

Python 312 6 Updated Jan 27, 2026

[ACL 2026] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Python 92 2 Updated Jan 22, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,763 6,791 Updated Jun 29, 2026

MOVA: Towards Scalable and Synchronized Video–Audio Generation

Python 1,060 89 Updated Jun 18, 2026

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Python 632 30 Updated Jan 17, 2026

释放你的安卓微信内部存储空间,一键解放微信存储空间的工具。

Rich Text Format 1,069 159 Updated Jan 19, 2026

LightRFT (Light Reinforcement Fine-Tuning) is an advanced reinforcement learning fine-tuning framework designed for Large Language Models (LLMs) and Vision-Language Models (VLMs).

Python 19 3 Updated Jan 12, 2026

The open-source CapCut alternative

TypeScript 60,398 6,539 Updated Jun 21, 2026

RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2

Python 41 3 Updated Mar 12, 2026

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python 729 78 Updated May 14, 2026

Easiest and laziest way for building multi-agent LLMs applications.

Python 3,848 393 Updated Jun 29, 2026

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,804 282 Updated Mar 28, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,846 148 Updated Jul 10, 2025
Rocq Prover 371 12 Updated Sep 20, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,242 1,106 Updated Jun 2, 2026

CLIP-like model evaluation

Python 813 103 Updated Mar 19, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,371 1,284 Updated Jun 29, 2026

OpenMMLab Model Compression Toolbox and Benchmark.

Python 1,672 245 Updated Jun 11, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 2,212 235 Updated May 20, 2024

DeepGEMM: clean and efficient BLAS kernel library on GPU

Cuda 7,444 1,072 Updated Jun 24, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,723 1,072 Updated Apr 30, 2026
Next

AltStyle によって変換されたページ (->オリジナル) /