
Yanfu Ren DaFuCoding

🎯
Focusing
Vision Algorithm Engineer

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 21,933 2,160 Updated Feb 14, 2026

Codebase for FinePDFs

Python 176 28 Updated Jan 9, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 194,746 33,619 Updated Feb 15, 2026

Expert Specialized Fine-Tuning

Python 729 263 Updated May 22, 2025

MiroThinker is an open source deep research agent optimized for research and prediction. It achieves an 80.8% Avg@8 score on the challenging GAIA benchmark.

Python 6,260 464 Updated Feb 10, 2026

MiniMax-M2, a model built for Max coding & agentic workflows.

2,403 191 Updated Nov 13, 2025

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 28,331 4,049 Updated Feb 12, 2026

Post-training with Tinker

Python 2,836 319 Updated Feb 11, 2026

Salesforce Enterprise Deep Research

Python 1,109 177 Updated Jan 30, 2026

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,889 1,338 Updated Feb 13, 2026

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Python 112 6 Updated Feb 2, 2026

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 402 15 Updated Jan 29, 2026

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 588 35 Updated Dec 9, 2024

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 907 77 Updated Feb 2, 2026

Scalable data preprocessing and curation toolkit for LLMs

Python 1,403 218 Updated Feb 13, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,743 3,337 Updated Feb 14, 2026

RewardAnything: Generalizable Principle-Following Reward Models

Python 45 2 Updated Jun 11, 2025

Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data

Python 131 13 Updated Jan 31, 2026

Bash is all You need - Write a nano Claude Code 0 - 1

Python 17,041 3,620 Updated Feb 15, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,542 295 Updated Feb 15, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,370 776 Updated Jan 21, 2026

Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".

Python 118 2 Updated Feb 7, 2026

A live reading list for LLM data synthesis (Updated to July, 2025).

450 38 Updated Aug 26, 2025

🦛 CHONK docs with Chonkie ✨ - The lightweight ingestion library for fast, efficient and robust RAG pipelines

Python 3,747 251 Updated Feb 14, 2026
Python 565 49 Updated Nov 20, 2024

A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval

JavaScript 13,366 1,327 Updated Jan 24, 2026

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 331 50 Updated Feb 14, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,827 215 Updated Feb 15, 2026

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo...

JavaScript 3,181 416 Updated Sep 29, 2025