@DaFuCoding DaFuCoding Follow

@DaFuCoding

DaFuCoding

🎯

Focusing

Yanfu Ren DaFuCoding

🎯

Focusing

Vision Algorithm Engineer

55 followers · 39 following

Stars

Showing results

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 21,933 2,160 Updated Feb 14, 2026

huggingface / finepdfs

Codebase for FinePDFs

Python 176 28 Updated Jan 9, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 194,746 33,619 Updated Feb 15, 2026

deepseek-ai / ESFT

Expert Specialized Fine-Tuning

Python 729 263 Updated May 22, 2025

MiroMindAI / MiroThinker

MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 80.8% Avg@8 score on the challenging GAIA benchmark.

Python 6,260 464 Updated Feb 10, 2026

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,403 191 Updated Nov 13, 2025

HKUDS / LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 28,331 4,049 Updated Feb 12, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,836 319 Updated Feb 11, 2026

SalesforceAIResearch / enterprise-deep-research

Salesforce Enterprise Deep Research

Python 1,109 177 Updated Jan 30, 2026

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,889 1,338 Updated Feb 13, 2026

GAIR-NLP / MegaScience

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Python 112 6 Updated Feb 2, 2026

Mini-o3 / Mini-o3

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 402 15 Updated Jan 29, 2026

hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 588 35 Updated Dec 9, 2024

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 907 77 Updated Feb 2, 2026

NVIDIA-NeMo / Curator

Scalable data pre processing and curation toolkit for LLMs

Python 1,403 218 Updated Feb 13, 2026

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,743 3,337 Updated Feb 14, 2026

WisdomShell / RewardAnything

RewardAnything: Generalizable Principle-Following Reward Models

Python 45 2 Updated Jun 11, 2025

OpenDataArena / OpenDataArena-Tool

Tools for OpenDataArena: Fair, Open, and Transparent Arena for Data

Python 131 13 Updated Jan 31, 2026

shareAI-lab / learn-claude-code

Bash is all You need - Write a nano Claude Code 0 - 1

Python 17,041 3,620 Updated Feb 15, 2026

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,542 295 Updated Feb 15, 2026

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

10,370 776 Updated Jan 21, 2026

pengr / DataMan

Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".

Python 118 2 Updated Feb 7, 2026

pengr / LLM-Synthetic-Data

A live reading list for LLM data synthesis (Updated to July, 2025).

450 38 Updated Aug 26, 2025

chonkie-inc / chonkie

🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines

Python 3,747 251 Updated Feb 14, 2026

ConardLi / easy-dataset

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

JavaScript 13,366 1,327 Updated Jan 24, 2026

nvidia-cosmos / cosmos-rl

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 331 50 Updated Feb 14, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,827 215 Updated Feb 15, 2026

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo...

JavaScript 3,181 416 Updated Sep 29, 2025

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly