Eric Tao shadowfall09
Highlights
- Pro
Stars
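⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 Say goodbye to information overload: your AI public-opinion monitoring assistant and trending-topic filter! Aggregates trending topics from multiple platforms plus RSS subscriptions, with precise keyword filtering. AI translation and AI analysis briefings are pushed straight to your phone, and it also supports integration with the MCP architecture, enabling AI natural-language...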
Collection of Summer 2026 tech internships!
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, ...
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
An Autonomous LLM Agent for Complex Task Solving
My learning notes for ML SYS.
slime is an LLM post-training framework for RL Scaling.
Train your Agent model via our easy and efficient framework
PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe...
ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution
scikit-mobility: mobility analysis in Python
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under al...
Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"
Official GitHub repository for "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"
An implementation of the paper "Deep Inception Networks: A General End-to-End Framework for Multi-asset Quantitative Strategies"
Dive2Pitts: An End-to-End RAG System for Pittsburgh-CMU-Centric Question Answering