Peng-Cheng Zou tornadozou
-
NANJING UNIVERSITY OF AERONAUTICS AND ASTRONAUTICS
- Nanjing, China
Stars
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
RelBench: Relational Deep Learning Benchmark
PipelineLLM 是一个系统性的大语言模型(LLM)后训练学习项目,涵盖从监督微调(SFT)到偏好优化(DPO)、强化学习(RLHF/PPO/GRPO)再到持续学习(Continual Learning)的完整技术栈。
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
Python implementations of contextual bandits algorithms
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
A large-scale multi-modal pre-trained model
Minimal code for A Generalist Agent
An educational resource to help anyone learn deep reinforcement learning.
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Methods and Implements of Deep Clustering
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Tobe2d / Photo_Wake-Up
Forked from lev1khachatryan/Tex-An_MeshPhoto Wake-Up: 3D Character Animation from a Single Photo
Deploy SQLFlow service mesh on Windows, macOS, and Linux desktop computers
Book about interpretable machine learning
Implementation of RLNs, as described in https://arxiv.org/abs/1805.06440
Source code for Neural Information Processing Systems (NeurIPS) 2018 paper "Multi-Task Learning as Multi-Objective Optimization"
Code for EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
A large annotated semantic parsing corpus for developing natural language interfaces.
The Resources for "Natural Language to Logical Form" ; "自然语言转逻辑形式"研究资料收集。
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、...