GitHub - PaddlePaddle/PaddleFormers: PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.

PaddlePaddle/PaddleFormers

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 185 Commits
.github		.github
examples		examples
paddleformers		paddleformers
scripts		scripts
tests		tests
.copyright.hook		.copyright.hook
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yaml		.readthedocs.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.py		setup.py

Repository files navigation

News | Highlights | Installation | Quickstart | Community

PaddleFormers is a Transformer model library built on the PaddlePaddle deep learning framework, delivering both ease of use and high-performance capabilities. It provides a unified model definition interface, modular training components, and comprehensive distributed training strategies specifically designed for large language model development pipelines. This enables developers to train large models efficiently with minimal complexity, making it suitable for diverse scenarios ranging from academic research to industrial applications.

News

[2025年06月28日] 🎉 PaddleFormers 0.1 is officially released! This initial version supports SFT/DPO training paradigms, configurable distributed training via unified Trainer API, and integrates PEFT, MergeKit, and Quantization APIs for diverse LLM applications.

Highlights

⚙️ Simplified Distributed Training

Implements 4D parallel strategies through unified Trainer API, lowering the barrier to distributed LLM training.

🛠 Efficient Post-Training

Integrates Packing dataflow and FlashMask operators for SFT/DPO training, eliminating padding waste and boosting throughput.

💾 Industrial Storage Solution

Features Unified Checkpoint storage tools for LLMs, enabling training resumption and dynamic resource scaling. Additionally implements asynchronous storage (up to 95% faster) and Optimizer State Quantization (78% storage reduction), ensuring industrial training meets both efficiency and stability requirements.

Installation

Requires Python 3.8+ and PaddlePaddle 3.1+.

# Install via pip
pip install paddleformers
# Install development version
git clone https://github.com/PaddlePaddle/PaddleFormers.git
cd PaddleFormers
pip install -e .

Quickstart

Text Generation

This example shows how to load Qwen model for text generation with PaddleFormers Auto API:

from paddleformers.transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B-Base")
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-0.6B-Base", dtype="bfloat16", convert_from_hf=True)
input_features = tokenizer("Give me a short introduction to large language model.", return_tensors="pd")
outputs = model.generate(**input_features, max_new_tokens=128)
print(tokenizer.batch_decode(outputs[0], skip_special_tokens=True))

SFT Training

Getting started with supervised fine-tuning (SFT) using PaddleFormers:

from paddleformers.trl import SFTConfig, SFTTrainer
from datasets import load_dataset
dataset = load_dataset("ZHUI/alpaca_demo", split="train")
training_args = SFTConfig(output_dir="Qwen/Qwen3-0.6B-SFT", device="gpu", model_init_kwargs={"convert_from_hf": True})
trainer = SFTTrainer(
 args=training_args,
 model="Qwen/Qwen3-0.6B-Base",
 train_dataset=dataset,
)
trainer.train()

Community

We welcome all contributions! See CONTRIBUTING.md for guidelines.

License

This repository's source code is available under the Apache 2.0 License.

About

PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.

Code of conduct

Contributing

Activity

Custom properties

Stars

12.9k stars

Watchers

179 watching

Forks

2.1k forks

Report repository

Releases 3

PaddleFormers v0.3 Latest

Sep 18, 2025

+ 2 releases

Packages

No packages published

Contributors 38

+ 24 contributors

Languages

Python 99.8%
Other 0.2%

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

PaddlePaddle/PaddleFormers

Folders and files

Latest commit

History

Repository files navigation

News | Highlights | Installation | Quickstart | Community

News

Highlights

⚙️ Simplified Distributed Training

🛠 Efficient Post-Training

💾 Industrial Storage Solution

Installation

Quickstart

Text Generation

SFT Training

Community

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages

Uh oh!

Contributors 38

Languages

License

PaddlePaddle/PaddleFormers

Folders and files

Latest commit

History

Repository files navigation

News | Highlights | Installation | Quickstart | Community

News

Highlights

⚙️ Simplified Distributed Training

🛠 Efficient Post-Training

💾 Industrial Storage Solution

Installation

Quickstart

Text Generation

SFT Training

Community

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 38

Languages

Packages