Update PaddleNLP's LR schedulers; recommend unifying on the Trainer's scheduler #4351

JunnYu started this conversation in Ideas

1. LR Scheduler upgrade

Hugging Face's Trainer defines the following scheduler types in https://github.com/huggingface/transformers/blob/main/src/transformers/trainer_utils.py:

```python
class SchedulerType(Enum):
    LINEAR = "linear"
    COSINE = "cosine"
    COSINE_WITH_RESTARTS = "cosine_with_restarts"
    POLYNOMIAL = "polynomial"
    CONSTANT = "constant"
    CONSTANT_WITH_WARMUP = "constant_with_warmup"
```

PaddleNLP's current Trainer only supports the subset below, so it is worth updating:

```python
class SchedulerType(ExplicitEnum):
    LINEAR = "linear"
    COSINE = "cosine"
    CONSTANT = "constant"
    CONSTANT_WITH_WARMUP = "constant_with_warmup"
```
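For reference, the two missing schedule types can be expressed as pure-Python LR multiplier functions following HF's lambda formulas in `transformers/optimization.py`. This is a minimal sketch (function names here are illustrative, not actual PaddleNLP API); each function returns a factor that scales the base learning rate at a given step:

```python
import math

def linear_warmup(step, warmup):
    # Linearly ramp the multiplier from 0 to 1 over `warmup` steps.
    return step / max(1, warmup)

def polynomial_lambda(step, warmup, total, lr_init=1e-3, lr_end=1e-7, power=1.0):
    # Warmup, then polynomial decay from lr_init down to lr_end (HF's formula).
    if step < warmup:
        return linear_warmup(step, warmup)
    if step > total:
        return lr_end / lr_init
    lr_range = lr_init - lr_end
    decay_steps = total - warmup
    pct_remaining = 1 - (step - warmup) / decay_steps
    return (lr_range * pct_remaining**power + lr_end) / lr_init

def cosine_with_restarts_lambda(step, warmup, total, num_cycles=1):
    # Warmup, then `num_cycles` hard cosine restarts (HF's formula).
    if step < warmup:
        return linear_warmup(step, warmup)
    progress = (step - warmup) / max(1, total - warmup)
    if progress >= 1.0:
        return 0.0
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * ((num_cycles * progress) % 1.0))))
```

With `power=1.0`, `polynomial_lambda` reduces to the linear schedule; `cosine_with_restarts_lambda` jumps back to the full rate at the start of each cycle instead of decaying monotonically.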

2. Unify on the Trainer's LR scheduler (aligned with HF)

The old LR schedulers in https://github.com/PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/transformers/optimization.py should be deprecated; new code should move to the latest schedulers in the Trainer. That makes the experience much more familiar for users coming from HF.
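Unifying in the Trainer could follow HF's `get_scheduler` pattern: one factory resolves a `SchedulerType` name to a multiplier function, which the Trainer then wraps in a lambda-style decay. The sketch below is hypothetical (the `get_lr_lambda` name and signature are illustrative, not existing PaddleNLP API):

```python
from enum import Enum

class SchedulerType(Enum):
    LINEAR = "linear"
    COSINE = "cosine"
    COSINE_WITH_RESTARTS = "cosine_with_restarts"
    POLYNOMIAL = "polynomial"
    CONSTANT = "constant"
    CONSTANT_WITH_WARMUP = "constant_with_warmup"

def get_lr_lambda(name, num_warmup_steps, num_training_steps):
    # Resolve a scheduler name to a multiplier function lr(step) -> [0, 1].
    # The Trainer could wrap the result in a LambdaDecay-style scheduler.
    name = SchedulerType(name)
    if name is SchedulerType.CONSTANT:
        return lambda step: 1.0
    if name is SchedulerType.CONSTANT_WITH_WARMUP:
        return lambda step: min(1.0, step / max(1, num_warmup_steps))
    if name is SchedulerType.LINEAR:
        def linear(step):
            if step < num_warmup_steps:
                return step / max(1, num_warmup_steps)
            remaining = num_training_steps - step
            return max(0.0, remaining / max(1, num_training_steps - num_warmup_steps))
        return linear
    # cosine / cosine_with_restarts / polynomial would dispatch to their
    # own schedule functions here.
    raise NotImplementedError(f"{name} needs a dedicated schedule function")
```

One dispatch point means a user can switch schedules with a single `lr_scheduler_type` string in the training arguments, exactly as in HF.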


Replies: 1 comment


These two should be added:

```python
    COSINE_WITH_RESTARTS = "cosine_with_restarts"
    POLYNOMIAL = "polynomial"
```

And these two could be considered for merging into the Trainer so it supports them as well:

```python
    "CosineAnnealingWithWarmupDecay",
    "LinearAnnealingWithWarmupDecay",
```
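If they were merged, the Megatron-style shape behind `CosineAnnealingWithWarmupDecay` could be folded into the same multiplier framework. The sketch below is an assumption about the shape (linear warmup to a max rate, then cosine annealing down to a `min_lr` floor), not the actual PaddleNLP implementation, whose semantics may differ:

```python
import math

def cosine_annealing_with_warmup(step, max_lr, min_lr, warmup_steps, decay_steps):
    # Assumed shape: linear warmup to max_lr, then cosine anneal to min_lr,
    # holding min_lr after decay_steps. Returns an absolute LR, not a multiplier,
    # which is the main difference from the HF lambda-style schedules above.
    if warmup_steps > 0 and step < warmup_steps:
        return max_lr * step / warmup_steps
    if step >= decay_steps:
        return min_lr
    progress = (step - warmup_steps) / (decay_steps - warmup_steps)
    coeff = 0.5 * (1.0 + math.cos(math.pi * progress))
    return min_lr + coeff * (max_lr - min_lr)
```

Because this variant takes a `min_lr` floor rather than decaying to zero, merging it into the Trainer would also mean extending the scheduler arguments with a `min_lr`-style field.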