-
Notifications
You must be signed in to change notification settings - Fork 921
Pull requests: NVIDIA/FasterTransformer
Pull requests list
Fix shape mismatch on the masked_tokens param in decoder masked multi-head attention kernel.
#773
opened Oct 24, 2023 by
FengDSP
Loading...
[BugFix] GPT inference error when pipeline_para_size > 1 and int8_mode != 0
#750
opened Aug 23, 2023 by
00why00
Loading...
[Doc] Add
projects section in README which is developed based on FasterTransformer
#731
opened Jul 25, 2023 by
lvhan028
Loading...
Add triton fastertransformer backend support for deberta
#725
opened Jul 19, 2023 by
sfc-gh-zhwang
Loading...
fix: initialize tiled_prompt_lengths_buf_ to zero in gptneox
#716
opened Jul 13, 2023 by
yandai
Loading...
Huggingface gptj convert script supports sharded checkpoint
#695
opened Jun 29, 2023 by
skyser2003
Loading...
ProTip!
Updated in the last three days: updated:>2025年10月21日.