Releases · codefuse-ai/MFTCoder
MFTCoder v0.4.3: Bugfix
chencyudel · cc55b06
Bugfix: removed the default TensorBoard writer, which could cause permission problems.
P.S. If you hit a "permission denied" error on "/home/admin", please try the fixed release v0.4.3.
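The fix boils down to not constructing a TensorBoard writer at a hardcoded default path. A minimal sketch of that pattern, assuming a hypothetical `make_writer` helper (illustrative only, not the actual commit):

```python
import os
from typing import Optional

from torch.utils.tensorboard import SummaryWriter


def make_writer(log_dir: Optional[str]) -> Optional[SummaryWriter]:
    """Create a TensorBoard writer only for an explicitly supplied,
    writable directory; return None (i.e., skip logging) otherwise."""
    if log_dir is None:
        # No implicit default: writing to a fixed location such as
        # /home/admin can fail with "permission denied" for other users.
        return None
    os.makedirs(log_dir, exist_ok=True)
    if not os.access(log_dir, os.W_OK):
        return None  # avoid crashing the run on an unwritable directory
    return SummaryWriter(log_dir=log_dir)
```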
MFTCoder v0.4.2: Support more open-source models; support QLoRA + DeepSpeed ZeRO3 / FSDP
chencyudel · d0b8457
Support more open-source models such as Qwen2, Qwen2-MoE, StarCoder2, etc.
Support QLoRA + DeepSpeed ZeRO3 / FSDP, which is efficient for very large models.
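For context, QLoRA loads the base model with 4-bit quantized weights and trains only LoRA adapters on top; the sharding then comes from DeepSpeed ZeRO3 or FSDP via Accelerate. A minimal sketch using the standard transformers/peft/bitsandbytes APIs (generic usage, not MFTCoder's own entry point; the model name and hyperparameters are placeholders):

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 quantization of the frozen base weights (the "Q" in QLoRA).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2-7B",  # placeholder: any of the newly supported models
    quantization_config=bnb_config,
)

# Small trainable LoRA adapters on top of the quantized base model.
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM"
)
model = get_peft_model(model, lora_config)
```

Because the quantized base weights stay frozen, gradients and optimizer state exist only for the small set of LoRA parameters, which is what keeps the combination memory-efficient for very large models.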
MFTCoder v0.3.0: Support more open-source models, Self-Paced Loss, and FSDP
chencyudel · e5243da
Updates:
- Mainly for MFTCoder-accelerate.
- It now supports more open-source models such as Mistral, Mixtral (MoE), DeepSeek-Coder, and ChatGLM3.
- It supports FSDP as an option.
- It also supports Self-Paced Loss as a solution for convergence balance in multitask fine-tuning; see the sketch after this list.
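A minimal sketch of the Self-Paced Loss idea (illustrative only; the exact formulation is defined in the MFTCoder repo and paper, and the inputs here are hypothetical):

```python
import torch


def self_paced_loss(task_losses: torch.Tensor,
                    ref_losses: torch.Tensor) -> torch.Tensor:
    """task_losses: current per-task training losses, shape [T].
    ref_losses: recent per-task validation losses used as a
    convergence signal, shape [T] (hypothetical inputs)."""
    # Tasks whose reference loss is still high are treated as less
    # converged and receive proportionally larger weights, pushing
    # all tasks toward converging at a similar pace.
    weights = (ref_losses / ref_losses.sum()).detach()
    return (weights * task_losses).sum()
```

The intent is convergence balance: tasks that are still far from converging get larger weights, so no single task dominates or lags behind in multitask fine-tuning.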
v0.1.0 release: Multi-Task Fine-tuning Framework for Multiple Base Models
chencyudel · 7946e4f
- We released MFTCoder, which supports fine-tuning Code Llama, Llama, Llama 2, StarCoder, ChatGLM2, CodeGeeX2, Qwen, and GPT-NeoX models with LoRA/QLoRA.
- mft_peft_hf is based on HuggingFace Accelerate and DeepSpeed.
- mft_atorch is based on ATorch, a fast distributed training framework for LLMs.