lora_conversion_utils: replace lora up/down with a/b even if transformer. in key #12101


Conversation

Contributor
@Beinsezii commented Aug 8, 2025 (edited)

What does this PR do?

Saw some Flux.DEV LoRAs in our DB with keys like:

transformer.single_transformer_blocks.0.attn.to_k.alpha : Tensor @ torch.Size([])
transformer.single_transformer_blocks.0.attn.to_k.lora_down.weight : Tensor @ torch.Size([10, 3072])
transformer.single_transformer_blocks.0.attn.to_k.lora_up.weight : Tensor @ torch.Size([3072, 10])
transformer.single_transformer_blocks.0.attn.to_q.alpha : Tensor @ torch.Size([])
transformer.single_transformer_blocks.0.attn.to_q.lora_down.weight : Tensor @ torch.Size([8, 3072])
transformer.single_transformer_blocks.0.attn.to_q.lora_up.weight : Tensor @ torch.Size([3072, 8])
transformer.single_transformer_blocks.0.attn.to_v.alpha : Tensor @ torch.Size([])
transformer.single_transformer_blocks.0.attn.to_v.lora_down.weight : Tensor @ torch.Size([8, 3072])
transformer.single_transformer_blocks.0.attn.to_v.lora_up.weight : Tensor @ torch.Size([3072, 8])
transformer.single_transformer_blocks.0.norm.linear.alpha : Tensor @ torch.Size([])
transformer.single_transformer_blocks.0.norm.linear.lora_down.weight : Tensor @ torch.Size([9, 3072])
transformer.single_transformer_blocks.0.norm.linear.lora_up.weight : Tensor @ torch.Size([9216, 9])
transformer.single_transformer_blocks.0.proj_mlp.alpha : Tensor @ torch.Size([])
transformer.single_transformer_blocks.0.proj_mlp.lora_down.weight : Tensor @ torch.Size([9, 3072])
transformer.single_transformer_blocks.0.proj_mlp.lora_up.weight : Tensor @ torch.Size([12288, 9])
transformer.single_transformer_blocks.0.proj_out.alpha : Tensor @ torch.Size([])
transformer.single_transformer_blocks.0.proj_out.lora_down.weight : Tensor @ torch.Size([7, 15360])
transformer.single_transformer_blocks.0.proj_out.lora_up.weight : Tensor @ torch.Size([3072, 7])

So basically PEFT-shaped layers but with Kohya adapter names. This might be a mistake on the trainer's part, but after poking around in the converter for a bit I figured out it can be an easy one-line fix, so that's what I've done here. I don't have the civit.ai URLs at the moment, so I can't provide a public link to the weights.
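
For illustration, here is a minimal sketch of that renaming idea, assuming everything except the adapter suffixes is already in PEFT layout; the helper name normalize_kohya_suffixes is made up for this sketch and is not the actual code in lora_conversion_utils, and alpha keys are left for the existing converter to handle:

```python
# Hypothetical sketch, not the actual diffusers implementation.
def normalize_kohya_suffixes(state_dict):
    # Swap Kohya adapter suffixes (lora_down/lora_up) for PEFT's (lora_A/lora_B),
    # leaving the "transformer." prefixes and the alpha keys untouched.
    converted = {}
    for key, value in state_dict.items():
        new_key = key.replace(".lora_down.weight", ".lora_A.weight")
        new_key = new_key.replace(".lora_up.weight", ".lora_B.weight")
        converted[new_key] = value
    return converted
```

With the dump above, lora_down.weight of shape [10, 3072] becomes lora_A.weight and lora_up.weight of shape [3072, 10] becomes lora_B.weight, which matches PEFT's down/up projection shapes.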

The proj_out layers still fail, but so does every other PEFT LoRA with proj_out layers against current main, so I think that's an unrelated bug.

Before submitting

Who can review?

@sayakpaul

Contributor Author

Bisect shows the proj_out problem was introduced in commit bc34fa8, so I can open an issue for that if need be.

Member

Can you show an example state dict? The changes you're introducing might be backwards-breaking.

Contributor Author

> The changes you're introducing might be backwards-breaking.

I assumed this would be impossible because lora_down and lora_up aren't read by PEFT anywhere? The diffusers loader mixin has a check for lora_down.weight that is hardcoded to use the SD1/XL UNet converter, which for Flux models results in an empty rank dict and later an index error because there are no UNet blocks.

# check with first key if is not in peft format
first_key = next(iter(state_dict.keys()))
if "lora_A" not in first_key:
    state_dict = convert_unet_state_dict_to_peft(state_dict)
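
As a quick illustration (not library code), the keys above contain lora_down rather than lora_A, so this check sends the state dict down the SD1/XL UNet converter path:

```python
# Illustration only: a Kohya-named Flux key fails the "lora_A" check above.
state_dict = {
    "transformer.single_transformer_blocks.0.attn.to_k.lora_down.weight": None,  # placeholder value
}
first_key = next(iter(state_dict.keys()))
print("lora_A" not in first_key)  # True -> convert_unet_state_dict_to_peft() is called
```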

> Can you show an example state dict?

https://huggingface.co/Beinsezii/peft_kohya_lora/blob/main/pytorch_lora_weights.safetensors
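
If anyone wants to reproduce the key dump from the PR description, here is a small sketch using huggingface_hub and safetensors (the repo id is taken from the link above; the script itself is just my assumption about how to inspect the file, not part of this PR):

```python
# Sketch: download the linked file and print every key with its tensor shape.
from huggingface_hub import hf_hub_download
from safetensors import safe_open

path = hf_hub_download(
    repo_id="Beinsezii/peft_kohya_lora",
    filename="pytorch_lora_weights.safetensors",
)
with safe_open(path, framework="pt") as f:
    for key in f.keys():
        print(key, ":", tuple(f.get_tensor(key).shape))
```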

Member

I understand now. Thanks!



@sayakpaul merged commit 3c0531b into huggingface:main on Aug 8, 2025
11 checks passed
Contributor Author

Nice, looks like a8e4797 fixed the proj_out issue too.

Member

Yes, hopefully we will not run into those nasty issues for a while :)


@Beinsezii deleted the beinsezii/flux_lora_peft_layers_kohya_names branch on August 8, 2025 at 19:57