Contributor

@leffff leffff commented Oct 21, 2025

This PR adds support for the 10-second Kandinsky 5.0 model family.

import torch
from diffusers import Kandinsky5T2VPipeline
from diffusers.utils import export_to_video
# Load the pipeline
pipe = Kandinsky5T2VPipeline.from_pretrained(
 "ai-forever/Kandinsky-5.0-T2V-Lite-sft-10s-Diffusers", 
 torch_dtype=torch.bfloat16
)
pipe = pipe.to("cuda")
# Generate video
prompt = [
 "Photorealistic closeup video of two intricately detailed pirate ships locked in a fierce battle, complete with cannon fire and billowing sails, as they sail through the swirling waters of a steaming cup of coffee. The ships are miniature but highly realistic, with wooden textures and flags fluttering in the liquid breeze. Coffee splashes and foam ripple around them as they maneuver through the turbulent surface, dodging each other's attacks. A detailed reflection of the battle appears on the glossy surface of the coffee, adding to the dynamic realism. The camera pans and zooms to capture every dramatic moment of the high-seas clash within this tiny, unexpected world.",
 "Bad quality",
]
negative_prompt = "Static, 2D cartoon, cartoon, 2d animation, paintings, images, worst quality, low quality, ugly, deformed, walking backwards"
pipe.transformer.set_attention_backend("flex")
output = pipe(
 prompt=prompt,
 negative_prompt=negative_prompt,
 height=512,
 width=768,
 num_frames=241,
 num_inference_steps=50,
 guidance_scale=5.0,
 num_videos_per_prompt=1,
 generator=torch.Generator(device="cuda").manual_seed(42),
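)
# Save the first generated video; 241 frames at 24 fps is roughly 10 seconds.
# fps=24 is an assumption here, the PR description does not state it.
export_to_video(output.frames[0], "output.mp4", fps=24)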

output.12.mp4
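
The example above asks for `num_frames=241` for a 10-second clip. A minimal sketch of how such a frame count could be derived, assuming 24 fps playback and a video VAE that compresses time by a factor of 4 (both are assumptions, not stated in this PR); `frames_for_duration` is a hypothetical helper, not part of diffusers:

```python
def frames_for_duration(seconds: float, fps: int = 24, temporal_stride: int = 4) -> int:
    """Return a frame count of the form k * temporal_stride + 1 for the given duration.

    Assumption: the model expects (num_frames - 1) to be divisible by
    temporal_stride. fps and temporal_stride are illustrative defaults.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    raw = int(round(seconds * fps))
    # Round down to a multiple of the stride, then add the leading frame.
    return (raw // temporal_stride) * temporal_stride + 1


print(frames_for_duration(10))  # 241
print(frames_for_duration(5))   # 121
```

Under these assumptions, the 10-second value matches the `num_frames=241` used in the example.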

leffff and others added 30 commits

October 4, 2025 10:10


 add transformer pipeline first version

d53f848


 updates

7db6093


 fix 5sec generation

a0cf07f


 Merge branch 'huggingface:main' into main

0bd738f


 rewrite Kandinsky5T2VPipeline to diffusers style

c8f3a36


 Merge branch 'huggingface:main' into main

86b6c2b


 add multiprompt support

723d149


 remove prints in pipeline

22e14bd


 add nabla attention

70fa62b


 Merge branch 'huggingface:main' into main

07e11b2


 Wrap Transformer in Diffusers style

45240a7


 fix license

43bd1e8


 Merge branch 'huggingface:main' into main

f35c279


 fix prompt type

149fd53


 Merge branch 'main' of https://github.com/leffff/diffusers

e3a3e9d


 add gradient checkpointing and peft support

7af80e9


 add usage example

04efb19


 Merge branch 'main' into main

4aa22f3


 Update src/diffusers/pipelines/kandinsky5/pipeline_kandinsky.py

235f0d5

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>


 Update src/diffusers/pipelines/kandinsky5/pipeline_kandinsky.py

88a8eea

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>


 Update src/diffusers/pipelines/kandinsky5/pipeline_kandinsky.py

f52f3b4

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>


 Update src/diffusers/pipelines/kandinsky5/pipeline_kandinsky.py

0190e55

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>


 Update src/diffusers/models/transformers/transformer_kandinsky.py

d62dffc

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>


 remove unused imports


 Merge branch 'huggingface:main' into main

d5dcd94


 add 10 second models support

b615d5c


 Merge branch 'main' of https://github.com/leffff/diffusers

6a0233e

@leffff @yiyixuxu


 Update src/diffusers/pipelines/kandinsky5/pipeline_kandinsky.py

588c12a

Co-authored-by: YiYi Xu <yixu310@gmail.com>


 remove no_grad and simplified prompt paddings

327ab84

@leffff @yiyixuxu


 Update src/diffusers/pipelines/kandinsky5/pipeline_kandinsky.py

9b06afb

Co-authored-by: YiYi Xu <yixu310@gmail.com>


 Merge branch 'huggingface:main' into main

91133e0

Member

sayakpaul commented Oct 22, 2025

@leffff let's add the tests and docs as well.

Collaborator

yiyixuxu commented Oct 22, 2025

ok, let's just use this PR to add docs and tests?

Contributor Author

leffff commented Oct 22, 2025

Okay


 add docs

25f2e9c

Contributor Author

leffff commented Oct 23, 2025

Please check out the docs


 Merge branch 'huggingface:main' into main

e45c036

yiyixuxu

yiyixuxu reviewed

Oct 23, 2025

Collaborator

@yiyixuxu yiyixuxu left a comment

thanks!

docs/source/en/api/pipelines/kandinsky_v5.md (outdated, resolved)

Collaborator

yiyixuxu commented Oct 23, 2025

@bot /style

Contributor

github-actions bot commented Oct 23, 2025 (edited)

Style bot fixed some files and pushed the changes.

github-actions bot and others added 3 commits

October 23, 2025 17:44


 Apply style fixes

3bbc232


 Merge branch 'huggingface:main' into main

e181f13


 update docs

dd6bf39

Contributor Author

leffff commented Oct 24, 2025

@yiyixuxu plz check the new docs version!


 Merge branch 'main' into main

add757b

yiyixuxu

yiyixuxu approved these changes

Oct 24, 2025

Collaborator

@yiyixuxu yiyixuxu left a comment

looks really good! thanks!

Member

sayakpaul commented Oct 24, 2025

@leffff could you also add kandinsky_v5 to _toctree.yml?

Contributor Author

leffff commented Oct 24, 2025

Okay!

leffff added 2 commits

October 24, 2025 21:43


 add kandinsky5 to toctree

5fb528b


 Merge branch 'main' of https://github.com/leffff/diffusers

c9c1190

Contributor Author

leffff commented Oct 24, 2025

@sayakpaul @yiyixuxu done!

Contributor Author

leffff commented Oct 25, 2025

Please review and merge!

sayakpaul

sayakpaul approved these changes

Oct 25, 2025

Member

@sayakpaul sayakpaul left a comment

Thanks! We should also add tests. Could you please do that too?

@stevhliu please also review the docs.

src/diffusers/models/transformers/transformer_kandinsky.py (resolved)

Contributor Author

leffff commented Oct 25, 2025

Okay!

leffff and others added 2 commits

October 27, 2025 15:53


 Merge branch 'huggingface:main' into main

47dd246


 add tests

d2a206e

Contributor Author

leffff commented Oct 27, 2025

Please check the tests

sayakpaul

sayakpaul reviewed

Oct 27, 2025

tests/pipelines/kandinsky5/test_kandinsky5.py

self.assertEqual(output_with_embeds.shape, output_with_prompt.shape)

@slow

Member

@sayakpaul sayakpaul Oct 27, 2025

We can leave this one out for now.

sayakpaul

sayakpaul reviewed

Oct 27, 2025

diffusers/tests/pipelines/test_pipelines_common.py

tests/pipelines/kandinsky5/test_kandinsky5.py

max_diff = np.abs(output.detach().cpu().numpy() - output_loaded.detach().cpu().numpy()).max()

self.assertLess(max_diff, 1e-4)

def test_prompt_embeds(self):

Member

@sayakpaul sayakpaul Oct 27, 2025

We should be able to test it with test_encode_prompt_works_in_isolation (diffusers/tests/pipelines/test_pipelines_common.py, line 2072 in dc6bd15):

def test_encode_prompt_works_in_isolation(self, extra_required_param_value_dict=None, atol=1e-4, rtol=1e-4):
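
The idea behind such a check can be sketched roughly as follows: encode the prompt on its own, pass the resulting embeddings to the pipeline, and confirm the output matches a run that received the raw prompt. This is only an illustration under stated assumptions; `ToyPipeline` and `check_encode_prompt_in_isolation` are hypothetical stand-ins and do not reproduce the actual diffusers test or pipeline API.

```python
import numpy as np


class ToyPipeline:
    """Hypothetical stand-in for a text-to-video pipeline (not the diffusers API)."""

    def __init__(self, seed: int = 0, dim: int = 8):
        rng = np.random.default_rng(seed)
        self.dim = dim
        self.proj = rng.standard_normal((dim, dim))  # toy "denoiser" weights

    def encode_prompt(self, prompt: str) -> np.ndarray:
        # Deterministic toy text encoder: fold characters into a fixed-size vector.
        vec = np.zeros(self.dim)
        for i, ch in enumerate(prompt):
            vec[i % self.dim] += ord(ch) / 255.0
        return vec

    def __call__(self, prompt=None, prompt_embeds=None) -> np.ndarray:
        if (prompt is None) == (prompt_embeds is None):
            raise ValueError("pass exactly one of prompt or prompt_embeds")
        if prompt_embeds is None:
            prompt_embeds = self.encode_prompt(prompt)
        return np.tanh(self.proj @ prompt_embeds)


def check_encode_prompt_in_isolation(pipe, prompt, atol=1e-4, rtol=1e-4):
    """Compare a prompt-driven run with a run driven by precomputed embeddings."""
    embeds = pipe.encode_prompt(prompt)
    out_prompt = pipe(prompt=prompt)
    out_embeds = pipe(prompt_embeds=embeds)
    max_diff = float(np.abs(out_prompt - out_embeds).max())
    return bool(np.allclose(out_prompt, out_embeds, atol=atol, rtol=rtol)), max_diff


ok, max_diff = check_encode_prompt_in_isolation(ToyPipeline(), "a pirate ship in a coffee cup")
print(ok, max_diff)
```

The tolerance defaults mirror the `atol=1e-4, rtol=1e-4` values in the signature quoted above; everything else is illustrative.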

sayakpaul

sayakpaul approved these changes

Oct 27, 2025