Vulkan: Select device with env variable, and skip initialize for unused devices. #629

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

stduhpf wants to merge 2 commits into leejet:master

from stduhpf:vk-device

Open

Vulkan: Select device with env variable, and skip initialize for unused devices. #629

stduhpf wants to merge 2 commits into leejet:master from stduhpf:vk-device

Conversation

stduhpf

Copy link

Contributor

@stduhpf stduhpf commented Mar 14, 2025 •

edited

Loading

Introduces the SD_VK_DEVICE env variable that can be used to chose the device to run inference on.

Also backend was initialized on every device for no apparent reason.

I know it was already possible to hide some devices from the Vulkan backend by setting the GGML_VK_VISIBLE_DEVICES env variable, but this isn't ideal, as it affects every program that uses Vulkan GGML.

(Maybe the same should be done for CUDA and SYCL backends)

stduhpf added 2 commits

March 14, 2025 19:53

@stduhpf


 Vulkan: Select device with env variable, and do not initialize unused...

927af46

... devices

@stduhpf


 better logging and exception handling

d7a0a4d

@stduhpf stduhpf mentioned this pull request

Apr 6, 2025

How to choose which Vulkan device to run on? #650

Closed

@wbruna

Copy link

Contributor

wbruna commented Jun 19, 2025

Works for me... kind of 🙂 I don't think SD_VK_DEVICE can be a replacement for GGML_VK_VISIBLE_DEVICES:

SD_VK_DEVICE=0 ./sd (...)
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon RX 7600 XT (RADV NAVI33) (radv) | uma: 0 | fp16: 1 | warp size: 64 | shared memory: 65536 | matrix cores: KHR_coopmat
[INFO ] stable-diffusion.cpp:195 - Vulkan: Using device 0

SD_VK_DEVICE=1 ./sd (...)
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon RX 7600 XT (RADV NAVI33) (radv) | uma: 0 | fp16: 1 | warp size: 64 | shared memory: 65536 | matrix cores: KHR_coopmat
[WARN ] stable-diffusion.cpp:191 - Cannot find targeted vulkan device (1). Falling back to device 0.
[INFO ] stable-diffusion.cpp:195 - Vulkan: Using device 0

GGML_VK_VISIBLE_DEVICES=0 ./sd (...)
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon Vega 11 Graphics (RADV RAVEN) (radv) | uma: 1 | fp16: 1 | warp size: 64 | shared memory: 65536 | matrix cores: none
[INFO ] stable-diffusion.cpp:195 - Vulkan: Using device 0

GGML_VK_VISIBLE_DEVICES=1 ./sd (...)
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon RX 7600 XT (RADV NAVI33) (radv) | uma: 0 | fp16: 1 | warp size: 64 | shared memory: 65536 | matrix cores: KHR_coopmat
[INFO ] stable-diffusion.cpp:195 - Vulkan: Using device 0

But it's still useful when used together:

GGML_VK_VISIBLE_DEVICES=0,1 SD_VK_DEVICE=1 ./sd (...)
ggml_vulkan: Found 2 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon Vega 11 Graphics (RADV RAVEN) (radv) | uma: 1 | fp16: 1 | warp size: 64 | shared memory: 65536 | matrix cores: none
ggml_vulkan: 1 = AMD Radeon RX 7600 XT (RADV NAVI33) (radv) | uma: 0 | fp16: 1 | warp size: 64 | shared memory: 65536 | matrix cores: KHR_coopmat
[INFO ] stable-diffusion.cpp:195 - Vulkan: Using device 1

As I understand it, ggml excludes the iGPU by default, so it doesn't even show up on the device list unless it's explicitly enabled.

(aside: I really wish there was a way to specify the devices by a stable ID. Notice how the '0' device points to different GPUs depending on the context...)

@wbruna wbruna mentioned this pull request

Jun 19, 2025

Allow @stduhpf and @Green-Sky to approve PRs #686

Open

@stduhpf

Copy link

Contributor Author

stduhpf commented Jul 25, 2025

Ok I finally compiled sdcpp with Rocm, and at least it's selecting device 0 by default (and it's running faster than I thought compared to Vulkan), but It makes me think having an option to select backend device when initializing context could be a nice idea for all backends.

I'm thining maybe passing --device ID via cli, and also a --list-devices argument to know which id corresponds to what device?

It could also be worth it to find a way to select device for model parts, like end text encoders and VAE to device 1 and diffusion model to device 0 for example?

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vulkan: Select device with env variable, and skip initialize for unused devices. #629

Are you sure you want to change the base?

Vulkan: Select device with env variable, and skip initialize for unused devices. #629

Uh oh!

Conversation

@stduhpf stduhpf commented Mar 14, 2025 •

edited

Loading

Uh oh!

Uh oh!

wbruna commented Jun 19, 2025

Uh oh!

stduhpf commented Jul 25, 2025

Uh oh!

Uh oh!

Vulkan: Select device with env variable, and skip initialize for unused devices. #629

Are you sure you want to change the base?

Vulkan: Select device with env variable, and skip initialize for unused devices. #629

Uh oh!

Conversation

@stduhpf stduhpf commented Mar 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wbruna commented Jun 19, 2025

Uh oh!

stduhpf commented Jul 25, 2025

Uh oh!

Uh oh!

@stduhpf stduhpf commented Mar 14, 2025 •

edited

Loading