Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Commit 15619e7

Browse files
sayakpaulstevhliu
andauthored
Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
1 parent 6a8b3c9 commit 15619e7

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

‎docs/source/en/quantization/overview.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,6 +36,8 @@ Initialize [`~quantizers.PipelineQuantizationConfig`] with the following paramet
3636
3737
- `components_to_quantize` specifies which component(s) of the pipeline to quantize. Typically, you should quantize the most compute intensive components like the transformer. The text encoder is another component to consider quantizing if a pipeline has more than one such as [`FluxPipeline`]. The example below quantizes the T5 text encoder in [`FluxPipeline`] while keeping the CLIP model intact.
3838

39+
`components_to_quantize` accepts either a list for multiple models or a string for a single model.
40+
3941
The example below loads the bitsandbytes backend with the following arguments from [`~quantizers.quantization_config.BitsAndBytesConfig`], `load_in_4bit`, `bnb_4bit_quant_type`, and `bnb_4bit_compute_dtype`.
4042

4143
```py
@@ -62,7 +64,6 @@ pipe = DiffusionPipeline.from_pretrained(
6264
image = pipe("photo of a cute dog").images[0]
6365
```
6466

65-
`components_to_quantize` doesn't have to be a list. You can also pass: `components_to_quantize="transformer"`.
6667

6768
### Advanced quantization
6869

0 commit comments

Comments
(0)

AltStyle によって変換されたページ (->オリジナル) /