Fix image upcasting #7858

tolgacangoz · 2024-05-04T12:52:17Z

Thanks for the opportunity to fix #7854

What does this PR do?

This PR proposes to fix image upcasting before vae.encode() when using fp16 and vae.config.force_upcast==True with xformers or torch>=2.0 installed. Casting with .to(next(iter(self.vae.post_quant_conv.parameters())).dtype) is supposed to be preferred before vae.decode() not vae.encode().

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.
@yiyixuxu @kadirnar

bghira · 2024-05-06T05:45:52Z

the same issue exists in the StableDiffusionPipeline :) would you like to tackle that one too? latents must be cast to vae dtype during decode

tolgacangoz · 2024-05-06T16:45:50Z

I need to understand. Wouldn't this throw an error:

import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5",
                           torch_dtype=torch.float16, variant='fp16').to("cuda")
image = pipe("a photo of an astronaut riding a horse on mars").images[0]

Or, is it a case in MPS?

bghira · 2024-05-06T16:47:15Z

it's actually something that occurs on ROCm which masquerades as CUDA

HuggingFaceDocBuilderDev · 2024-05-08T02:29:53Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu

thanks!

tolgacangoz · 2024-05-08T06:43:10Z

Thanks for merging!

Fix image's upcasting before `vae.encode()` when using `fp16` Co-authored-by: YiYi Xu <yixu310@gmail.com>

kadirnar · 2024-05-08T19:24:29Z

Thanks @standardAI ❤️

Fix image's upcasting before vae.encode() when using fp16

6a6093d

tolgacangoz changed the title ~~Fix image upcasting before vae.encode() when using fp16~~ Fix image upcasting before vae.encode() when using fp16 May 4, 2024

tolgacangoz changed the title ~~Fix image upcasting before vae.encode() when using fp16~~ Fix image upcasting before vae.encode() when using fp16 with xformers or torch==2.0 installed May 4, 2024

tolgacangoz changed the title ~~Fix image upcasting before vae.encode() when using fp16 with xformers or torch==2.0 installed~~ Fix image upcasting before vae.encode() when using fp16 with xformers or torch>=2.0 installed May 4, 2024

tolgacangoz changed the title ~~Fix image upcasting before vae.encode() when using fp16 with xformers or torch>=2.0 installed~~ Fix image upcasting May 4, 2024

yiyixuxu approved these changes May 8, 2024

View reviewed changes

Merge branch 'main' into fix-dtype-leditspp

26aa725

yiyixuxu merged commit d50baf0 into huggingface:main May 8, 2024
15 checks passed

tolgacangoz mentioned this pull request May 8, 2024

Fix latents.dtype before vae.decode() at ROCm devices in StableDiffusionPipelines #7886

Draft

6 tasks

tolgacangoz deleted the fix-dtype-leditspp branch May 8, 2024 06:43

lawrence-cj pushed a commit to lawrence-cj/diffusers that referenced this pull request May 8, 2024

Fix image upcasting (huggingface#7858)

85f28b3

Fix image's upcasting before `vae.encode()` when using `fp16` Co-authored-by: YiYi Xu <yixu310@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix image upcasting #7858

Fix image upcasting #7858

tolgacangoz commented May 4, 2024 •

edited

Loading

bghira commented May 6, 2024

tolgacangoz commented May 6, 2024

bghira commented May 6, 2024

HuggingFaceDocBuilderDev commented May 8, 2024

yiyixuxu left a comment

tolgacangoz commented May 8, 2024

kadirnar commented May 8, 2024

Fix image upcasting #7858

Fix image upcasting #7858

Conversation

tolgacangoz commented May 4, 2024 • edited Loading

What does this PR do?

Before submitting

Who can review?

bghira commented May 6, 2024

tolgacangoz commented May 6, 2024

bghira commented May 6, 2024

HuggingFaceDocBuilderDev commented May 8, 2024

yiyixuxu left a comment

Choose a reason for hiding this comment

tolgacangoz commented May 8, 2024

kadirnar commented May 8, 2024

tolgacangoz commented May 4, 2024 •

edited

Loading