Abstract
Diffusion-based image generation models have achieved great success in recent years, demonstrating the capability to synthesize high-quality content. However, these models contain a huge number of parameters, resulting in significantly large model sizes. Saving and transferring them is a major bottleneck for various applications, especially those running on resource-constrained devices. In this work, we develop a novel weight quantization method that quantizes the UNet from Stable Diffusion v1.5 to 1.99 bits, achieving a 7.9X smaller model that exhibits even better generation quality than the original. Our approach includes several novel techniques, such as assigning optimal bits to each layer, initializing the quantized model for better performance, and improving the training strategy to dramatically reduce quantization error. Furthermore, we extensively evaluate our quantized model across various benchmark datasets and through human evaluation to demonstrate its superior generation quality.
Paper: https://arxiv.org/abs/2406.04333
Code: https://github.com/snap-research/BitsFusion (coming soon)
Project Page: https://snap-research.github.io/BitsFusion/
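For intuition about how a fractional average like 1.99 bits can arise, here's a minimal mixed-precision quantization sketch in PyTorch. This is not the paper's actual algorithm: the layer names, bit assignments, tensor sizes, and the `quantize_uniform` helper are all hypothetical, and it only illustrates per-layer bit allocation plus the resulting average bit width and idealized compression ratio.

```python
import torch

def quantize_uniform(w: torch.Tensor, bits: int) -> torch.Tensor:
    """Asymmetric uniform quantization: map w onto 2**bits levels, then dequantize."""
    qmax = 2 ** bits - 1
    lo, hi = w.min(), w.max()
    scale = (hi - lo) / qmax
    q = torch.round((w - lo) / scale).clamp(0, qmax)
    return q * scale + lo  # simulated ("fake") quantization for evaluation

# Hypothetical per-layer bit assignment: more sensitive layers keep more bits.
layer_bits = {
    "down_block.conv1": 3,
    "mid_block.attn":   2,
    "up_block.conv2":   1,
}
# Hypothetical weight tensors standing in for UNet layers of different sizes.
layers = {
    "down_block.conv1": torch.randn(320, 320),
    "mid_block.attn":   torch.randn(640, 640),
    "up_block.conv2":   torch.randn(1280, 1280),
}

total = sum(w.numel() for w in layers.values())
avg_bits = sum(layer_bits[n] * w.numel() for n, w in layers.items()) / total
print(f"average bits per weight: {avg_bits:.2f}")          # fractional, analogous to the paper's 1.99
print(f"ideal compression vs fp16: {16 / avg_bits:.1f}x")  # real ratios are lower due to scale/offset overhead

quantized = {n: quantize_uniform(w, layer_bits[n]) for n, w in layers.items()}
err = sum((layers[n] - quantized[n]).pow(2).mean().item() for n in layers) / len(layers)
print(f"mean per-layer MSE after quantization: {err:.4f}")
```

Per the abstract, the actual BitsFusion recipe also involves careful initialization of the quantized model and an improved training strategy to reduce quantization error, none of which a post-hoc sketch like this captures.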
8x smaller! That's pretty bonkers and awesome! I hope this will become the standard, as it would allow AI Horde workers to serve 10x more models each, and even faster (since it cuts down loading times). I hope it doesn't break LoRAs, though.
Unfortunately, this won't work with SD3, since it doesn't use a U-Net. I wonder how many people are still training 1.5 models.
What about SDXL?
It does use a U-Net, so hopefully this will work on it too.