
PLEASE QUANTIZE MODELS (FP8 ) :) #7

Open
lijackcoder opened this issue Dec 23, 2024 · 2 comments

Comments

@lijackcoder

It would be great if you could create FP8 versions of the models :) Thanks! That would allow lower VRAM use or faster generation.

@zhuang2002
Collaborator

Thank you so much for your attention and support. To help reduce memory overhead, since ColorFlow does not rely on text conditions, I would kindly suggest avoiding the CFG strategy and instead using empty text for inference. Additionally, you could pre-save the T5 model’s output for empty text inputs, which would allow you to skip loading the T5 model entirely during inference. I’ve had quite a lot on my plate recently, but I’ll make sure to update the code with these optimizations as soon as I can. Thank you for your understanding and patience!
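The T5-caching idea described above can be sketched as follows. This is a minimal illustration, not ColorFlow's actual code: the function names, the cache path, and the encoder/tokenizer interface (a `transformers`-style `T5EncoderModel` / tokenizer pair) are all assumptions.

```python
import torch

# Hypothetical sketch: run the T5 text encoder ONCE on the empty prompt and
# cache its output, so the T5 model never has to be loaded at inference time.
# All names here are illustrative; ColorFlow's real code may differ.

CACHE_PATH = "empty_prompt_t5_embeds.pt"

def precompute_empty_prompt_embeds(text_encoder, tokenizer, device="cpu"):
    """Encode the empty string once and save the embeddings to disk.

    `text_encoder` / `tokenizer` are assumed to behave like a transformers
    T5EncoderModel and its tokenizer (assumed interface).
    """
    tokens = tokenizer("", return_tensors="pt").to(device)
    with torch.no_grad():
        embeds = text_encoder(**tokens).last_hidden_state
    torch.save(embeds.cpu(), CACHE_PATH)  # reusable across every later run
    return embeds

def load_empty_prompt_embeds():
    """At inference time, load the cached embeddings instead of loading T5."""
    return torch.load(CACHE_PATH)
```

Dropping CFG would then amount to feeding these cached embeddings as the only text condition (e.g. a guidance scale of 1.0 in diffusers-style pipelines), so no second unconditional forward pass is needed either.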

@nitinmukesh

Looking forward to memory optimizations when you get time.


3 participants