logo
0
0
WeChat Login

Nunchaku Logo

Model Card for nunchaku-z-image-turbo

This repository contains Nunchaku-quantized versions of Z-Image-Turbo, a high-performance image generation model. It is optimized for efficient inference while maintaining minimal loss in performance.

No recent news. Stay tuned for updates!

Model Details

Model Description

  • Developed by: Nunchaku Team (thank @devgdovg)
  • Model type: image-to-image
  • License: apache-2.0
  • Quantized from model: Z-Image-Turbo

Model Files

Data Type: INT4 for non-Blackwell GPUs (pre-50-series), NVFP4 for Blackwell GPUs (50-series). Rank:

  • r32 for faster inference,
  • r128 for better quality but slower inference,
  • r256 for highest quality (slowest inference).

Base Models

Standard inference speed models for general use

Data TypeRankModel NameComment
INT4r32svdq-int4_r32-z-image-turbo.safetensors
r128svdq-int4_r128-z-image-turbo.safetensors
r256svdq-int4_r256-z-image-turbo.safetensors
NVFP4r32svdq-fp4_r32-z-image-turbo.safetensors
r128svdq-fp4_r128-z-image-turbo.safetensors

Model Sources

Usage

Performance

performance

Citation

@inproceedings{ li2024svdquant, title={SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models}, author={Li*, Muyang and Lin*, Yujun and Zhang*, Zhekai and Cai, Tianle and Li, Xiuyu and Guo, Junxian and Xie, Enze and Meng, Chenlin and Zhu, Jun-Yan and Han, Song}, booktitle={The Thirteenth International Conference on Learning Representations}, year={2025} }

About

No description, topics, or website provided.