This is an IC-LoRA trained on top of LTX-2.3-22b, enabling 16 bit High Dynamic Range generations from the LTX model. This allows both Text/Image driven generations as well as video conversion from 8 bit SDR to 16 bit HDR.
It is based on the LTX-2 foundation model.
IC LoRA enables conditioning video generation on reference video frames at inference time, allowing fine-grained video-to-video control on top of a text-to-video, base model. It allows also the usage of an initial image for image-to-video, and generate audio-visual output.
IC LoRA uses a reference control signal, i.e. a video that is positionally aligned to the generated video and contains the reference for context. To allow for added efficiency, the reference video can be smaller, so it consumes less tokens. The reference downscale factor determines the expected downscaling of the reference video compared to the generated resolution. To signify the expected reference size, the checkpoint name will have a 'ref' denominator followed by the scale relative to the output resolution.
ltx-2.3-22b-ic-lora-hdr-x.x.safetensors
See the LTX-2-community-license for full terms.
models/loras.The model was trained using proprietary HDR dataset.
@article{korem2026hdr,
title={HDR Video Generation via Latent Alignment with Logarithmic Encoding},
author={Korem, Naomi Ken and Oumoumad, Mohamed and Cain, Harel and Yosef, Matan Ben and Jelercic, Urska and Bibi, Ofir and Inger, Yaron and Patashnik, Or and Cohen-Or, Daniel},
journal={arXiv preprint arXiv:2604.11788},
year={2026}
}