High-Performance Merged Text-to-Video Model
Built on WAN 2.1 and fused with research-grade components for cinematic motion, detail, and speed — optimized for ComfyUI and rapid iteration in as few as 6 steps.
Merged models for faster, richer motion & detail — high performance even at just 8 steps.
📌 Important: To match the quality shown here, use the linked workflows or make sure to follow the recommended settings outlined below.
We finally cooked up FusionX LoRAs!! 🧠💥
This is huge – now you can plug FusionX into your favorite workflows as a LoRA on top of the Wan base models and SkyReels models!🔌💫
You can still stick with the base FusionX Model if you already use it, but if you would rather have more control over the "FusionX" strength and a speed boost, then this might be for you.
Oh, and there’s a nice speed boost too! ⚡
Example: (RTX 5090)
Bonus: You can bump up the FusionX LoRA strength and lower your steps for a huge speed boost while testing/drafting.
Example: strength 2.00 with 3 steps takes 72 seconds.
Or lower the strength to experiment with a less “FusionX” look. ⚡🔍
We’ve got:
After lots of testing 🧪, the video quality with the LoRA is just as good (and sometimes even better! 💯)
That’s thanks to it being trained on the fp16 version of FusionX 🧬💎
These are compressed GIF previews for quick viewing — final video outputs are higher quality.












💡 ComfyUI workflows can be found here:
👉 Workflow Collection (WIP)
📦 Model files (T2V, I2V, Phantom, VACE):
👉 Main Hugging Face Repo
Want to see what FusionX can do? Check out these real outputs generated using the latest workflows and settings:
Text-to-Video
👉 Watch Examples
Image-to-Video
👉 Watch Examples
Phantom Mode
👉 Watch Examples
VACE Integration
👉 Watch Examples
A powerful text-to-video model built on top of WAN 2.1 14B, merged with several research-grade models to boost:
Comparable with closed-source solutions, but open and optimized for ComfyUI workflows.
This model includes the following merged components:
All merged models use permissive open licenses (Apache 2.0 / MIT).
11024x576: Start at 11080x720: Start at 23–9uni_pcflowmatch_causvid (better for some details)12 works best in most casesdmp++_sde/beta12124Wan2.1-T2V-14B5 blocks and adjust as neededteacacheWant better cinematic prompts? Try the WAN Cinematic Video Prompt Generator GPT — it adds visual richness and makes a big difference in quality. Download Here
We’re building a friendly space to chat, share outputs, and get help.
Merged under permissive licenses:
This model is for research, education, and personal use only. Commercial use is your own responsibility. Please consult a legal advisor before monetizing outputs.
And thanks to the open-source community!