logo
0
0
WeChat Login

🌀 Wan2.1_14B_FusionX

High-Performance Merged Text-to-Video Model
Built on WAN 2.1 and fused with research-grade components for cinematic motion, detail, and speed — optimized for ComfyUI and rapid iteration in as few as 6 steps.

Merged models for faster, richer motion & detail — high performance even at just 8 steps.

📌 Important: To match the quality shown here, use the linked workflows or make sure to follow the recommended settings outlined below.


🚨✨Hey guys! Just a quick update!

We finally cooked up FusionX LoRAs!! 🧠💥
This is huge – now you can plug FusionX into your favorite workflows as a LoRA on top of the Wan base models and SkyReels models!🔌💫 You can still stick with the base FusionX Model if you already use it, but if you would rather have more control over the "FusionX" strength and a speed boost, then this might be for you.

Oh, and there’s a nice speed boost too! ⚡
Example: (RTX 5090)

  • FusionX as a full base model: 8 steps = 160s ⏱️
  • FusionX as a LoRA on Wan 2.1 14B fp8 T2V: 8 steps = 120s 🚀

Bonus: You can bump up the FusionX LoRA strength and lower your steps for a huge speed boost while testing/drafting.
Example: strength 2.00 with 3 steps takes 72 seconds.
Or lower the strength to experiment with a less “FusionX” look. ⚡🔍

We’ve got:

  • T2V (Text to Video) 🎬 – works perfectly with VACE ⚙️
  • I2V (Image to Video) 🖼️➡️📽️
  • A dedicated Phantom LoRA 👻
    The new LoRA's are HERE Note: The LoRa's are not meant to be put on top of the FusionX main models and instead you would use them with the Wan base models. New workflows are HERE 🛠️🚀

After lots of testing 🧪, the video quality with the LoRA is just as good (and sometimes even better! 💯)
That’s thanks to it being trained on the fp16 version of FusionX 🧬💎


🌀 Preview Gallery

These are compressed GIF previews for quick viewing — final video outputs are higher quality.

FusionX_00020
FusionX_00021
FusionX_00022
FusionX_00023
FusionX_00024
FusionX_00025
FusionX_00026
FusionX_00027
FusionX_00028
FusionX_00029
FusionX_00030
FusionX_00031


📂 Workflows & Model Downloads

🧠 GGUF Variants:


🎬 Example Videos

Want to see what FusionX can do? Check out these real outputs generated using the latest workflows and settings:


🚀 Overview

A powerful text-to-video model built on top of WAN 2.1 14B, merged with several research-grade models to boost:

  • Motion quality
  • Scene consistency
  • Visual detail

Comparable with closed-source solutions, but open and optimized for ComfyUI workflows.


💡 Inside the Fusion

This model includes the following merged components:

  • CausVid – Causal motion modeling for better flow and dynamics
  • AccVideo – Better temporal alignment and speed boost
  • MoviiGen1.1 – Cinematic smoothness and lighting
  • MPS Reward LoRA – Tuned for motion and detail
  • Custom LoRAs – For texture, clarity, and small detail enhancements

All merged models use permissive open licenses (Apache 2.0 / MIT).


🔧 Usage Details

Text-to-Video

  • CGF: Must be set to 1
  • Shift:
    • 1024x576: Start at 1
    • 1080x720: Start at 2
    • For realism → lower values
    • For stylized → test 3–9
  • Scheduler:
    • Recommended: uni_pc
    • Alternative: flowmatch_causvid (better for some details)

Image-to-Video

  • CGF: 1
  • Shift: 2 works best in most cases
  • Scheduler:
    • Recommended: dmp++_sde/beta
  • To boost motion and reduce slow-mo effect:
    • Frame count: 121
    • FPS: 24

🛠 Technical Notes

  • Works in as few as 6 steps
  • Best quality at 8–10 steps
  • Drop-in replacement for Wan2.1-T2V-14B
  • Up to 50% faster rendering, especially with SageAttn
  • Works natively and with Kaji Wan Wrapper
    Wrapper GitHub
  • Do not re-add merged LoRAs (CausVid, AccVideo, MPS)
  • Feel free to add other LoRAs for style/variation
  • Native WAN workflows also supported (slightly slower)

🧪 Performance Tips

  • RTX 5090 → ~138 sec/video at 1024x576 / 81 frames
  • If VRAM is limited:
    • Enable block swapping
    • Start with 5 blocks and adjust as needed
  • Use SageAttn for ~30% speedup (wrapper only)
  • Do not use teacache
  • "Enhance a video" (tested): Adds vibrance (try values 2–4)
  • "SLG" not tested — feel free to explore

🧠 Prompt Help

Want better cinematic prompts? Try the WAN Cinematic Video Prompt Generator GPT — it adds visual richness and makes a big difference in quality. Download Here


📣 Join The Community

We’re building a friendly space to chat, share outputs, and get help.

  • Motion LoRAs coming soon
  • Tips, updates, and support from other users

👉 Join the Discord


⚖️ License

Merged under permissive licenses:

  • Apache 2.0 / MIT
  • You can use, modify, and redistribute
  • You must retain original license info
  • Outputs are not necessarily licensed — do your due diligence

This model is for research, education, and personal use only. Commercial use is your own responsibility. Please consult a legal advisor before monetizing outputs.


🙏 Credits

  • WAN Team (base model)
  • aejion (AccVideo)
  • Tianwei Yin (CausVid)
  • ZuluVision (MoviiGen)
  • Alibaba PAI (MPS LoRA)
  • Kijai (ComfyUI Wrapper)

And thanks to the open-source community!


About

No description, topics, or website provided.