comfyui2-2025/aimodelsWan14BT2VFusioniX_Phantom

Public

WeChat Login

Code Issues Pull requests Events Packages Insights

main

Branch

Tag

Forkfromtiaotiaomls/aimodelsWan14BT2VFusioniX_Phantom

DeeJayT<vrgamedevgirl84@users.noreply.huggingface.co>

Update README.md

0562d1f8

49 commits

FusionX_LoRa
images
videos
.gitattributes
LICENSE
README.md
Wan14BT2VFusioniX_Phantom_fp16.safetensors
Wan14BT2VFusioniX_fp16_.safetensors
Wan14Bi2vFusioniX.safetensors
Wan14Bi2vFusioniX_fp16.safetensors
WanT2V_MasterModel.safetensors
placeholder

🌀 Wan2.1_14B_FusionX

High-Performance Merged Text-to-Video Model
Built on WAN 2.1 and fused with research-grade components for cinematic motion, detail, and speed — optimized for ComfyUI and rapid iteration in as few as 6 steps.

Merged models for faster, richer motion & detail — high performance even at just 8 steps.

📌 Important: To match the quality shown here, use the linked workflows or make sure to follow the recommended settings outlined below.

🚨✨Hey guys! Just a quick update!

We finally cooked up FusionX LoRAs!! 🧠💥
This is huge – now you can plug FusionX into your favorite workflows as a LoRA on top of the Wan base models and SkyReels models!🔌💫 You can still stick with the base FusionX Model if you already use it, but if you would rather have more control over the "FusionX" strength and a speed boost, then this might be for you.

Oh, and there’s a nice speed boost too! ⚡
Example: (RTX 5090)

FusionX as a full base model: 8 steps = 160s ⏱️
FusionX as a LoRA on Wan 2.1 14B fp8 T2V: 8 steps = 120s 🚀

Bonus: You can bump up the FusionX LoRA strength and lower your steps for a huge speed boost while testing/drafting.
Example: strength 2.00 with 3 steps takes 72 seconds.
Or lower the strength to experiment with a less “FusionX” look. ⚡🔍

We’ve got:

T2V (Text to Video) 🎬 – works perfectly with VACE ⚙️
I2V (Image to Video) 🖼️➡️📽️
A dedicated Phantom LoRA 👻
The new LoRA's are HERE Note: The LoRa's are not meant to be put on top of the FusionX main models and instead you would use them with the Wan base models. New workflows are HERE 🛠️🚀

After lots of testing 🧪, the video quality with the LoRA is just as good (and sometimes even better! 💯)
That’s thanks to it being trained on the fp16 version of FusionX 🧬💎

🌀 Preview Gallery

These are compressed GIF previews for quick viewing — final video outputs are higher quality.

FusionX_00020
FusionX_00021
FusionX_00022
FusionX_00023
FusionX_00024
FusionX_00025
FusionX_00026
FusionX_00027
FusionX_00028
FusionX_00029
FusionX_00030
FusionX_00031

📂 Workflows & Model Downloads

💡 ComfyUI workflows can be found here:
👉 Workflow Collection (WIP)
📦 Model files (T2V, I2V, Phantom, VACE):
👉 Main Hugging Face Repo

🧠 GGUF Variants:

🎬 Example Videos

Want to see what FusionX can do? Check out these real outputs generated using the latest workflows and settings:

Text-to-Video
👉 Watch Examples
Image-to-Video
👉 Watch Examples
Phantom Mode
👉 Watch Examples
VACE Integration
👉 Watch Examples

🚀 Overview

A powerful text-to-video model built on top of WAN 2.1 14B, merged with several research-grade models to boost:

Motion quality
Scene consistency
Visual detail

Comparable with closed-source solutions, but open and optimized for ComfyUI workflows.

💡 Inside the Fusion

This model includes the following merged components:

CausVid – Causal motion modeling for better flow and dynamics
AccVideo – Better temporal alignment and speed boost
MoviiGen1.1 – Cinematic smoothness and lighting
MPS Reward LoRA – Tuned for motion and detail
Custom LoRAs – For texture, clarity, and small detail enhancements

All merged models use permissive open licenses (Apache 2.0 / MIT).

🔧 Usage Details

Text-to-Video

CGF: Must be set to 1
Shift:
- 1024x576: Start at 1
- 1080x720: Start at 2
- For realism → lower values
- For stylized → test 3–9
Scheduler:
- Recommended: uni_pc
- Alternative: flowmatch_causvid (better for some details)

Image-to-Video

CGF: 1
Shift: 2 works best in most cases
Scheduler:
- Recommended: dmp++_sde/beta
To boost motion and reduce slow-mo effect:
- Frame count: 121
- FPS: 24

🛠 Technical Notes

Works in as few as 6 steps
Best quality at 8–10 steps
Drop-in replacement for Wan2.1-T2V-14B
Up to 50% faster rendering, especially with SageAttn
Works natively and with Kaji Wan Wrapper
Wrapper GitHub
Do not re-add merged LoRAs (CausVid, AccVideo, MPS)
Feel free to add other LoRAs for style/variation
Native WAN workflows also supported (slightly slower)

🧪 Performance Tips

RTX 5090 → ~138 sec/video at 1024x576 / 81 frames
If VRAM is limited:
- Enable block swapping
- Start with 5 blocks and adjust as needed
Use SageAttn for ~30% speedup (wrapper only)
Do not use teacache
"Enhance a video" (tested): Adds vibrance (try values 2–4)
"SLG" not tested — feel free to explore

🧠 Prompt Help

Want better cinematic prompts? Try the WAN Cinematic Video Prompt Generator GPT — it adds visual richness and makes a big difference in quality. Download Here

📣 Join The Community

We’re building a friendly space to chat, share outputs, and get help.

Motion LoRAs coming soon
Tips, updates, and support from other users

👉 Join the Discord

⚖️ License

Merged under permissive licenses:

Apache 2.0 / MIT
You can use, modify, and redistribute
You must retain original license info
Outputs are not necessarily licensed — do your due diligence

This model is for research, education, and personal use only. Commercial use is your own responsibility. Please consult a legal advisor before monetizing outputs.