logo
0
0
WeChat Login

ACE-Step 1.5 XL — SFT (4B DiT)

Project | Hugging Face | ModelScope | Space Demo | Discord | Tech Report

Model Details

This is the XL (4B) SFT variant of ACE-Step 1.5 — a supervised fine-tuned model with ~4B parameters. SFT provides higher audio quality with CFG (Classifier-Free Guidance) support for fine-grained prompt adherence control.

XL Architecture

ParameterValue
DiT Decoder hidden_size2560
DiT Decoder layers32
DiT Decoder attention heads32
Encoder hidden_size2048
Encoder layers8
Total params~4B
Weights size (bf16)~18.8 GB
Inference steps50 (with CFG)

GPU Requirements

VRAMSupport
≥12 GBWith CPU offload + INT8 quantization
≥16 GBWith CPU offload
≥20 GBWithout offload
≥24 GBFull quality (XL + 4B LM)

All LM models (0.6B / 1.7B / 4B) are fully compatible with XL.

Key Features

  • 💰 Commercial-Ready: Trained on legally compliant datasets. Generated music can be used for commercial purposes.
  • 📚 Safe Training Data: Licensed music, royalty-free/public domain, and synthetic (MIDI-to-Audio) data.
  • 🎯 CFG Support: Fine-tune prompt adherence with guidance scale control.
  • 🔮 Highest Quality: SFT + 4B parameters = the highest quality variant.

Quick Start

# Install ACE-Step git clone https://github.com/ace-step/ACE-Step-1.5.git cd ACE-Step-1.5 pip install -e . # Download this model huggingface-cli download ACE-Step/acestep-v15-xl-sft --local-dir ./checkpoints/acestep-v15-xl-sft # Run with Gradio UI python acestep --config-path acestep-v15-xl-sft

Model Zoo

XL (4B) DiT Models

DiT ModelCFGStepsQualityDiversityTasksHugging FaceModelScope
acestep-v15-xl-base50HighHighAll (extract, lego, complete)LinkLink
acestep-v15-xl-sft50Very HighMediumStandardThis repoLink
acestep-v15-xl-turbo8Very HighMediumStandardLinkLink

LM Models (all compatible with XL)

LM ModelParamsAudio UnderstandingCompositionHugging FaceModelScope
acestep-5Hz-lm-0.6B0.6BMediumMediumLinkLink
acestep-5Hz-lm-1.7B1.7BMediumMediumIncluded in mainIncluded in main
acestep-5Hz-lm-4B4BStrongStrongLinkLink

Acknowledgements

This project is co-led by ACE Studio and StepFun.

Citation

@misc{gong2026acestep, title={ACE-Step 1.5: Pushing the Boundaries of Open-Source Music Generation}, author={Junmin Gong, Yulin Song, Wenxiao Zhao, Sen Wang, Shengyuan Xu, Jing Guo}, howpublished={\url{https://github.com/ace-step/ACE-Step-1.5}}, year={2026}, note={GitHub repository} }

About

No description, topics, or website provided.
Language
Python100%