Compatibility Matrix#
The table below shows every supported model and the optimizations supported for them.
The symbols used have the following meanings:
✅ = Full compatibility
❌ = No compatibility
⭕ = Does not apply to this model
Models x Optimization#
The HuggingFace Model ID can be passed directly to from_pretrained() methods, and sglang-diffusion will use the
optimal
default parameters when initializing and generating videos.
Video Generation Models#
Model Name |
Hugging Face Model ID |
Resolutions |
TeaCache |
Sliding Tile Attn |
Sage Attn |
Video Sparse Attention (VSA) |
Sparse Linear Attention (SLA) |
Sage Sparse Linear Attention (SageSLA) |
|---|---|---|---|---|---|---|---|---|
FastWan2.1 T2V 1.3B |
|
480p |
⭕ |
⭕ |
⭕ |
✅ |
❌ |
❌ |
FastWan2.2 TI2V 5B Full Attn |
|
720p |
⭕ |
⭕ |
⭕ |
✅ |
❌ |
❌ |
Wan2.2 TI2V 5B |
|
720p |
⭕ |
⭕ |
✅ |
⭕ |
❌ |
❌ |
Wan2.2 T2V A14B |
|
480p |
❌ |
❌ |
✅ |
⭕ |
❌ |
❌ |
Wan2.2 I2V A14B |
|
480p |
❌ |
❌ |
✅ |
⭕ |
❌ |
❌ |
HunyuanVideo |
|
720×1280 |
❌ |
✅ |
✅ |
⭕ |
❌ |
❌ |
FastHunyuan |
|
720×1280 |
❌ |
✅ |
✅ |
⭕ |
❌ |
❌ |
Wan2.1 T2V 1.3B |
|
480p |
✅ |
✅ |
✅ |
⭕ |
❌ |
❌ |
Wan2.1 T2V 14B |
|
480p, 720p |
✅ |
✅ |
✅ |
⭕ |
❌ |
❌ |
Wan2.1 I2V 480P |
|
480p |
✅ |
✅ |
✅ |
⭕ |
❌ |
❌ |
Wan2.1 I2V 720P |
|
720p |
✅ |
✅ |
✅ |
⭕ |
❌ |
❌ |
TurboWan2.1 T2V 1.3B |
|
480p |
✅ |
❌ |
❌ |
❌ |
✅ |
✅ |
TurboWan2.1 T2V 14B |
|
480p |
✅ |
❌ |
❌ |
❌ |
✅ |
✅ |
TurboWan2.1 T2V 14B 720P |
|
720p |
✅ |
❌ |
❌ |
❌ |
✅ |
✅ |
TurboWan2.2 I2V A14B |
|
720p |
✅ |
❌ |
❌ |
❌ |
✅ |
✅ |
Note:
1.Wan2.2 TI2V 5B has some quality issues when performing I2V generation. We are working on fixing this issue.
2.SageSLA Based on SpargeAttn. Install it first with pip install git+https://github.com/thu-ml/SpargeAttn.git --no-build-isolation
Image Generation Models#
Model Name |
HuggingFace Model ID |
Resolutions |
|---|---|---|
FLUX.1-dev |
|
Any resolution |
FLUX.2-dev |
|
Any resolution |
FLUX.2-Klein |
|
Any resolution |
Z-Image-Turbo |
|
Any resolution |
GLM-Image |
|
Any resolution |
Qwen Image |
|
Any resolution |
Qwen Image 2512 |
|
Any resolution |
Qwen Image Edit |
|
Any resolution |
Verified LoRA Examples#
This section lists example LoRAs that have been explicitly tested and verified with each base model in the SGLang Diffusion pipeline.
Important: LoRAs that are not listed here are not necessarily incompatible. In practice, most standard LoRAs are expected to work, especially those following common Diffusers or SD-style conventions. The entries below simply reflect configurations that have been manually validated by the SGLang team.
Verified LoRAs by Base Model#
Base Model |
Supported LoRAs |
|---|---|
Wan2.2 |
|
Wan2.1 |
|
Z-Image-Turbo |
|
Qwen-Image |
|
Qwen-Image-Edit |
|
Flux |
|
Special requirements#
Sliding Tile Attention#
Currently, only Hopper GPUs (H100s) are supported.