Caching Acceleration
These variables configure caching acceleration for Diffusion Transformer (DiT) models. SGLang supports multiple caching strategies; see the caching documentation for an overview.
Cache-DiT Configuration
See the cache-dit documentation for detailed configuration.
| Environment Variable | Default | Description |
|---|---|---|
| | false | Enable Cache-DiT acceleration |
| | 1 | First N blocks to always compute |
| | 0 | Last N blocks to always compute |
| | 4 | Warmup steps before caching |
| | 0.24 | Residual difference threshold |
| | 3 | Max continuous cached steps |
| | false | Enable TaylorSeer calibrator |
| | 1 | TaylorSeer order (1 or 2) |
| | none | SCM preset (none/slow/medium/fast/ultra) |
| | dynamic | SCM caching policy |
| | not set | Custom SCM compute bins |
| | not set | Custom SCM cache bins |
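
To make the interaction between the warmup, threshold, and max-continuous settings more concrete, here is a minimal sketch of a per-step cache decision. It is not cache-dit's actual implementation: the `CachePolicy` class, the `plan_steps` function, and the assumption that one residual-difference value is available per step are all hypothetical and exist only for illustration. The default values are taken from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CachePolicy:
    """Hypothetical container for three of the settings listed above."""
    warmup_steps: int = 4              # steps that always compute before caching may start
    residual_threshold: float = 0.24   # reuse the cache only when the difference is below this
    max_continuous_cached: int = 3     # cap on back-to-back cached steps


def plan_steps(residual_diffs, policy=CachePolicy()):
    """Return "compute" or "cache" for each step.

    residual_diffs is assumed to hold one residual-difference value per
    denoising step (an assumption made for this sketch).
    """
    decisions = []
    cached_run = 0  # number of consecutive cached steps so far
    for step, diff in enumerate(residual_diffs):
        can_cache = (
            step >= policy.warmup_steps
            and diff < policy.residual_threshold
            and cached_run < policy.max_continuous_cached
        )
        if can_cache:
            decisions.append("cache")
            cached_run += 1
        else:
            decisions.append("compute")
            cached_run = 0  # a full compute resets the run of cached steps
    return decisions


diffs = [0.9, 0.7, 0.5, 0.3, 0.1, 0.1, 0.1, 0.1, 0.1, 0.4]
print(plan_steps(diffs))
```

With these inputs the first four steps compute because of the warmup, the next three are served from cache, the eighth step is forced to compute by the max-continuous cap, and the last step computes because its difference exceeds the threshold. In this sketch, raising the threshold or the cap trades accuracy for speed.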
Cloud Storage
These variables configure S3-compatible cloud storage for automatically uploading generated images and videos.
| Environment Variable | Default | Description |
|---|---|---|
| | not set | Set to |
| | not set | The name of the S3 bucket |
| | not set | Custom endpoint URL (for MinIO, OSS, etc.) |
| | us-east-1 | AWS region name |
| | not set | AWS Access Key ID |
| | not set | AWS Secret Access Key |
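
As a rough illustration of how settings like these typically map onto an S3-compatible client, the sketch below builds `boto3` client arguments from a plain dictionary. It is not SGLang's upload code: the dictionary keys and the `s3_client_kwargs` and `upload_output` helpers are hypothetical names chosen for this example. Only `boto3.client` and `upload_file` are real `boto3` calls, and `boto3` is imported lazily so the argument builder can be run without it.

```python
def s3_client_kwargs(settings):
    """Build keyword arguments for boto3.client("s3", ...).

    `settings` is a hypothetical dict standing in for the values in the
    table above; its keys are illustrative, not SGLang's variable names.
    """
    # Fall back to the documented default region when none is given.
    kwargs = {"region_name": settings.get("region_name") or "us-east-1"}
    # A custom endpoint is what lets the same client talk to MinIO, OSS, etc.
    if settings.get("endpoint_url"):
        kwargs["endpoint_url"] = settings["endpoint_url"]
    # Pass explicit credentials only when both halves are present;
    # otherwise boto3 falls back to its own credential lookup.
    if settings.get("access_key_id") and settings.get("secret_access_key"):
        kwargs["aws_access_key_id"] = settings["access_key_id"]
        kwargs["aws_secret_access_key"] = settings["secret_access_key"]
    return kwargs


def upload_output(local_path, key, settings):
    """Upload one generated file to the configured bucket (requires boto3)."""
    import boto3  # third-party dependency, imported only when uploading

    client = boto3.client("s3", **s3_client_kwargs(settings))
    client.upload_file(local_path, settings["bucket"], key)


minio_settings = {
    "bucket": "generated-media",              # placeholder bucket name
    "endpoint_url": "http://localhost:9000",  # placeholder MinIO endpoint
    "access_key_id": "example-key",
    "secret_access_key": "example-secret",
}
print(s3_client_kwargs(minio_settings))
```

Leaving the endpoint unset makes the client target AWS S3 in the chosen region, while setting it redirects the same calls to a compatible service.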