Caching Acceleration#

These variables configure caching acceleration for Diffusion Transformer (DiT) models. SGLang supports multiple caching strategies - see caching documentation for an overview.

Cache-DiT Configuration#

See cache-dit documentation for detailed configuration.

Environment Variable

Default

Description

SGLANG_CACHE_DIT_ENABLED

false

Enable Cache-DiT acceleration

SGLANG_CACHE_DIT_FN

1

First N blocks to always compute

SGLANG_CACHE_DIT_BN

0

Last N blocks to always compute

SGLANG_CACHE_DIT_WARMUP

4

Warmup steps before caching

SGLANG_CACHE_DIT_RDT

0.24

Residual difference threshold

SGLANG_CACHE_DIT_MC

3

Max continuous cached steps

SGLANG_CACHE_DIT_TAYLORSEER

false

Enable TaylorSeer calibrator

SGLANG_CACHE_DIT_TS_ORDER

1

TaylorSeer order (1 or 2)

SGLANG_CACHE_DIT_SCM_PRESET

none

SCM preset (none/slow/medium/fast/ultra)

SGLANG_CACHE_DIT_SCM_POLICY

dynamic

SCM caching policy

SGLANG_CACHE_DIT_SCM_COMPUTE_BINS

not set

Custom SCM compute bins

SGLANG_CACHE_DIT_SCM_CACHE_BINS

not set

Custom SCM cache bins

Cloud Storage#

These variables configure S3-compatible cloud storage for automatically uploading generated images and videos.

Environment Variable

Default

Description

SGLANG_CLOUD_STORAGE_TYPE

not set

Set to s3 to enable cloud storage

SGLANG_S3_BUCKET_NAME

not set

The name of the S3 bucket

SGLANG_S3_ENDPOINT_URL

not set

Custom endpoint URL (for MinIO, OSS, etc.)

SGLANG_S3_REGION_NAME

us-east-1

AWS region name

SGLANG_S3_ACCESS_KEY_ID

not set

AWS Access Key ID

SGLANG_S3_SECRET_ACCESS_KEY

not set

AWS Secret Access Key