> ## Documentation Index
> Fetch the complete documentation index at: https://docs.sglang.io/llms.txt
> Use this file to discover all available pages before exploring further.

# Supported models

> See which families of SGLang-compatible models are actively maintained.

SGLang supports model families across text generation, retrieval, and reward workflows. Browse the sections below for the primary product paths and jump to the detail pages when you are ready to explore a specific class.

### Text generation

<CardGroup cols={3}>
  <Card title="Large language models" mode="card" className="max-w-sm mx-auto" href="./supported-models/generative_models" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/LLM-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=80bc5d25ca2efb94e7f03f7eeb273966" width="940" height="525" data-path="cards/LLM-card.png">
    Production-tuned Llama and Qwen families validated for high-throughput
    serving.
  </Card>

  <Card title="Vision language models" mode="card" className="max-w-sm mx-auto" href="./supported-models/multimodal_language_models" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/VLM-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=5e67e4ee203c70433f89d235342c2913" width="940" height="525" data-path="cards/VLM-card.png">
    Vision-text hybrids that stay responsive on multi-GPU setups.
  </Card>

  <Card title="Diffusion language models" mode="card" className="max-w-sm mx-auto" href="./sglang-diffusion/index" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/dLLM-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=c5c712f679d8b30140e4b2bb7441d678" width="940" height="525" data-path="cards/dLLM-card.png">
    Score-based and diffusion backbones for structured text generation
    workflows.
  </Card>
</CardGroup>

### Retrieval and ranking

<CardGroup cols={3}>
  <Card title="Embedding models" mode="card" className="max-w-sm mx-auto" href="./supported-models/embedding_models" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/Embedding-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=40c6a4cc650ab767eec7c55bbf38dfae" width="940" height="525" data-path="cards/Embedding-card.png">
    Dense and sparse embeddings optimized with FlashInfer kernels.
  </Card>

  <Card title="Rerank models" mode="card" className="max-w-sm mx-auto" href="./supported-models/rerank_models" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/Rerank-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=7ac79d52ce265d28d6f857c6e3eb915f" width="940" height="525" data-path="cards/Rerank-card.png">
    Low-latency rerankers for multi-stage retrieval pipelines.
  </Card>

  <Card title="Classification models" mode="card" className="max-w-sm mx-auto" href="./supported-models/classify_models" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/Classification-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=5a18c67e4c53bcf34c372fe89a44d5e4" width="940" height="525" data-path="cards/Classification-card.png">
    Lightweight classifiers covering safety, intent, and context filters.
  </Card>
</CardGroup>

### Specialized models

<CardGroup cols={3}>
  <Card title="Reward models" mode="card" className="max-w-sm mx-auto" href="./supported-models/reward_models" img="https://mintcdn.com/lmsysorg/9EeweA8O0P-HvuCu/cards/Reward-card.png?fit=max&auto=format&n=9EeweA8O0P-HvuCu&q=85&s=56bdba827b92d64f0a7fa24ab635d1f9" width="940" height="525" data-path="cards/Reward-card.png">
    RLHF and reward scoring pipelines optimized for production latency.
  </Card>
</CardGroup>
