vLLMvLLM/Recipes
DocsGitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
Jina AI
Meituan LongCat
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
NVIDIA
OpenAI
InternVL (OpenGVLab)
PaddlePaddle
Qwen
Stability AI
StepFun
Tencent Hunyuan
Wan AI
Xiaomi MiMo
GLM (Z-AI)

Qwen

Qwen·10 recipesHuggingFace

Multimodal

5
Qwen3.6-35B-A3B
35B / 3B
moe
BF1684GFP842G
v0.17.0+→
Qwen3.5-397B-A17B
397B / 17B
moe
BF16953GNVFP4238G
v0.17.0+→
Qwen3-ASR-1.7B
2.3B
dense
BF164G
v0.12.0+→
Qwen3-VL-235B-A22B-Instruct
235B / 22B
moe
BF16564GFP8282GNVFP4141G
v0.11.0+→
Qwen2.5-VL-72B-Instruct
72B
dense
BF16173GINT443G
v0.7.0+→

Text

4
Qwen3Guard-Gen-8B
8B
dense
BF1619GBF1610GBF164G
v0.10.0+→
Qwen3-Next-80B-A3B-Instruct
80B / 3B
moe
BF16192GFP896GNVFP448G
v0.10.0+→
Qwen3-Coder-480B-A35B-Instruct
480B / 35B
moe
BF161152GFP8576GNVFP4288G
v0.10.0+→
Qwen3-235B-A22B-Instruct-2507
235B / 22B
moe
BF16564GFP8240GNVFP4141G
v0.10.0+→

Omni

1
Qwen-Image
20B
dense
BF1648GFP824GINT824G
v0.18.0+→
GitHubRequest a recipeDocumentationSupported Models & HardwareInstall vLLMJSON API