vLLM Recipes

Google

Google · 5 recipes · HuggingFace

Multimodal (4)

gemma-4-26B-A4B-it · 26B / 4B · moe · BF16 64G · v0.19.1+
gemma-4-31B-it · 31B · dense · BF16 210G, NVFP4 19G · v0.19.1+
gemma-4-E2B-it · 5B · dense · BF16 13G · v0.19.1+
gemma-4-E4B-it · 8B · dense · BF16 20G · v0.19.1+

Text (1)

translategemma-27b-it · 27B · dense · BF16 65G, BF16 65G, BF16 10G · v0.14.1+
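
The listing above can be read as a small lookup table: each recipe pairs a model with a precision, a size figure, and a minimum vLLM version. The sketch below encodes a subset of those entries and filters them against a memory budget and an installed vLLM version. It rests on assumptions that the page does not state: that the "G" figure is an approximate memory requirement in GB, and that "v0.19.1+" means "this version or newer". The names `Recipe`, `parse_version`, and `usable` are hypothetical helpers written for this illustration, not part of vLLM.

```python
from dataclasses import dataclass


def parse_version(text):
    """Turn a string such as 'v0.19.1+' or '0.19.1' into the tuple (0, 19, 1)."""
    return tuple(int(part) for part in text.lstrip("v").rstrip("+").split("."))


@dataclass(frozen=True)
class Recipe:
    model: str
    precision: str
    memory_gb: int   # assumption: the "G" figure on the page is memory in GB
    min_vllm: tuple  # minimum vLLM version as a tuple of ints


# A subset of the figures from the listing, copied as shown on the page.
RECIPES = [
    Recipe("gemma-4-26B-A4B-it", "BF16", 64, parse_version("v0.19.1+")),
    Recipe("gemma-4-31B-it", "BF16", 210, parse_version("v0.19.1+")),
    Recipe("gemma-4-31B-it", "NVFP4", 19, parse_version("v0.19.1+")),
    Recipe("gemma-4-E2B-it", "BF16", 13, parse_version("v0.19.1+")),
    Recipe("gemma-4-E4B-it", "BF16", 20, parse_version("v0.19.1+")),
    Recipe("translategemma-27b-it", "BF16", 65, parse_version("v0.14.1+")),
]


def usable(recipes, memory_gb, vllm_version):
    """Return recipes that fit the memory budget and the installed vLLM version."""
    installed = parse_version(vllm_version)
    return [
        r for r in recipes
        if r.memory_gb <= memory_gb and installed >= r.min_vllm
    ]


if __name__ == "__main__":
    # Example: a 24 GB budget with vLLM 0.19.1 installed.
    for recipe in usable(RECIPES, memory_gb=24, vllm_version="0.19.1"):
        print(recipe.model, recipe.precision, f"{recipe.memory_gb}G")
```

Comparing versions as integer tuples avoids the string-comparison pitfall where "0.9.0" would sort after "0.19.1". Pre-release suffixes are not handled in this sketch.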