vLLMvLLM/Recipes
DocsGitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
Jina AI
Meituan LongCat
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
NVIDIA
OpenAI
InternVL (OpenGVLab)
PaddlePaddle
Qwen
Stability AI
StepFun
Tencent Hunyuan
Wan AI
Xiaomi MiMo
GLM (Z-AI)

NVIDIA

nvidia·2 recipesHuggingFace

Multimodal

1
NVIDIA-Nemotron-Nano-12B-v2-VL-BF16
12B
dense
BF1629GFP814GNVFP48G
v0.11.1+→

Text

1
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
30B / 3B
moe
BF1672GFP835G
v0.11.2+→
GitHubRequest a recipeDocumentationSupported Models & HardwareInstall vLLMJSON API