serving-llms-vllm
Skill / mlops
vLLM: high-throughput LLM serving with an OpenAI-compatible API and quantization support.
How to use
This public page intentionally shows only the skill name, category, and high-level description. Full runtime instructions stay in Hermes, where they can include operational guardrails.