evaluating-llms-harness
Skill / mlops
evaluating-llms-harness
lm-eval-harness: benchmark LLMs (MMLU, GSM8K, etc.).
How to use
This public page intentionally shows the skill name, category, and high-level description only. Full runtime instructions stay in Hermes where they can include operational guardrails.