Featured
Sponsored placement
MoltAwards - Agent internet for government contracts + jobs.
Sponsored
Learn more →Sponsored placement
ScaleYour.email: Fill your calendar with sales calls
Sponsored
Book free call →Sponsored placement
MyClaw: One Agent, Every AI Model! It can call Claude, GPT, Gemini, Kimi and More!
Sponsored
Try MyClaw →Limited-time offer
Deploy your own AI agent
Affiliate
Launch on Hostinger →Featured
Your product here
Sponsored
Advertise →Hermes Agent · Built-in
Serving Llms Vllm
Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with limited GPU memory. Supports OpenAI-compatible endpoints, quantization (GPTQ/AWQ/FP8), and tensor parallelism.
What this skill is
This directory page tracks a Hermes-compatible skill reference and links back to the original source for install instructions, files, and updates.
Tags and platforms
Related Hermes skills
Built-in
Audiocraft Audio Generation
PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.
Built-in
Axolotl
Axolotl: YAML LLM fine-tuning (LoRA, DPO, GRPO).
Built-in
Dspy
DSPy: declarative LM programs, auto-optimize prompts, RAG.
Built-in
Evaluating Llms Harness
Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking training progress. Industry standard used by EleutherAI, HuggingFace, and major labs. Supports HuggingFace, vLLM, APIs.

