Hermes Agent · Optional

nemo-curator

GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cleaning web data, or deduplicating large corpora.

MlopsOptionalv1.0.0MIT

What this skill is

This directory page tracks a Hermes-compatible skill reference and links back to the original source for install instructions, files, and updates.

Tags and platforms

Data ProcessingNeMo CuratorData CurationGPU AccelerationDeduplicationQuality FilteringNVIDIARAPIDSPII RedactionMultimodalLLM Training Data

Featured

Your product here

Show your offer to OpenClaw operators and AI builders across every page and blog.

Advertise