Vibevoice

Run it on Hostinger →up to 70% off + an extra 10% with code ZACAARON10 Free API →

Installation

clawhub install javier887/vibevoice

Summary

Local text-to-speech using Microsoft's VibeVoice model. Generates natural Spanish voice audio, perfect for WhatsApp voice messages.

SKILL.md

VibeVoice TTS

Local text-to-speech using Microsoft's VibeVoice model. Generates natural Spanish voice audio, perfect for WhatsApp voice messages.

Quick Start

bash

# Basic usage
{baseDir}/scripts/vv.sh "Hola, esto es una prueba" -o /tmp/audio.ogg

# From file
{baseDir}/scripts/vv.sh -f texto.txt -o /tmp/audio.ogg

# Different voice
{baseDir}/scripts/vv.sh "Texto" -v en-Wayne -o /tmp/audio.ogg

# Adjust speed (0.5-2.0)
{baseDir}/scripts/vv.sh "Texto" -s 1.2 -o /tmp/audio.ogg

Configuration

Setting	Default	Description
Voice	`sp-Spk1_man`	Spanish male voice (slight Mexican accent)
Speed	`1.15`	15% faster than normal
Format	`.ogg`	Opus codec for WhatsApp

Available Voices

Spanish:

sp-Spk1_man - Male, slight Mexican accent (default)

English:

en-Wayne - Male
en-Denise - Female
Other voices in ~/VibeVoice/demo/voices/streaming_model/

Output Formats

.ogg - Opus codec (WhatsApp compatible, recommended)
.mp3 - MP3 format
.wav - Uncompressed WAV

For WhatsApp

Always use .ogg format with asVoice=true in the message tool:

bash

# Generate
{baseDir}/scripts/vv.sh "Tu mensaje aquí" -o /tmp/mensaje.ogg

# Send via message tool
message action=send channel=whatsapp to="+34XXXXXXXXX" filePath=/tmp/mensaje.ogg asVoice=true

Requirements

GPU: NVIDIA with ~2GB VRAM
VibeVoice: Installed at ~/VibeVoice
ffmpeg: For audio conversion
Python 3.10+: With torch, torchaudio

Performance

RTF: ~0.24x (generates faster than realtime)
1 minute of audio ≈ 15 seconds to generate

Notes

First run loads model (~10s), subsequent runs are faster
Audio rule: Only send voice if user requests it or speaks via audio
Keep text under 1500 chars for best quality

Score

0–100

45/ 100

Grade

Popularity10/30

461 installs — early traction.

Completeness18/30

Documented: description, one-line install, catalog metadata. Missing: SKILL.md body.

Trust6/25

Limited provenance information — review the source before installing.

Freshness11/15

Updated within the last 6 months.

Scored automatically from popularity, completeness, trust, and freshness — computed only from data in our catalog, never fabricated.

Proud of your score? Add this badge to your README.

Paste a snippet into your GitHub README. The badge updates automatically and links back to this page.

Score badge

Markdown

[![Vibevoice skill](https://www.remoteopenclaw.com/skills/javier887/vibevoice/badges/score.svg)](https://www.remoteopenclaw.com/skills/javier887/vibevoice)

HTML

<a href="https://www.remoteopenclaw.com/skills/javier887/vibevoice"><img src="https://www.remoteopenclaw.com/skills/javier887/vibevoice/badges/score.svg" alt="Vibevoice skill"/></a>