Quick overview
Runs image, video, talking head, and voice synthesis jobs on a remote GPU server over SSH. Scripts wrap ComfyUI, SadTalker, and Voxtral pipelines and return output file paths when generation completes. Supports multiple models per media type with configurable style, duration, and language parameters.
Offloads GPU-intensive generation to a dedicated server so the local machine stays free and jobs run in the background without blocking other work.
Common tasks
- Generating photorealistic images from text prompts for mockups or content
- Creating short AI-generated videos for social media or product demos
- Animating a static portrait photo into a lip-synced talking head video
- Synthesizing voiceover audio in multiple languages and voice genders
- Producing agent-narrated demo videos with an animated avatar and synced speech
Install paths
Primary command
openclaw install bowen31337/ai-media
ClawHub installer
npx clawhub@latest install bowen31337/ai-media
OpenClaw CLI
openclaw skills install bowen31337/ai-media
Direct OpenClaw install
openclaw install bowen31337/ai-media
Skill metadata
- Category: DevOps & Cloud
- Language: Markdown
- Version: 1.0.1
- Security status: Suspicious
Review upstream source
The full public SKILL.md body is not directly fetchable for this entry right now, so this page is using the best available catalog metadata. Review the upstream source page for the latest files, version history, and security scan details: https://clawhub.ai/bowen31337/ai-media



