All categories 27 skills

Speech & Transcription Skills & Plugins

Showing 27 curated AI tools in the Speech & Transcription category. Extend your OpenClaw (formerly Moltbot) local assistant with these safe, install-ready ClawHub integrations.

Speech & Transcription

Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the user needs to convert an audio file to text (specifically Amharic), or translate text between languages (e.g., Amharic to English). Requires 'x-api-key'.

npx clawhub@latest install addis-assistant-stt
Speech & Transcription

Command-line blogging platform for AI agents. Register, verify, and publish markdown posts to AI Agent Blogs (www.eggbrt.com). Use when agents need to publish blog posts, share learnings, document discoveries, or maintain a public knowledge base. Full API support for publishing, discovery (browse all blogs/posts), comments, and voting. Requires API key (stored in ~/.agent-blog-key or AGENT_BLOG_API_KEY env var) for write operations; browsing is unauthenticated. Complete OpenAPI 3.0 specification available.

npx clawhub@latest install agent-voice
Speech & Transcription

Interact with Akaunting open-source accounting software via REST API. Use for creating invoices, tracking income/expenses, managing accounts, and bookkeeping automation. Triggers on accounting, bookkeeping, invoicing, expenses, income tracking, or Akaunting mentions.

npx clawhub@latest install akaunting
Speech & Transcription

Control Amazon Alexa devices and smart home via the `alexacli` CLI.

npx clawhub@latest install alexa-cli
Speech & Transcription

Announce text throughout the house via AirPlay speakers using Airfoil + ElevenLabs TTS.

npx clawhub@latest install announcer
Speech & Transcription

Transcribe audio/video with AssemblyAI (local upload or URL), plus subtitles + paragraph/sentence exports.

npx clawhub@latest install assemblyai-transcribe
Speech & Transcription

Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it to high-quality audio. Supports multiple formats (audiobook, podcast, educational), custom lengths, and voice effects. Use when asked to create audio content, make a podcast, generate an audiobook, or produce educational audio. Returns MP3 audio file via MEDIA token.

npx clawhub@latest install audio-gen
Speech & Transcription

Generate audio replies using TTS. Trigger with "read it to me [public URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken response. Also responds to "speak", "say it", "voice reply".

npx clawhub@latest install audio-reply-skill
Speech & Transcription

RAM-safe voice transcription with auto-chunking — works on 16GB machines without crashes

npx clawhub@latest install auto-whisper-safe
Speech & Transcription

Remove AI-generated jargon and restore human voice to text.

npx clawhub@latest install brw-de-ai-ify
Speech & Transcription

A RESTful service for high-quality text-to-speech using Qwen3 and specialized voice cloning. Optimized for reusing a specific voice prompt to avoid re-computation.

npx clawhub@latest install chichi-speech
Speech & Transcription

You are connected to a live user session via voice.

npx clawhub@latest install niczy
Speech & Transcription

Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) Clone their voice or someone else's voice, (2) Generate speech that sounds like a specific person, (3) Create personalized voice messages, (4) Multi-lingual voice cloning (speak any language with cloned voice).

npx clawhub@latest install clonev
Speech & Transcription

Generate draft articles, outlines, and editorial content matching a distinctive analytical, skeptical voice with sharp critical commentary, conversational tone, and strategic humor.

npx clawhub@latest install critical-article-writer
Speech & Transcription

Give your agent a voice — and ears. The Cult of Carcinization is the bot-first gateway to ScrappyLabs TTS and STT. Speak with 20+ voices, design your own from a text description, transcribe audio to text, and evolve into a permanent bot identity. No human signup required.

npx clawhub@latest install cult-of-carcinization
Speech & Transcription

Generate speech audio using Deepdub and attach it as a MEDIA file (Telegram-compatible).

npx clawhub@latest install deepdub-tts
Speech & Transcription

— command-line interface for Deepgram speech-to-text.

npx clawhub@latest install deepgram
Speech & Transcription

DELLIGHT.AI is an AI startup in DIFC, Dubai.

npx clawhub@latest install dellight-cro-revenue-ops
Speech & Transcription

Real-time OCR and data extraction API by Veryfi (https://veryfi.com). Extract structured data from receipts, invoices, bank statements, W-9s, purchase orders, bills of lading, and any other document. Use when you need to OCR documents, extract fields, parse receipts/invoices, bank statements, classify documents, detect fraud, or get raw OCR text from any document.

npx clawhub@latest install documents-ai
Speech & Transcription

Text-to-Speech service using Doubao (Volcano Engine) API with 200+ voices, interactive voice selection, and multilingual support

npx clawhub@latest install doubao-api-open-tts
Speech & Transcription

Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS, Whisper transcription with diarization, and RVC voice conversion. Use when the user needs TTS, transcription, or voice conversion.

npx clawhub@latest install eachlabs-voice-audio
Speech & Transcription

Work with the easyVerein v2.0 REST API (members, contacts, events, invoices, bookings, custom fields, etc.). Use for full API coverage: endpoint discovery, auth, request/response schemas, and example cURL calls.

npx clawhub@latest install easyverein-api
Speech & Transcription

Create, manage, and deploy ElevenLabs conversational AI agents. Use when the user wants to work with voice agents, list their agents, create new ones, or manage agent configurations.

npx clawhub@latest install elevenlabs-agents
Speech & Transcription

ElevenLabs music generation.

npx clawhub@latest install clawdbotborges
Speech & Transcription

Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.

npx clawhub@latest install elevenlabs-transcribe
Speech & Transcription

ElevenLabs TTS - the best ElevenLabs integration for OpenClaw. ElevenLabs Text-to-Speech with emotional audio tags, ElevenLabs voice synthesis for WhatsApp, ElevenLabs multilingual support. Generate realistic AI voices using ElevenLabs API.

npx clawhub@latest install elevenlabs-tts
Speech & Transcription

High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.

npx clawhub@latest install elevenlabs-voices