All categories 27 skills

Speech & Transcription Skills & Plugins

Showing 27 curated AI tools in the Speech & Transcription category. Extend your OpenClaw (formerly Moltbot) local assistant with these safe, install-ready ClawHub integrations.

Speech & Transcription @dagmawibabi

addis-assistant-stt

Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the user needs to convert an audio file to text (specifically Amharic), or translate text between languages (e.g., Amharic to English). Requires 'x-api-key'.

npx clawhub@latest install addis-assistant-stt

Speech & Transcription @nerdsnipe

agent-voice

Command-line blogging platform for AI agents. Register, verify, and publish markdown posts to AI Agent Blogs (www.eggbrt.com). Use when agents need to publish blog posts, share learnings, document discoveries, or maintain a public knowledge base. Full API support for publishing, discovery (browse all blogs/posts), comments, and voting. Requires API key (stored in ~/.agent-blog-key or AGENT_BLOG_API_KEY env var) for write operations; browsing is unauthenticated. Complete OpenAPI 3.0 specification available.

npx clawhub@latest install agent-voice

Speech & Transcription @liekzejaws

akaunting

Interact with Akaunting open-source accounting software via REST API. Use for creating invoices, tracking income/expenses, managing accounts, and bookkeeping automation. Triggers on accounting, bookkeeping, invoicing, expenses, income tracking, or Akaunting mentions.

npx clawhub@latest install akaunting

Speech & Transcription @buddyh

alexa-cli

Control Amazon Alexa devices and smart home via the `alexacli` CLI.

npx clawhub@latest install alexa-cli

Speech & Transcription @odrobnik

announcer

Announce text throughout the house via AirPlay speakers using Airfoil + ElevenLabs TTS.

npx clawhub@latest install announcer

Speech & Transcription @tristanmanchester

assemblyai-transcribe

Transcribe audio/video with AssemblyAI (local upload or URL), plus subtitles + paragraph/sentence exports.

npx clawhub@latest install assemblyai-transcribe

Speech & Transcription @udiedrichsen

audio-gen

Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it to high-quality audio. Supports multiple formats (audiobook, podcast, educational), custom lengths, and voice effects. Use when asked to create audio content, make a podcast, generate an audiobook, or produce educational audio. Returns MP3 audio file via MEDIA token.

npx clawhub@latest install audio-gen

Speech & Transcription @matrixy

audio-reply

Generate audio replies using TTS. Trigger with "read it to me [public URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken response. Also responds to "speak", "say it", "voice reply".

npx clawhub@latest install audio-reply-skill

Speech & Transcription @neal-collab

auto-whisper-safe

RAM-safe voice transcription with auto-chunking — works on 16GB machines without crashes

npx clawhub@latest install auto-whisper-safe

Speech & Transcription @brianrwagner

brw-de-ai-ify

Remove AI-generated jargon and restore human voice to text.

npx clawhub@latest install brw-de-ai-ify

Speech & Transcription @hudeven

chichi-speech

A RESTful service for high-quality text-to-speech using Qwen3 and specialized voice cloning. Optimized for reusing a specific voice prompt to avoid re-computation.

npx clawhub@latest install chichi-speech

Speech & Transcription @community

claw-voice

You are connected to a live user session via voice.

npx clawhub@latest install niczy

Speech & Transcription @instant-picture

clonev

Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) Clone their voice or someone else's voice, (2) Generate speech that sounds like a specific person, (3) Create personalized voice messages, (4) Multi-lingual voice cloning (speak any language with cloned voice).

npx clawhub@latest install clonev

Speech & Transcription @tomstools11

critical-article-writer

Generate draft articles, outlines, and editorial content matching a distinctive analytical, skeptical voice with sharp critical commentary, conversational tone, and strategic humor.

npx clawhub@latest install critical-article-writer

Speech & Transcription @loserbcc

cult-of-carcinization

Give your agent a voice — and ears. The Cult of Carcinization is the bot-first gateway to ScrappyLabs TTS and STT. Speak with 20+ voices, design your own from a text description, transcribe audio to text, and evolve into a permanent bot identity. No human signup required.

npx clawhub@latest install cult-of-carcinization

Speech & Transcription @yuval-deepdub

deepdub-tts

Generate speech audio using Deepdub and attach it as a MEDIA file (Telegram-compatible).

npx clawhub@latest install deepdub-tts

Speech & Transcription @nerkn

deepgram

— command-line interface for Deepgram speech-to-text.

npx clawhub@latest install deepgram

Speech & Transcription @arthurelgindell

dellight-cro-revenue-ops

DELLIGHT.AI is an AI startup in DIFC, Dubai.

npx clawhub@latest install dellight-cro-revenue-ops

Speech & Transcription @dbirulia

documents-ai

Real-time OCR and data extraction API by Veryfi (https://veryfi.com). Extract structured data from receipts, invoices, bank statements, W-9s, purchase orders, bills of lading, and any other document. Use when you need to OCR documents, extract fields, parse receipts/invoices, bank statements, classify documents, detect fraud, or get raw OCR text from any document.

npx clawhub@latest install documents-ai

Speech & Transcription @xdrshjr

doubao-api-open-tts

Text-to-Speech service using Doubao (Volcano Engine) API with 200+ voices, interactive voice selection, and multilingual support

npx clawhub@latest install doubao-api-open-tts

Speech & Transcription @eftalyurtseven

eachlabs-voice-audio

Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS, Whisper transcription with diarization, and RVC voice conversion. Use when the user needs TTS, transcription, or voice conversion.

npx clawhub@latest install eachlabs-voice-audio

Speech & Transcription @truefoobar

easyverein-api

Work with the easyVerein v2.0 REST API (members, contacts, events, invoices, bookings, custom fields, etc.). Use for full API coverage: endpoint discovery, auth, request/response schemas, and example cURL calls.

npx clawhub@latest install easyverein-api

Speech & Transcription @pennyroyaltea

elevenlabs-agents

Create, manage, and deploy ElevenLabs conversational AI agents. Use when the user wants to work with voice agents, list their agents, create new ones, or manage agent configurations.

npx clawhub@latest install elevenlabs-agents

Speech & Transcription @community

elevenlabs-media

ElevenLabs music generation.

npx clawhub@latest install clawdbotborges

Speech & Transcription @paulasjes

elevenlabs-transcribe

Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.

npx clawhub@latest install elevenlabs-transcribe

Speech & Transcription @shaharsha

elevenlabs-tts

ElevenLabs TTS - the best ElevenLabs integration for OpenClaw. ElevenLabs Text-to-Speech with emotional audio tags, ElevenLabs voice synthesis for WhatsApp, ElevenLabs multilingual support. Generate realistic AI voices using ElevenLabs API.

npx clawhub@latest install elevenlabs-tts

Speech & Transcription @robbyczgw-cla

elevenlabs-voices

High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.

npx clawhub@latest install elevenlabs-voices