Skip to main content

AI Skills for Whisper

Discover 16+ Speech-to-text

Install any skill with /learn

/learn @owner/skill-name

Browse AI Skills for Whisper

openclaw openclaw / Voice to Report

996

project-managementwhisperopenaifrontendreact +2

openclaw openclaw / clawdbites

996

Extract recipes from Instagram reels. Use when a user sends an Instagram reel link and wants to get the recipe from the caption. Parses ingredients, instructions, and macros into a clean format.

claudemarketingwhisperffmpegfrontendremotion +1

openclaw openclaw / clips-machine

996

Transform long videos into viral short-form clips. Auto-detect best moments, add trendy captions, export for TikTok/Reels/Shorts. Self-contained, no external modules. 100% free tools.

content-mediaffmpegwhisperfrontendremotion

openclaw openclaw / universal-voice-agent

996

Real-time goal-oriented voice calling agent. Use when you need to make phone calls with a specific objective: place orders, make reservations, customer service, encouragement calls, or any conversational goal. Haiku runs the call in real-time with your voice (ElevenLabs), transcribes responses (G...

claudesalestwiliowhisperfrontenddesign

openclaw openclaw / loom-workflow

996

AI-native workflow analyzer for Loom recordings. Breaks down recorded business processes into structured, automatable workflows. Use when: - Analyzing Loom videos to understand workflows - Extracting steps, tools, and decision points from screen recordings - Generating Lobster workflow files from...

claudeoperationsffmpegwhisperfrontendremotion

openclaw openclaw / video-subtitles

996

Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.

content-mediawhisperffmpegfrontendremotion

openclaw openclaw / walkie-talkie

996

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

productivitywhatsappffmpeg

openclaw openclaw / walkie-talkie

996

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

productivitywhatsappffmpeg

openclaw openclaw / walkie-talkie

996

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

productivitywhatsappffmpeg

openclaw openclaw / walkie-talkie

996

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

productivitywhatsappffmpeg

openclaw openclaw / voice-ui

996

Self-evolving voice assistant UI. Talk to your AI, ask it to improve itself, and watch the code update in real-time.

claudedevelopmentwhisperfrontendgit

majiayu000 majiayu000 / ai-transcript-analyzer

80

Analyze transcript files using OpenAI API (gpt-5-mini) to extract insights, summaries, key topics, quotes, and action items. This skill should be used when users have transcript files (from WhisperKit, YouTube, podcasts, meetings, etc.) and want AI-powered analysis, summaries, or custom insights ...

data-analyticsopenaiwhisperfrontendplaywright +3

majiayu000 majiayu000 / autocut-shorts

80

Main orchestration skill for automatic creation of short-form content (TikTok, YouTube Shorts, Instagram Reels) from long videos. Fully automated workflow: download video, transcribe, detect highlights (transcript + laughter + sentiment + scenes), trim segments, resize to 9:16 portrait, and add s...

marketingffmpegwhisperfrontendreact +4

majiayu000 majiayu000 / create-movie

80

Orchestrated movie creation for Horus persona. Guides through phases: Research → Script → Build Tools → Generate → Assemble. Uses Docker-isolated coding environment, free/open-source tools only, with full memory integration.

claudemarketingdockerffmpegfrontenddesign +7

majiayu000 majiayu000 / faion-multimodal-ai

80

Multimodal AI: vision, image/video generation, speech-to-text, text-to-speech, voice synthesis.

claudedevelopmentopenaielevenlabsfrontendremotion +7

majiayu000 majiayu000 / audio-transcribe

80

使用 Whisper 将音频/视频转换为文字,支持词级别时间戳。Use when user wants to 语音转文字, 音频转文字, 视频转文字, 字幕生成, transcribe audio, speech to text, generate subtitles, 识别语音.

marketingwhisperfrontendremotion