Skip to main content

AI Skills for Whisper

Discover 46+ Speech-to-text

Installation guide →

Browse AI Skills for Whisper

sickn33 sickn33 / audio-transcriber

21.5K

Automates audio-to-text transcription, generating professional Markdown documentation and summaries for meetings and lectures.

openclaw
83
69

openclaw openclaw / loom-workflow

2.2K

Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.

openclaw
92
98

openclaw openclaw / youtube-editor

2.2K

Automates YouTube video editing with transcription, analysis, and thumbnail generation, enhancing content creation efficiency.

openclaw
75
99

openclaw openclaw / walkie-talkie

2.2K

Facilitates voice conversations on WhatsApp by transcribing audio and responding with TTS, enhancing user interaction.

75
100

openclaw openclaw / eachlabs-voice-audio

2.2K

Facilitates text-to-speech, speech-to-text, and voice conversion using EachLabs AI models for enhanced audio processing.

openclaw
75
53

openclaw openclaw / youtube-notification-analysis

2.2K

Analyzes YouTube notifications for investment insights, extracting video subtitles and executing trades based on financial content.

openclaw
75
97

openclaw openclaw / clips-machine

2.2K

Transforms long videos into viral short clips with auto-detection of highlights and trendy captions for social media platforms.

openclaw
67
89

openclaw openclaw / video-subtitles

2.2K

Generates SRT subtitles from video/audio with translation support, enabling easy captioning and transcription for social media.

67
100

openclaw openclaw / whatsapp-voice-talk

2.2K

Processes WhatsApp voice messages in real-time, transcribing and detecting intents to enable conversational interfaces.

openclaw
67
89

openclaw openclaw / auto-whisper-safe

2.2K

Enables RAM-safe voice transcription with auto-chunking for efficient processing on 16GB machines using OpenAI Whisper.

openclaw
67
100

openclaw openclaw / Video Captions

2.2K

Generates professional captions and subtitles with precise timing and styling for various video platforms.

openclaw
67
99

openclaw openclaw / digital-human-training

2.2K

Provides comprehensive guidance for training and deploying interactive digital humans, from voice cloning to real-time interaction.

openclaw
67
100

openclaw openclaw / Audio

2.2K

Enhances and converts audio files with noise removal, normalization, and transcription for podcasts and other workflows.

openclaw
58
100

openclaw openclaw / Video

2.2K

Processes and optimizes videos for various platforms, offering features like compression, captioning, and format conversion.

openclaw
58
100

openclaw openclaw / OpenClaw Voice Skill

2.2K

Facilitates voice conversations with AI using Whisper STT and ElevenLabs TTS, enabling audio recording and transcript management.

openclaw
0
95

Dokhacgiakhoa Dokhacgiakhoa / voice-ai-engine-development

374

Architects real-time Voice AI agents with low-latency communication, utilizing advanced speech processing and AI technologies.

openclaw
67
100

Microck Microck / Video Processor

133

Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

majiayu000 majiayu000 / audio-transcribe

106

Transcribes audio and video to text using Whisper, supporting word-level timestamps for accurate subtitle generation.

openclaw
83
100

majiayu000 majiayu000 / gastrohem-media-processor

106

Automates the processing of audio and image files from WhatsApp, providing transcription and OCR capabilities for efficient media management.

openclaw
83
100

TerminalSkills TerminalSkills / video-subtitles

10

Generates and burns subtitles into videos, transcribes audio, and converts subtitle formats using Whisper and FFmpeg.

openclawclaude-code
92
95