AI Skills for Whisper
Discover 46+ Speech-to-text
Browse AI Skills for Whisper
sickn33 / audio-transcriber
Automates audio-to-text transcription, generating professional Markdown documentation and summaries for meetings and lectures.
openclaw / loom-workflow
Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.
openclaw / youtube-editor
Automates YouTube video editing with transcription, analysis, and thumbnail generation, enhancing content creation efficiency.
openclaw / walkie-talkie
Facilitates voice conversations on WhatsApp by transcribing audio and responding with TTS, enhancing user interaction.
openclaw / eachlabs-voice-audio
Facilitates text-to-speech, speech-to-text, and voice conversion using EachLabs AI models for enhanced audio processing.
openclaw / youtube-notification-analysis
Analyzes YouTube notifications for investment insights, extracting video subtitles and executing trades based on financial content.
openclaw / clips-machine
Transforms long videos into viral short clips with auto-detection of highlights and trendy captions for social media platforms.
openclaw / video-subtitles
Generates SRT subtitles from video/audio with translation support, enabling easy captioning and transcription for social media.
openclaw / whatsapp-voice-talk
Processes WhatsApp voice messages in real-time, transcribing and detecting intents to enable conversational interfaces.
openclaw / auto-whisper-safe
Enables RAM-safe voice transcription with auto-chunking for efficient processing on 16GB machines using OpenAI Whisper.
openclaw / Video Captions
Generates professional captions and subtitles with precise timing and styling for various video platforms.
openclaw / digital-human-training
Provides comprehensive guidance for training and deploying interactive digital humans, from voice cloning to real-time interaction.
openclaw / Audio
Enhances and converts audio files with noise removal, normalization, and transcription for podcasts and other workflows.
openclaw / Video
Processes and optimizes videos for various platforms, offering features like compression, captioning, and format conversion.
openclaw / OpenClaw Voice Skill
Facilitates voice conversations with AI using Whisper STT and ElevenLabs TTS, enabling audio recording and transcript management.
Dokhacgiakhoa / voice-ai-engine-development
Architects real-time Voice AI agents with low-latency communication, utilizing advanced speech processing and AI technologies.
Microck / Video Processor
Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.
majiayu000 / audio-transcribe
Transcribes audio and video to text using Whisper, supporting word-level timestamps for accurate subtitle generation.
majiayu000 / gastrohem-media-processor
Automates the processing of audio and image files from WhatsApp, providing transcription and OCR capabilities for efficient media management.
TerminalSkills / video-subtitles
Generates and burns subtitles into videos, transcribes audio, and converts subtitle formats using Whisper and FFmpeg.