Skip to main content

AI Skills for Whisper

Discover 46+ Speech-to-text

Installation guide →

Browse AI Skills for Whisper

sickn33 sickn33 / daily

37.5K

Provides a comprehensive reference for building real-time voice and multimodal AI applications using Daily, enabling seamless integration of AI services.

openclaw
75
100

sickn33 sickn33 / audio-transcriber

21.5K

Automates audio-to-text transcription, generating professional Markdown documentation and summaries for meetings and lectures.

claudecopilot
83
69

nicepkg nicepkg / transcribe-and-analyze

190

Transcribes audio and video from URLs using WhisperKit and analyzes transcripts with AI upon request.

openclaw
92
94

aiskillstore aiskillstore / video-processor

278

Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

Dokhacgiakhoa Dokhacgiakhoa / voice-ai-engine-development

374

Architects real-time Voice AI agents with low-latency communication, utilizing advanced speech processing and AI technologies.

openclaw
67
100

Microck Microck / Video Processor

133

Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

aiskillstore aiskillstore / audio-transcriber

278

Transforms audio recordings into structured Markdown documentation with intelligent summaries and speaker identification.

github-copilotclaude-code
67
69

majiayu000 majiayu000 / audio-transcribe

106

Transcribes audio and video to text using Whisper, supporting word-level timestamps for accurate subtitle generation.

openclaw
83
100

majiayu000 majiayu000 / gastrohem-media-processor

106

Automates the processing of audio and image files from WhatsApp, providing transcription and OCR capabilities for efficient media management.

openclaw
83
100

GeorgeDoors888 GeorgeDoors888 / bilibili-transcript

2

Transcribes Bilibili videos to text with high accuracy, providing detailed summaries and formatted transcripts in multiple languages.

100
97

majiayu000 majiayu000 / create-movie

106

Facilitates comprehensive movie creation through a structured workflow, utilizing AI tools for research, scripting, and assembly.

openclaw
75
100

GeorgeDoors888 GeorgeDoors888 / expression-coach

2

Enhances personal expression skills through voice practice, real-time feedback, and data analysis for effective communication.

openclaw
92
88

mattnigh mattnigh / Video Processor

22

Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

mattnigh mattnigh / gastrohem-media-processor

22

Automates the processing of audio and image files from WhatsApp, providing transcription and OCR capabilities for efficient media management.

openclaw
83
100

majiayu000 majiayu000 / faion-multimodal-ai

106

Facilitates multimodal AI applications including image/video generation and speech synthesis for diverse use cases.

openclaw
67
99

alsk1992 alsk1992 / voice

53

Enables voice recognition and control for trading applications, enhancing user interaction through wake words and speech commands.

openclaw
75
70

Activer007 Activer007 / Video Processor

7

Processes video files for audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

majiayu000 majiayu000 / groq-inference

2

Enables ultra-fast LLM inference using the GROQ API for real-time applications in chat, vision, and audio processing.

openclaw
83
98

diegosouzapw diegosouzapw / audio-transcriber

2

Transforms audio recordings into structured Markdown documentation with intelligent summaries and speaker identification.

openclaw
83
69

MudassarAbrar MudassarAbrar / audio-transcriber

1

Transforms audio recordings into structured Markdown documentation with intelligent summaries and speaker identification.

openclaw
83
69