AI Skills for FFmpeg
Discover 279+ Video/audio processing
Browse AI Skills for FFmpeg
affaan-m / video-editing
Streamlines video editing workflows using AI for efficient cutting, structuring, and enhancing of real footage.
sickn33 / audio-transcriber
Automates audio-to-text transcription, generating professional Markdown documentation and summaries for meetings and lectures.
openclaw / voice-note-to-midi
Converts voice notes and melodic recordings into quantized MIDI files using machine learning for pitch detection and intelligent processing.
openclaw / video-ad-analyzer
Analyzes video ads using Gemini Vision AI for frame extraction, OCR, audio transcription, and scene analysis.
openclaw / loom-workflow
Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.
openclaw / ffmpeg-master
Facilitates advanced video and audio processing tasks using FFmpeg for transcoding, filtering, and metadata manipulation.
openclaw / veo3-video-gen
Generates and stitches short videos using Google Veo 3.x and the Gemini API for ads and product demos with a reproducible CLI workflow.
openclaw / yt-to-blog
Transforms YouTube videos into a comprehensive content suite, including blog posts, social media threads, and video clips.
openclaw / voice-reply
Enables offline text-to-speech responses using local Piper voices, perfect for generating voice replies in multiple languages.
openclaw / veo3-gen
Generates and stitches short videos using Google Veo 3.x and the Gemini API for ads and product demos with a CLI workflow.
openclaw / remotion-excalidraw-tts
Creates narrated videos from Excalidraw diagrams using TTS and Remotion, ideal for engaging explainer content.
openclaw / ugc-manual
Generates lip-sync videos by combining an image with a user's audio recording, preserving exact audio timing for personalized content.
openclaw / youtube-video-analyzer
Analyzes YouTube videos by synchronizing audio transcripts with visual frames for detailed step-by-step guides.
openclaw / sergei-mikhailov-stt
Converts voice messages to text using Yandex SpeechKit, enabling seamless audio transcription in OpenClaw-connected applications.
openclaw / material-report
Analyzes ad videos to generate detailed markdown reports, frameworks, and storyboards for improved performance in advertising.
openclaw / zhipu-asr
Transcribes Chinese audio files to text using Zhipu AI's GLM-ASR model, enhancing accuracy with context prompts and custom hotwords.
openclaw / clawcut
Generates AI-powered short videos from topics or reference videos, perfect for social media and e-commerce content creation.
openclaw / video-analyzer
Analyzes video content by extracting frames at regular intervals, aiding in scene understanding and content review.
openclaw / elevenlabs-transcribe
Transcribes audio to text with ElevenLabs Scribe, supporting batch and real-time transcription for various audio formats.
openclaw / discord-voice
Enables real-time voice conversations in Discord using Claude AI for transcription and text-to-speech capabilities.