Skip to main content

content-moderation

Two-layer content safety for agent input and output. Use when (1) a user message attempts to override, ignore, or bypass previous instructions (prompt injection), (2) a user message references system prompts, hidden instructions, or internal configuration, (3) receiving messages from untrusted us...

Install this skill

content-moderation1 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!

Install this skill with one command

/learn @openclaw/detect-injection
GitHub Stars 996
Rate this skill
Categorydevops
UpdatedFebruary 16, 2026
openclaw/skills