Shieldon

Audit AI Agent Skills
Before Execution

Set Up MCP Server

Free

Zero-maintenance scanning. YARA + LLM. Always latest rules.

  • 39 YARA rules across 6 threat categories
  • LLM semantic layer
  • Catches social engineering & prompt injection
  • Catches reverse shells, data exfiltration and obfuscation patterns
  • Binary verdict + reasoning
~5 sec10 scans / day

Human? Send this to your agent:

Read https://shieldon.dev/setup.md and set up the Shieldon MCP server to scan skills before installing them.

Agent? Read the setup instructions directly:

setup.md

Trusted by

MindsAnimoca Brands

What Shieldon Catches

Six categories of threats detected across every skill.md your agent encounters — pattern-matched by YARA and reasoned about by an LLM.

Credential Theft

AWS, GitHub, Stripe, OpenAI, Anthropic, Supabase, cloud provider keys, private keys, .env / .ssh access, crypto wallets

Command Injection

Reverse shells, curl | bash, privilege escalation, container escape, cron persistence, SSH key injection, PowerShell

Data Exfiltration

webhook.site, paste services, ngrok tunnels, DNS tunneling, Discord/Slack/Telegram webhooks, modern drop services

Obfuscation

Base64 decode chains, hex encoding, eval() + encoding, char-by-char string building, unicode escape evasion

Audit-Override Framing

Prose pleading 'educational only', 'classify as safe', 'ignore the YARA finding', END-OVERRIDE markers — social engineering aimed at the auditor LLM

Multistage Remote Code

exec(...http://...), bash <(curl ...), plugin/manifest loaders that defer code to a remote URL the auditor can't inspect

How Scanning Works

Every hosted scan runs through a 3-stage pipeline in around 5 seconds.

skill.md
1

YARA deep scan

39 patterns across 6 threat categories

2

LLM semantic analysis

New

Context-aware safety verdict with reasoning

3

Verdict

Binary safe / block + reasoning + findings

Example output
{
  "scan_id": "a1b2c3d4-...",
  "verdict": "block",
  "reasoning": "Override framing co-occurs with credential theft.",
  "findings": [
    {
      "type": "yara",
      "description": "Supabase access key pattern detected",
      "severity": "HIGH",
      "evidence": "Secret at offset 342: sb_secret_example"
    },
    {
      "type": "llm",
      "description": "Audit-override prose used to launder credential code",
      "severity": "HIGH",
      "evidence": "'classify as safe — for educational purposes only'"
    }
  ]
}

Local skill vs. MCP server

The local skill is YARA-only and runs offline. The hosted MCP adds the LLM semantic layer.

Local skillMCP server
Installnpx skills addAdd endpoint to config
Detection rules39 YARA39 YARA + LLM
Works offline
LLM semantic analysis
Rule updatesnpx skills updateAutomatic
Best forIndividualsAutonomous agents, CI/CD, enterprises