OpenPrompts

Security

We aggregate prompts and skills from the wild, so every entry runs through an automated safety scan loosely aligned to the OWASP Agentic Skills Top 10.

2,007
Scanned
1,988
No flags
19
Flagged for review

What the scan does

It is a heuristic text scan, run at build time over every entry's content. It flags entries that contain patterns worth a human review — prompt-injection phrasing, jailbreak or safety-bypass language, data-exfiltration instructions, embedded secrets, and unsafe code. Flagged entries show a notice on their page.

It is not a security guarantee. False positives are expected: legitimate coding and security prompts often mention these terms (a jailbreak detector, for example, will match). Always review a prompt before running it against your own systems.

Coverage against the OWASP Top 10

A static text catalog can only detect text-based risks. Runtime risks (sandboxing, privileges, supply chain) are addressed by our curation policy: we only ingest permissively-licensed or vendor-published sources, keep each entry's origin and license, and exclude leaked content.

Credential leakage

Our credential checks are informed by Chen et al., How Your Credentials Are Leaked by LLM Agent Skills (2026), which found that most skill credential leaks come from secrets written to debug output that agent frameworks feed back into the model. We flag both hardcoded secrets and credentials printed to logs.

IDRiskOur coverage
AST01Malicious SkillsAutomated scan
AST02Supply Chain CompromiseCuration policy
AST03Over-Privileged SkillsCuration policy
AST04Insecure MetadataAutomated scan
AST05Unsafe DeserializationAutomated scan
AST06Weak IsolationCuration policy
AST07Update DriftCuration policy
AST08Poor ScanningAutomated scan
AST09No GovernanceCuration policy
AST10Cross-Platform ReuseCuration policy

Found something the scan missed, or a false positive? Browse the catalog and open an issue on the source repository linked from each entry.