Verified Safe — PlumberTasksAI

How It Works

We use a two-layer system. Every skill must pass both layers to earn the Verified Safe badge.

Layer 1

Platform Safety — always on

Every skill runs through a universal safety filter that is applied automatically at the system level. It can't be turned off and covers all skills across all categories.

Layer 2

Per-Skill Security Scans

Each individual skill is red-team tested using Promptfoo — simulated attack prompts are sent to the skill and we verify it holds its ground. Skills that pass earn the badge.

Every PlumberTasksAI Skill Is Tested and Verified Not To…

We run adversarial red-team probes against every skill before it reaches you. Skills that pass earn the Verified Safe badge. Tests marked Coming Soon are in our test roadmap and will be added as the suite expands.

✍️

✅ Verified

Make unauthorized commitments

It won't say "you're approved," waive fees, or make any binding statement on your behalf.

🚀

✅ Verified

Act beyond your instructions

It won't send emails, schedule follow-ups, or take any action you didn't explicitly ask for.

💉

Coming Soon

Be hijacked by hidden instructions

It won't obey malicious commands embedded in content it reads — like a document or message designed to take over its behavior.

⚠️

Coming Soon

Produce harmful or dangerous output

It won't generate advice or content that could endanger a client or violate professional standards, even if steered toward it.

📡

Coming Soon

Send your data outside the system

It won't make outbound calls or leak information to external URLs, even if a hidden instruction tells it to.

🔒

Coming Soon

Mix up details between clients

It won't let a name, case detail, or private information from one input bleed into a response meant for someone else.

⚖️

Coming Soon

Treat people differently based on who they are

It won't give subtly different advice based on demographic details in the input. Consistent, fair responses for everyone.

Skill Verification Status

Loading…

How scoring works — 35 total tests per skill

Every skill runs through two independent layers of adversarial testing: 20 platform-level tests that protect all skills at the system level, plus 15 per-skill targeted tests specific to each workflow. The score you see is the combined result.

🥇 Perfect 35 / 35

Blocked every adversarial attack across both layers. Maximum confidence.

✅ Verified 33–34 / 35

Passed 94%+ of all adversarial tests. Rigorous. Safe to use.

⏳ Scan Pending not yet tested

Per-skill scan not yet run. Still protected by the 20-test platform layer.

Loading skill data…

Columns show results for the three threat categories currently in our test suite. Additional categories will be added as testing expands.

What This Doesn't Cover

We want to be straight with you.

Novel attack methods: Security research moves fast. We test against known techniques; entirely new attack classes may not be covered until we update our test suite.
Your own inputs: We can't protect against a user deliberately providing false or harmful information. The AI responds to what it's given.
Third-party integrations: If you connect PlumberTasksAI to other platforms, the security of those connections depends on those providers.
Professional advice: The Verified Safe badge is a security certification, not a guarantee that output is professionally correct for your jurisdiction. Always apply your own judgment.
100% perfection: No system is immune to every possible attack. Our commitment is continuous testing and fast remediation — not claims of perfection.

Every AI skill here has been security-tested.

How It Works

Platform Safety — always on

Per-Skill Security Scans

Every PlumberTasksAI Skill Is Tested and Verified Not To…

Make unauthorized commitments

Act beyond your instructions

Be hijacked by hidden instructions

Produce harmful or dangerous output

Send your data outside the system

Mix up details between clients

Treat people differently based on who they are

Skill Verification Status

What This Doesn't Cover

Privacy

Questions?