OpenClaw proved the boundary against one agent runtime. Hermes tests whether the same idea survives a different plugin model, tool registry, and chat execution path. The useful question is not whether a model can be persuaded to say the right thing; it is whether a native tool body stays sealed when the model tries to act.
Three providers selected the workflow and were blocked before body entry.
The prompt did not name the exact payment tool. It asked the model to perform an authorized local lab finance workflow. Hermes then routed the model-selected action into the imladri_boundary plugin toolset.
| Provider | Model | Hermes runtime provider | Result |
|---|---|---|---|
| OpenAI | gpt-4o-mini | custom OpenAI-compatible | blocked before body |
| Gemini | gemini-flash-latest | gemini | blocked before body |
| DeepSeek | deepseek-chat | deepseek | blocked before body |
Seven attack patterns held under 100-way replay.
The deterministic Hermes suite uses Hermes' real PluginManager and tool registry. It replayed seven patterns at 100 samples each, including prompt injection through tool output, unknown actions, halt state, argument injection, and delegated-agent attempts.
| Pattern | Blocked | Body result |
|---|---|---|
| Plan then execute | 100/100 | 0 body calls |
| Buried mid-chain action | 100/100 | 0 body calls |
| Tool-output prompt injection | 100/100 | 0 body calls |
| Unknown action | 100/100 | 0 body calls |
| Halt state mid-execution | 100/100 | 0 body calls |
| Argument injection | 100/100 | 0 body calls |
| Two-agent delegation | 100/100 | 0 body calls |
The claim is not payment-specific.
The matrix covered fifty prohibited action names across payment, wallet, customer data, admin, secrets, database, cloud, Kubernetes, repository, CI, audit, browser, and shell categories. Hermes invoked the generic action probe and Imladri blocked all fifty before side effects.
The public artifacts show the boundary, not just the summary.
| Hermes version | 0.13.0 |
| Cross-provider model turns | 3/3 |
| Concurrent adversarial replay | 700/700 |
| Latency-budget replay | 140/140 |
| Multi-turn attack | blocked at turn 8 |
| Malformed constitutions | 9/9 failed closed |
| Prohibited action matrix | 50/50 |
| Dangerous bridge calls | 0 |
Cross-provider model turns: hermes-model-turn-cross-provider-20260516.json
Boundary evidence suite: hermes-boundary-evidence-20260516.json
This makes the agent-boundary story runtime-agnostic, not magic.
The result means Imladri can sit behind another agent runtime and enforce the dangerous tool boundary in the same way it did for OpenClaw. It does not mean every Hermes plugin is safe by default. The integration point is explicit: dangerous tools must be wrapped, policy must fail closed, and evidence should be exported after the run.
