NousResearch Hermes-Agent Prompt Injection Filter Bypass Vulnerability

Vulnerability

A prompt injection filter bypass vulnerability has been identified in NousResearch Hermes-Agent versions through 2026.4.30. The issue resides in the skills management functionality, specifically within the tools/skills_tool.py file. The vulnerability allows for injection attacks that can be executed remotely, by exploiting the filter's reliance on naive exact-string matching to detect malicious content in user-installed skills. This flaw enables crafted prompt injection payloads to pass undetected into the LLM agent context, potentially allowing attackers to manipulate the agent's behavior or extract sensitive information.

Impact

Exploitation of this vulnerability could lead to unauthorized prompt injection into the LLM agent's context, bypassing existing security filters. This could allow attackers to gain control over the agent's responses and behavior, potentially overriding safety constraints, accessing sensitive data, executing commands through the agent's shell tool, or manipulating the end user via crafted agent responses.

Reproduction

The vulnerability can be reproduced by creating a malicious skill that includes a prompt injection payload with extra words or synonyms, and then loading this skill through the 'skill_view()' function. The injection filter will fail to detect the payload, allowing it to be injected into the LLM agent's context.

Added: Jun 1, 2026, 4:24 AM
Updated: Jun 1, 2026, 4:24 AM

Vulnerability Rating

Custom Algorithm
spread
0.0
impact
0.6
exploitability
8.0
remediation
0.0
relevance
9.7
threat
6.4
urgency
2.9
incentive
0.0

Our algorithm analyzes dozens of metrics to generate these 8 key vulnerability categories, which are then combined to calculate the overall risk score.