AIProxyGuard

LLM Security Proxy with Prompt Injection Detection.

What It Does

AIProxyGuard sits between your application and LLM providers to detect and block malicious inputs before they reach the model. Point your OpenAI/Anthropic SDK at the proxy instead of directly at the provider.

Quick Start

# Run the proxy
docker run -d -p 8080:8080 ghcr.io/ainvirion/aiproxyguard:latest

# Verify it's running
curl http://localhost:8080/healthz

Point your LLM client to the proxy:

from openai import OpenAI

client = OpenAI(
    api_key="sk-...",
    base_url="http://localhost:8080/openai/v1"
)

# Normal requests work as expected
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)

# Malicious requests are blocked
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Ignore all previous instructions..."}]
)
# Raises: BadRequestError - content_blocked

Detection-Only Mode

Use the /check endpoint to scan text without forwarding to an LLM:

curl -X POST http://localhost:8080/check \
  -H "Content-Type: application/json" \
  -d '{"text": "Ignore all previous instructions"}'

# Response:
# {"action": "block", "category": "prompt-injection", "signature_name": "Ignore instructions directive", "confidence": 0.9}

Features

Multi-Provider Routing - OpenAI, Anthropic, OpenRouter, Ollama
Detection-Only Mode - /check endpoint for pre-validation
Request & Response Scanning - Regex + heuristics detection
Policy Engine - Per-category actions (block/warn/log)
Rate Limiting - iptables-based DDoS protection
Prometheus Metrics - Full observability at /metrics
Control Plane - Fleet management, automatic signature sync

Detection Categories

Category	Description
`prompt-injection`	Instruction override attempts
`jailbreak`	DAN mode, persona exploits
`encoding-bypass`	Base64/hex/ROT13 obfuscation
`delimiter-injection`	JSON/XML structure attacks
`indirect-injection`	Tool abuse, plugin exploits
`unicode-evasion`	Homoglyphs, fullwidth chars
`role-manipulation`	Named character roleplay

Documentation

Full documentation at ainvirion.github.io/aiproxyguard

Control Plane

Connect to aiproxyguard.com for fleet management and automatic signature updates:

docker run -d -p 8080:8080 \
  -e AIPROXYGUARD_CONTROL_PLANE_ENABLED=true \
  -e AIPROXYGUARD_CONTROL_PLANE_API_KEY=your-api-key \
  ghcr.io/ainvirion/aiproxyguard:latest

Development

python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
pytest

Name		Name	Last commit message	Last commit date
Latest commit History 212 Commits
.github		.github
deploy		deploy
docs		docs
models/prompt-classifier-v1		models/prompt-classifier-v1
signatures		signatures
src/aiproxyguard		src/aiproxyguard
tests		tests
.clabot		.clabot
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
RELEASE.md		RELEASE.md
SECURITY.md		SECURITY.md
config.docker.yaml		config.docker.yaml
config.example.yaml		config.example.yaml
config.test.yaml		config.test.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AIProxyGuard

What It Does

Quick Start

Detection-Only Mode

Features

Detection Categories

Documentation

Control Plane

Development

License

About

Uh oh!

Releases 37

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AIProxyGuard

What It Does

Quick Start

Detection-Only Mode

Features

Detection Categories

Documentation

Control Plane

Development

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 37

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages