Production utilities for Python LLM agents.
codagent is a small library for the patterns that every Python team
ends up writing when they ship LLM agents — regardless of which
framework they use:
- Composable node wrappers (retry, timeout, cache, structured output)
- Tool decorators (validation, circuit breaker, rate limit)
- Observability primitives (cost, step, trace)
- Behavior contracts (assumption surface, refusal patterns, citation enforcement, LLM-as-judge supervisors)
Works with LangGraph, LangChain, LlamaIndex, CrewAI,
AutoGen, DSPy, Pydantic AI, or raw OpenAI /
Anthropic clients. The core is framework-agnostic; framework-specific
adapters live in `codagent.integrations`.
Every Python team building production LLM agents writes the same glue:
- Retry on transient API errors
- Circuit-break on flaky tools
- Track tokens and cost
- Force the model to surface its assumptions
- Stop "should work" claims without evidence
- Refuse sensitive prompts with a structured block
- Run a domain supervisor agent over every response
Each of these is a few lines, but the few-lines-per-team adds up across
hundreds of repos. codagent packages them once.
```
pip install git+https://github.com/rlaope/codagent.git
```

**Python 3.14 note.** Editable installs (`pip install -e`) currently fail to import on Python 3.14 because setuptools writes `__editable__.codagent-*.pth`, and 3.14 skips `.pth` files whose names start with `_` as hidden. Use a regular install (`pip install /path/to/codagent`) on 3.14, or stay on Python 3.13 / 3.12 for editable workflows. Tracked upstream at pypa/setuptools.
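Concretely, on Python 3.14 (reusing the placeholder path from the note above):

```
pip install -e /path/to/codagent   # installs, but import fails: hidden .pth is skipped
pip install /path/to/codagent      # regular install works
```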
Optional integrations (install only what your stack needs):
```
pip install codagent[openai]        # OpenAI client wrap
pip install codagent[anthropic]     # Anthropic client wrap
pip install codagent[langchain]     # LangChain callback + Runnable
pip install codagent[langgraph]     # LangGraph node factories
pip install codagent[llamaindex]    # LlamaIndex callback handler
pip install codagent[pydantic-ai]   # Pydantic AI system-prompt helper
pip install codagent[crewai]        # CrewAI agent backstory adapter
pip install codagent[autogen]       # AutoGen system-message adapter
pip install codagent[dspy]          # DSPy module wrapper
pip install codagent[guardrails-ai] # wrap Guardrails.ai validators
pip install codagent[nemo]          # wrap NeMo Guardrails flows
```

```
codagent/
├── nodes/           composable node wrappers
│     with_retry, with_timeout, with_cache, parse_structured,
│     with_loop_guard
│
├── tools/           tool callable hardening
│     validated_tool, circuit_breaker, rate_limit
│
├── observability/   production tracking
│     CostTracker, BudgetCap, StepBudget, StateTracer
│
├── harness/         behavior contracts
│     Harness composer, 7 built-in contracts (Assumption Surface,
│     Verification Loop, Tool Call Surface, Refusal Pattern,
│     Citation Required, Faithfulness, MetaAgent Contract)
│
└── integrations/    Python LLM ecosystem adapters
      wrap_openai, wrap_anthropic, pydantic_ai_prompt,
      HarnessRunnable (LangChain), assumption_surface_node (LangGraph),
      HarnessLlamaIndexCallback, crewai/autogen/dspy stubs
```
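`with_timeout` and `parse_structured` appear in the tree but not in the examples below. A minimal sketch of how they might compose, assuming `with_timeout` takes a `seconds` argument and `parse_structured` takes a Pydantic model (both signatures are assumptions, not documented API):

```python
from pydantic import BaseModel

from codagent.nodes import parse_structured, with_timeout


class Plan(BaseModel):
    steps: list[str]


# Hypothetical wiring: parse the completion into a Plan, and bound the
# whole call at 30 seconds. Argument names here are assumptions.
planner = with_timeout(parse_structured(my_llm_call, model=Plan), seconds=30)
```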
Wrap raw OpenAI / Anthropic clients:

```python
from openai import OpenAI
from anthropic import Anthropic
from codagent.harness import AssumptionSurface, VerificationLoop
from codagent.integrations import wrap_openai, wrap_anthropic
oai = wrap_openai(OpenAI(), AssumptionSurface(), VerificationLoop())
ant = wrap_anthropic(Anthropic(), AssumptionSurface(), VerificationLoop())
# Both clients now inject the harness addendum into every call.
```
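The wrapped clients are meant to be drop-ins, so (assuming `wrap_openai` preserves the client's normal call surface, as the comment above implies) calls look unchanged:

```python
resp = oai.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Plan the database migration."}],
)
# resp is an ordinary completion; the harness addendum was injected upstream.
```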
LangChain:

```python
from langchain_openai import ChatOpenAI
from codagent.harness import Harness, AssumptionSurface
from codagent.integrations import HarnessRunnable
chain = HarnessRunnable(
    Harness.compose(AssumptionSurface(min_items=2)),
    ChatOpenAI(model="gpt-4o"),
)
```

LangGraph:

```python
from codagent.integrations import assumption_surface_node, verification_gate
graph.add_node("clarify", assumption_surface_node(my_llm, min_items=3))
graph.add_conditional_edges(
    "execute", verification_gate, {"verified": "done", "missing": "retry"}
)
```

Pydantic AI:

```python
from pydantic_ai import Agent
from codagent.harness import Harness, CitationRequired
from codagent.integrations import pydantic_ai_prompt
h = Harness.compose(CitationRequired(min_citations=1))
agent = Agent(
    "openai:gpt-4o",
    system_prompt=pydantic_ai_prompt(h, base="You are a research assistant."),
)
result = agent.run_sync("Summarize the latest in transformer scaling")
print(h.validate(str(result.data)))
```

LlamaIndex:

```python
from llama_index.core import Settings
from llama_index.core.callbacks import CallbackManager
from codagent.harness import Harness, AssumptionSurface
from codagent.integrations import HarnessLlamaIndexCallback
Settings.callback_manager = CallbackManager([
    HarnessLlamaIndexCallback(Harness.compose(AssumptionSurface()))
])
```

Node wrappers, tool decorators, and observability compose the same way:

```python
from codagent.nodes import with_retry, with_cache, with_loop_guard
from codagent.tools import circuit_breaker, rate_limit
from codagent.observability import BudgetCap, CostTracker, StepBudget
# Stack node wrappers
robust_node = with_retry(
    with_cache(my_llm_call, key_fn=lambda s: s["query"], ttl=300),
    attempts=3,
    on=(ConnectionError,),
)
# Decorate tools — and guard against agent thrashing
@circuit_breaker(failure_threshold=5, reset_after=60)
@rate_limit(per_second=10)
def _search_db(query: str): ...
search_db = with_loop_guard(_search_db, window=10, max_repeats=3)
# Track cost + bound steps + hard USD ceiling
budget = StepBudget(max_steps=20)
cost = CostTracker(model="gpt-4o")
cap = BudgetCap(tracker=cost, usd=2.0) # raises BudgetExceeded if a run blows past $2
with cost:
    result = run_my_agent(...)
    cap.check()  # call at safe boundaries, or route LLM calls through cap.record_call()
print(f"${cost.total_usd:.4f} over {cost.calls} calls")from codagent.harness import Harness, CitationRequired, FaithfulnessContract
faith = FaithfulnessContract(judge=my_llm_judge) # any callable str -> str
harness = Harness.compose(CitationRequired(), faith)
# In your retrieve node:
docs = retriever.search(query, k=3)
faith.set_context([d.text for d in docs])
# In validation:
result = harness.validate(answer)
# CitationRequired: regex check that [source: ...] markers exist
# FaithfulnessContract: judge confirms each factual claim is grounded in context
```

| Contract | Forces |
|---|---|
| AssumptionSurface | leading `Assumptions:` block when the request is ambiguous |
| VerificationLoop | evidence (test, output, diff) before any "done" claim |
| ToolCallSurface | explicit `ToolCall:` block before tool invocation |
| RefusalPattern | structured `Refusal:` block on sensitive keywords |
| CitationRequired | `[source: ...]` markers on factual claims |
| FaithfulnessContract | every claim grounded in retrieved RAG context (LLM-as-judge) |
| MetaAgentContract | LLM-as-judge validation by a supervisor agent |
Compose them: `Harness.compose(AssumptionSurface(), CitationRequired(), ...)`.
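For example, a minimal composition and validation pass (the no-argument `RefusalPattern()` constructor and the printable return value of `validate` are assumptions extrapolated from the snippets above):

```python
from codagent.harness import (
    AssumptionSurface,
    CitationRequired,
    Harness,
    RefusalPattern,
)

harness = Harness.compose(
    AssumptionSurface(min_items=2),     # demand surfaced assumptions
    RefusalPattern(),                   # assumed no-arg constructor
    CitationRequired(min_citations=1),  # at least one [source: ...] marker
)

answer = run_my_agent("Compare Postgres and MySQL for OLTP")  # your agent
print(harness.validate(answer))  # per-contract result, as in the examples above
```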
`examples/langgraph_full_stack.py` combines node wrappers, tool decorators, observability, and harness contracts in one runnable script (offline; no API keys required).
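Because it is offline, it should run as-is from a checkout:

```
python examples/langgraph_full_stack.py
```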
v0.5.0 alpha. Core is stable; framework adapters expand as the ecosystem evolves. v0.5.0 added three production guardrails (`BudgetCap`, `with_loop_guard`, `FaithfulnessContract`) that fill gaps LangGraph leaves to the developer; see the CHANGELOG for the rationale and survey.
MIT — see LICENSE.