Plan Forge

The AI-Native SDLC Forge Shop

Plan Forge is the orchestration harness that sits on top of GitHub Copilot (and other AI coding tools). It does not replace your model or your IDE — it adds the SDLC layer GitHub deliberately leaves to the ecosystem: planning, validation gates, memory, cost control, and reviewer separation.

It is also licensed MIT because your SDLC is yours, and your institutional memory lives in OpenBrain — a user-owned service — because your accumulated decisions should not be trapped inside any one AI vendor.

A blacksmith doesn't hand raw iron to a customer. They smelt it, hammer it, temper it — and then they watch, because a blade that isn't maintained will dull.

Plan Forge is a full-lifecycle AI development shop. Raw ideas are smelted through the Crucible into structured plans. Plans are forged into working code through a 7-step hardened pipeline. Shipped code is guarded by LiveGuard — drift, secrets, dependencies, incidents, all watched in real time. And every finding is learned back into the shop's memory, so the next run starts smarter than the last.

Smelt the idea. Forge the code. Guard the build. Learn from every run.

Website · Shop Tour · Manual · Documentation · FAQ · Extensions · Spec Kit Interop

105 MCP Tools · 48+ CLI Commands · 103+ REST Endpoints · ~21 Agents · 4 Reviewer Skills · 9 Presets · 7 Adapters · 7947 Tests · v3.20.0-dev

The Four Stations

Plan Forge is one shop with four stations. Each one handles a distinct part of the software lifecycle — and they all share one memory.

Station	Verb	What happens here	Start with
🪨 Smelt	Smelt the idea	Rough idea → Crucible interview → tempered plan with scope contract and validation gates.	Crucible · Tempering design notes
🔨 Forge	Forge the code	Hardened plan → DAG-scheduled execution → quorum consensus → shipped code. The classic 7-step pipeline.	Your First Plan · AI Plan Hardening Runbook
🛡️ Guard	Guard the build	Shipped code → drift scoring, secret scan, dep watch, regression guard, incident capture, remote alerts.	What is LiveGuard? · LiveGuard Tools
🧠 Learn	Learn from every run	Findings → Hallmark provenance-stamped OpenBrain memory → capability-negotiated OpenBrain writes → Health DNA → self-tuned escalation, cost, and quorum thresholds. Anvil deduplicates high-frequency writes; Lattice indexes code structure for blast-radius scoring.	Memory Architecture · Bug Registry + Testbed

And the control room that ties them together: the live dashboard at localhost:3100/dashboard with 25 tabs, session replay, WebSocket event hub, cost reports, OTLP traces, and a remote bridge for Telegram / Slack / Discord / OpenClaw.

Start Here

You are...	Start with
Evaluating Plan Forge	Read the Shop Tour → Skim capabilities
A developer using VS Code + Copilot	Run Quick Start → Read COPILOT-VSCODE-GUIDE.md
An AI agent setting up a project	Read AGENT-SETUP.md (your entry point)
Already shipping and want the watch layer	Jump to LiveGuard — runs standalone on any codebase
Just browsing	Keep reading — or visit planforge.software

Verified Results

Verified: 38+ phases self-built, 7947 self-tests, 105 MCP tools, zero manual rollbacks. See docs/capabilities.md.

A/B Test Results (April 2026)

Same app, same model (Claude Opus 4.6), same time (~7 min). Only difference: Run A had Plan Forge.

Metric	Plan Forge	Vibe Coding
Tests	60	13
Interfaces	6	0
DTOs	9	0
Quality Score	99/100	44/100

Read the full results →

How the Stations Work Together

The four stations form a closed loop: Smelt intake → Forge builds → Guard watches → Learn captures findings into memory → next Smelt starts smarter. Inside the Forge, a deterministic slice executor runs the plan; a reflective inner loop — retries with reflexion context, trajectory capture, auto-skill promotion, and postmortems — turns every slice into a research step that teaches the next one. Together, the ten opt-in inner-loop subsystems compose into a self-deterministic agent loop that keeps execution reproducible while the loop's context improves each pass.

For architecture diagrams, the 7-step pipeline mermaid, the LiveGuard cycle, and the self-improving feedback loops, see the Shop Tour and the interactive manual.

The Virtual Engineering Team

Plan Forge is not "AI plus a code-completion plugin." It is a full enterprise engineering shop where every traditional role is filled by a specialized agent or guardrail, governed by 40 years of software engineering practice encoded into 17+ auto-loading instruction files and 20 specialized reviewers.

Traditional engineering team role	Plan Forge equivalent
Product Owner (spec, direction, acceptance)	You — non-negotiable, non-replaceable
IC engineers (architecture, security, performance, DB, deploy, API, accessibility, multi-tenancy, CI/CD, observability, dependency, compliance)	20 specialized agents + 17 auto-loading guardrail files
Tech lead / staff engineer	Quorum mode (multi-model consensus) + auto-escalation on slice failure
Engineering manager	Forge-Master Observer + Auditor — agents supervising agents, not metaphorically, literally
QA team	Tempering harness + testbed + regression guard + slice gates
SRE / on-call	LiveGuard (secret scan, drift, dep watch, env diff, incident capture, runbooks)
Continuous improvement / retros	Audit loop + bug registry auto-smelt + Reflexion lessons + auto-skill promotion
Architecture review board	Independent Session-3 review gate (fresh AI session, full guardrail load)
Institutional knowledge / wiki	OpenBrain L3 memory with Hallmark provenance
Release manager	Shipper agent + release-checklist + version.instructions.md

Your three jobs as the human:

Make a few technology choices once — pick a preset (dotnet, typescript, python, etc.), declare Project Principles, set forbidden patterns. One-time, then locked.
Own the spec and the direction — Product Owner duties. Crucible interviews you; you decide what to build and why. The shop won't deviate from the Scope Contract you sign off on.
Run manual acceptance testing — agents don't read minds. They implement what the spec says, not what you intended. A green pipeline (all tests pass, no drift, no secrets, no regressions, Forge-Master independently audited) proves the code is correct. Only you can decide whether it's what you actually wanted. That's Product Owner UAT, same as it has been for 40 years.

Three closed loops, not one. The Forge builds. Forge-Master supervises. The Learn station feeds every finding back into the next plan. The audit loop ran for two weeks unattended on a real production Next.js site and surfaced 30+ defects the maintainer didn't know existed — see Day in the Forge. Skeptical CTOs: this is the part that distinguishes Plan Forge from "an AI that writes code."

Quick Start

Prerequisites

VS Code with GitHub Copilot (free, Pro, or Enterprise)
Git installed

1. Clone and Run Setup

git clone https://github.com/srnichols/plan-forge.git my-project-plans
cd my-project-plans

# Windows (PowerShell)
.\setup.ps1 -Preset dotnet          # or: typescript, python, java, go, swift, rust, php, azure-iac

# Mac / Linux
./setup.sh --preset dotnet

Setup copies all framework files, installs MCP dependencies, and generates config. Zero manual steps.

2. Start Planning

Open VS Code → Copilot Chat → Agent Mode
Describe your feature → the pipeline guides you through 7 steps
LiveGuard watches automatically after you ship

3. (Recommended) Enable Persistent Memory

Plan Forge ships with L1 (Hub) + L2 (.forge/*.jsonl) memory built in. The L3 layer — cross-session, cross-tool, semantic-search memory that powers Reflexion lessons, Auto-skills, cross-project Federation, and 28 auto-capturing MCP tools — requires OpenBrain, a self-hosted MCP server (PostgreSQL + pgvector). Plan Forge works without it (every hook degrades silently), but the inner loop only improves over time when L3 is present.

Pick the path that fits:

Path	Time	Cost	Best for
Docker Compose	~5 min	Free	Local dev, single machine
Supabase Cloud	~10 min	~$0.10–$0.30 / mo	Solo / small team, zero ops
Kubernetes / Azure Container Apps	~30 min	Cloud rates	Teams, federation across repos
Skip for now	0 min	—	Try Plan Forge first; enable later with `pforge brain hint`

Full walkthrough: srnichols.github.io/OpenBrain. Already running OpenBrain? pforge brain status confirms Plan Forge sees it.

See docs/CLI-GUIDE.md for all presets, flags, and multi-agent options.

What's Included

9 Tech-Stack Presets

Preset	Stack	Preset	Stack
`dotnet`	.NET / C# / ASP.NET Core	`swift`	Swift / SwiftUI / Vapor
`typescript`	TypeScript / React / Node	`rust`	Rust / Axum / Tokio
`python`	Python / FastAPI / Django	`php`	PHP / Laravel / Symfony
`java`	Java / Spring Boot	`azure-iac`	Bicep / Terraform / azd
`go`	Go / Chi / Gin

7 AI Agent Adapters

One setup command, every tool: setup.ps1 -Agent all

GitHub Copilot (primary) · Claude Code · Cursor · Codex CLI · Gemini CLI · Windsurf · Generic

MCP Server (105 Tools)

pforge-mcp/server.mjs exposes core, LiveGuard, Watcher, Crucible, Tempering, Bug Registry, Testbed, Forge-Master, Hallmark, Anvil, Lattice, Sync, and Memory operations. Live dashboard at localhost:3100/dashboard. 103+ REST endpoints for external integrations.

Key tools: forge_run_plan · forge_liveguard_run · forge_analyze · forge_master_ask · forge_capabilities · forge_smith · forge_cost_report · forge_lattice_query · forge_sync_memories · forge_memory_capture

Optional Capabilities

Feature	How to Enable	What It Does
Quorum mode	Automatic (complexity ≥ 6)	3 models analyze in parallel, reviewer synthesizes. Works OAuth-only via the Copilot CLI — no API keys required. Add `XAI_API_KEY` to mix in a Grok leg. Self-tuning threshold.
Audit Loop	`pforge audit-loop` or `.forge.json#audit`	Closed-loop drain: content-audit scanner → triage → fix. Default off; opt-in via `audit.mode: "auto"` or `"always"`.
Auto-escalation	Built-in	Model fails → auto-promotes. Chain reorders by success rate.
Cost tracking	Built-in	Per-slice tokens, 23-model pricing, `--estimate` with historical calibration.
OpenBrain memory (L3)	Recommended — see Quick Start § 3	The L3 memory layer. 28 tools auto-capture findings with Hallmark provenance stamps via capability-negotiated OpenBrain writes. Anvil deduplicates Δ-identical writes. 4 prompts search before acting. Without it, Reflexion, Auto-skills, and Federation are inert.
Extensions	`pforge ext add <name>`	HIPAA, SaaS multi-tenancy, etc.
CI validation	`srnichols/plan-forge-validate@v1`	GitHub Action for plan quality gates.
Notifications	Configure in `.forge.json`	Slack, Discord, Telegram, webhooks via bridge.
Spec Kit bridge	Auto-detected	Import specs + constitution from Spec Kit projects.

Documentation

Resource	Purpose
docs/COPILOT-VSCODE-GUIDE.md	VS Code + Copilot walkthrough
docs/CLI-GUIDE.md	`pforge` CLI reference
docs/REST-API.md	All 103 REST endpoints, organized by domain
docs/capabilities.md	Full feature reference — all 105 MCP tools, ~21 agents, 4 reviewer skills, 103+ REST endpoints
CUSTOMIZATION.md	Adapt guardrails for your project
planforge.software/manual/	Interactive web manual (30 chapters + 14 appendices)
planforge.software/faq.html	FAQ
AGENT-SETUP.md	AI agent entry point
CONTRIBUTING.md · docs/RELEASE-CHECKLIST.md	Maintainer guide · release & distribution-sync checklist

Git Workflow

git commit -m "<type>(<scope>): <description>"   # feat, fix, refactor, test, docs, chore

See .github/instructions/git-workflow.instructions.md for conventions.

Contributing

See CONTRIBUTING.md. For extensions: extensions/PUBLISHING.md. For skills: docs/SKILL-BLUEPRINT.md.

License

MIT — use these guardrails in your projects, teams, and tools.

Name		Name	Last commit message	Last commit date
Latest commit History 1,863 Commits
.forge		.forge
.github		.github
docs		docs
extensions		extensions
pforge-master		pforge-master
pforge-mcp		pforge-mcp
pforge-sdk		pforge-sdk
presets		presets
scripts		scripts
templates		templates
.check-manual-links-baseline.json		.check-manual-links-baseline.json
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
2		2
AGENT-SETUP.md		AGENT-SETUP.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
CUSTOMIZATION.md		CUSTOMIZATION.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
VERSION		VERSION
action.yml		action.yml
llms.txt		llms.txt
package-lock.json		package-lock.json
package.json		package.json
pforge		pforge
pforge.ps1		pforge.ps1
pforge.sh		pforge.sh
plugin.json		plugin.json
server.mjs		server.mjs
setup.ps1		setup.ps1
setup.sh		setup.sh
tailwind.config.cjs		tailwind.config.cjs
validate-setup.ps1		validate-setup.ps1
validate-setup.sh		validate-setup.sh
vitest.config.mjs		vitest.config.mjs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Plan Forge

The AI-Native SDLC Forge Shop

The Four Stations

Start Here

Verified Results

A/B Test Results (April 2026)

How the Stations Work Together

The Virtual Engineering Team

Quick Start

Prerequisites

1. Clone and Run Setup

2. Start Planning

3. (Recommended) Enable Persistent Memory

What's Included

9 Tech-Stack Presets

7 AI Agent Adapters

MCP Server (105 Tools)

Optional Capabilities

Documentation

Git Workflow

Contributing

License

About

Uh oh!

Releases 93

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Plan Forge

The AI-Native SDLC Forge Shop

The Four Stations

Start Here

Verified Results

A/B Test Results (April 2026)

How the Stations Work Together

The Virtual Engineering Team

Quick Start

Prerequisites

1. Clone and Run Setup

2. Start Planning

3. (Recommended) Enable Persistent Memory

What's Included

9 Tech-Stack Presets

7 AI Agent Adapters

MCP Server (105 Tools)

Optional Capabilities

Documentation

Git Workflow

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 93

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages