feat(agent-ready): add session-startup, DoD, and JSON-ledger guidance #38
Open
dgalarza wants to merge 1 commit into
Three additions to the AGENTS.md template drawn from Anthropic's harness design articles, addressing failure modes that the existing scaffold did not cover:

- Session Startup: a bearing-getting ritual (pwd, git log, smoke test) agents run on a fresh context. Prevents the "look around, declare done" failure mode where an agent piles new work on top of an existing broken state.
- Definition of Done: codifies end-to-end verification before marking work complete. Addresses the most common over-confidence failure -- agents shipping changes based on "unit tests pass" without exercising the feature end-to-end.
- JSON over Markdown for ledgers: short note in Template Notes capturing Anthropic's finding that agents are far less likely to inappropriately edit JSON files than Markdown ones.

Audit mode now checks for the presence of Session Startup and Definition of Done sections, flags DoD sections that omit end-to-end verification, and offers auto-fixes for both. agents-md mode populates both sections during generation using detected commands.

Bumps agent-ready to 1.3.0 in plugin.json and marketplace.json.
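To illustrate the JSON-over-Markdown note, here is a minimal sketch of a structured ledger update. The schema (`features`, `id`, `description`, `passes`) and the `mark_passing` helper are hypothetical and are not taken from this PR or the template; the sketch only shows why a single-field update on parsed JSON leaves the rest of the ledger untouched.

```python
import json

# Hypothetical ledger schema: these field names are illustrative only.
LEDGER = """
{
  "features": [
    {"id": "login", "description": "User can sign in", "passes": false},
    {"id": "export", "description": "User can export a CSV", "passes": false}
  ]
}
"""


def mark_passing(ledger_text: str, feature_id: str) -> str:
    """Return the ledger with one feature's `passes` flag set to true.

    Every other entry is round-tripped unchanged, so the update cannot
    reword, reorder, or drop unrelated items.
    """
    ledger = json.loads(ledger_text)
    matches = [f for f in ledger["features"] if f["id"] == feature_id]
    if len(matches) != 1:
        raise KeyError(f"expected exactly one feature with id {feature_id!r}")
    matches[0]["passes"] = True
    return json.dumps(ledger, indent=2)


updated = json.loads(mark_passing(LEDGER, "login"))
print([(f["id"], f["passes"]) for f in updated["features"]])
```

A free-form Markdown checklist has no equivalent of this narrow write path, which is one plausible reading of why agents are less likely to edit a JSON ledger inappropriately.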
Summary

Adds three patterns to the AGENTS.md template drawn from Anthropic's harness design articles (Effective harnesses for long-running agents, Harness design for long-running application development), addressing failure modes the existing scaffold did not cover:

- Session Startup: a bearing-getting ritual (`pwd`, `git log`, smoke test) agents run on every fresh context. Prevents the "look around, declare done" failure mode where an agent piles new work on top of an existing broken state.
- Definition of Done: codifies end-to-end verification before marking work complete. Addresses the most common over-confidence failure: agents shipping changes based on "unit tests pass" without exercising the feature end-to-end.
- JSON over Markdown for ledgers: a short note in Template Notes capturing Anthropic's finding that agents are far less likely to inappropriately edit JSON files than Markdown ones.

`agents-md` mode populates Session Startup and DoD during generation using detected build/test commands. `audit` mode now checks for both sections, flags DoD sections that omit end-to-end verification, and offers matching auto-fixes.

Bumps `agent-ready` to 1.3.0 in `plugin.json` and `marketplace.json`.

Test plan

- `audit` mode against a project missing Session Startup: confirm it flags the section as missing and offers a fix.
- `audit` mode against a project with DoD but no end-to-end mention: confirm the "present but does not mention end-to-end verification" warning fires.
- `agents-md` mode on a fresh project: confirm the generated AGENTS.md contains both new sections with detected commands filled in.
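The two audit scenarios in the test plan can be sketched as a small check over AGENTS.md text. This is an assumption-laden illustration, not the plugin's actual audit logic: the function names, the heading-matching rule, and the `end-to-end` keyword pattern are all hypothetical. Only the section names and the warning wording come from this PR.

```python
import re
from typing import List, Optional

# Assumed keyword heuristic; the real audit may detect this differently.
E2E_PATTERN = re.compile(r"end[- ]to[- ]end", re.IGNORECASE)


def section_body(markdown: str, title: str) -> Optional[str]:
    """Return the text under a heading named `title`, or None if absent."""
    lines = markdown.splitlines()
    heading = re.compile(r"^(#+)\s+" + re.escape(title) + r"\s*$", re.IGNORECASE)
    for i, line in enumerate(lines):
        match = heading.match(line)
        if not match:
            continue
        level = len(match.group(1))
        body = []
        for following in lines[i + 1:]:
            next_heading = re.match(r"^(#+)\s", following)
            # Stop at the next heading of the same or a higher level.
            if next_heading and len(next_heading.group(1)) <= level:
                break
            body.append(following)
        return "\n".join(body)
    return None


def audit(markdown: str) -> List[str]:
    """Collect findings for the two sections this PR adds to the template."""
    findings = []
    if section_body(markdown, "Session Startup") is None:
        findings.append("Session Startup: missing")
    dod = section_body(markdown, "Definition of Done")
    if dod is None:
        findings.append("Definition of Done: missing")
    elif not E2E_PATTERN.search(dod):
        findings.append(
            "Definition of Done: present but does not mention end-to-end verification"
        )
    return findings


sample = """# AGENTS.md

## Definition of Done

- Unit tests pass
"""
print(audit(sample))
```

The sample document has no Session Startup section and a Definition of Done that only mentions unit tests, so it exercises both the "missing" and the "present but incomplete" paths.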