Methodology Overview
Why This Matters
Section titled “Why This Matters”Every engineering team accumulates invisible debt — not in code, but in decisions. You pick a database, choose an auth strategy, design an API surface. Six months later, someone asks “why did we do it this way?” and nobody remembers. The Slack thread is buried. The Google Doc is stale. The person who made the call left the company.
This is not a documentation problem. It is a knowledge loss problem. And it compounds: teams repeat the same mistakes, relitigate the same debates, and build on assumptions nobody can verify.
Forgeplan exists to make decisions traceable, verifiable, and durable.
The Problem
Section titled “The Problem”Decisions are scattered across Slack threads, Google Docs, and someone’s memory. PRDs nobody reads. RFCs without follow-up. Architecture choices made on gut feeling.
Forgeplan makes you think before coding. Instead of “open IDE -> write code -> deploy”, you get:
Route -> Shape -> Validate -> Reason -> Build -> Prove -> ActivateConsider a real scenario: your team decides to use JWT for authentication. Six months later, a security audit asks why you chose JWT over session-based auth. With Forgeplan, the ADR explains the reasoning, the Evidence shows the benchmark results, and the PRD traces back to the original requirements. Without Forgeplan, you are digging through Slack history and hoping someone remembers.
What Forgeplan Is NOT
Section titled “What Forgeplan Is NOT”- Not Jira. Not a project management tool.
- Not a task tracker. Not for daily standups.
- Not a code generator. AI assists, you decide.
Forgeplan is a structured knowledge base for engineering decisions — local-first, git-native, single binary. Think of it as a lab notebook for engineering: you record what you tried, why, and what the results were.
Core Principles
Section titled “Core Principles”1. Pipeline is a guideline, not bureaucracy
Section titled “1. Pipeline is a guideline, not bureaucracy”Don’t create all 10 artifact types for every task. Tactical depth = just code. Deep depth = full pipeline. If the answer is obvious and reversible, skip the paperwork. A one-line bug fix does not need a PRD.
2. Every requirement: “[Actor] can [capability]”
Section titled “2. Every requirement: “[Actor] can [capability]””No implementation leakage in PRDs. Describe WHAT, not HOW. Write “User can persist and query structured data with ACID guarantees” — not “Use PostgreSQL for storage.” The moment you name a technology in a requirement, you have closed off alternatives before evaluating them.
3. Child references parent
Section titled “3. Child references parent”PRD -> Epic, RFC -> PRD, ADR -> RFC. Always traceable upward. This means if you read any artifact, you can follow the links up to understand the full context of why it exists.
4. Supersede, don’t delete
Section titled “4. Supersede, don’t delete”Old artifacts get status: superseded. History is preserved. When your team asks “didn’t we try this approach before?”, the superseded artifact shows what was tried, why it was abandoned, and what replaced it.
5. R_eff = min(evidence)
Section titled “5. R_eff = min(evidence)”Trust is the weakest link. Not average — minimum. One blind spot drags everything down. If you have three solid benchmarks and one completely untested assumption about production load, your decision reliability equals that untested assumption.
The Full Cycle
Section titled “The Full Cycle”1. Route: forgeplan route "your task" -> determines depth2. Shape: forgeplan new prd "Title" -> create artifact3. Validate: forgeplan validate PRD-XXX -> quality gates4. Reason: forgeplan reason PRD-XXX -> ADI hypotheses (Standard+)5. Build: write code, test every pub fn6. Prove: forgeplan new evidence -> link -> score7. Activate: forgeplan activate PRD-XXXWork isn’t done until: PRD filled + validated + evidence created + R_eff > 0 + activated.
Common Mistakes
Section titled “Common Mistakes”- Creating artifacts for everything. A button color change does not need a PRD. Use
forgeplan routeto determine the right level of rigor — most tasks are Tactical and need zero documentation. - Leaving PRD stubs. Creating an empty PRD and moving on to code means you now have a stub that clutters your project health. Either fill it immediately or do not create it.
- Activating without evidence. An active PRD with no code, no tests, and R_eff = 0 is a false promise. It tells anyone reading your project that this decision is validated when it is not.
- Skipping ADI on Deep decisions. If the router says Deep and you jump straight to code, you are betting on your first idea without checking alternatives. ADI takes 10 minutes; a wrong architecture takes weeks to undo.
- Treating the pipeline as a checklist. The pipeline is a thinking tool. Going through the motions mechanically (create PRD, check; create RFC, check) without actually reasoning through each step defeats the purpose.
10 Artifact Types
Section titled “10 Artifact Types”| Type | Question it answers | When to use |
|---|---|---|
| PRD | What & why? | Feature planning |
| RFC | How to build? | Architecture proposal |
| ADR | Why this way? | Decision record |
| Epic | What’s the big picture? | Multi-PRD initiative |
| Spec | What’s the contract? | API/data models |
| Problem | What’s the signal? | Bug, risk, observation |
| Evidence | Does it work? | Test results, benchmarks |
| Solution | Which option? | 2-3 variant comparison |
| Note | Quick decision | Micro-decision (90-day TTL) |
| Refresh | Still valid? | Stale artifact re-evaluation |
In practice, most teams use 5-6 types regularly: PRD, RFC, ADR, Evidence, Problem, and Epic. The rest (Note, Solution, Spec, Refresh) are situational. Do not feel obligated to use all 10.
When NOT to Create an Artifact
Section titled “When NOT to Create an Artifact”Not every task deserves structure. Here is a quick guide:
- Bug fix in one file — just fix it. Maybe a Note if the root cause was surprising.
- Obvious refactoring — no behavior change, no artifact needed.
- Internal tooling tweak — if only you use it and it is easily reversible, skip.
- Anything where the answer is obvious — do not document what does not need explaining.
The litmus test: “Will someone (including future me) ever ask why this decision was made?” If yes, create an artifact. If no, just ship.
The 10 Rules (at a glance)
Section titled “The 10 Rules (at a glance)”The rest of the methodology expands on ten rules. Each has its own doc — this list is the index.
- Route before you code — Routing & Depth
- Shape → Validate → ADI → Code → Evidence → Activate — full cycle, not a checklist
[Actor] can [capability]— no implementation leakage in PRD FRs- Child references parent — PRD→Epic, RFC→PRD, ADR→RFC
- Supersede, don’t delete — Lifecycle
- Quality gates by depth — Tactical = nothing, Standard = Verification Gate, Deep+ = Adversarial Review
- Session start:
health+ memory recall — fix orphans and blind spots first - Work isn’t done until R_eff > 0 + activated — Evidence & R_eff
- ADI is mandatory on Deep+ — ADI Reasoning
- Stale = re-validate, don’t ignore — set
valid_untilon every ADR
Tooling
Section titled “Tooling”In AI coding agents this entire methodology is packaged as the /forge skill. Install it with forgeplan setup-skill (offline, built into the binary) or npx skills add ForgePlan/marketplace --skill forge. The skill wraps route -> new -> validate -> reason -> activate into a single conversational command.
Additional plugins provide code auditing (/audit), sprint planning (/sprint), FPF reasoning (/fpf), and more. Full plugin catalog: Marketplace Overview.
Next Steps
Section titled “Next Steps”- Quick Start — first artifact in 10 minutes
- First Artifact Tutorial — hands-on 20-minute walkthrough
- Routing & Depth — how to choose the right level
- Artifact Lifecycle — draft → active → terminal
- Evidence & R_eff Scoring — how trust is measured
- ADI Reasoning — structured thinking before building
- CLI Reference — every command documented
- FPF Framework Guide — the reasoning backbone