docs: Add pluggable molecules design documentation

- Add Pluggable Molecules section to architecture.md explaining directory-based plugins for extensible workflows - Create molecules.md with full design for code-review molecule - Document plugin CLAUDE.md format with frontmatter - Describe phased execution (discovery/structural/tactical/synthesis) - Include implementation plan and usage examples 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-20 03:09:20 -08:00
parent 17123eabd9
commit c84d832c98
3 changed files with 723 additions and 69 deletions
@@ -564,6 +564,166 @@ Needs: kill-core
 - Batch processing (swarm coordination)
 - Recovery procedures (doctor --fix)

+### Pluggable Molecules
+
+Some workflows benefit from **pluggable steps** - dimensions that can be added or removed by creating/deleting directories. The canonical example is **code review**, where each review dimension (security, performance, test coverage) is a plugin.
+
+#### Philosophy: Plugins Are Directories
+
+In Gas Town, a plugin is just a directory that agents notice. If the directory exists, the plugin exists. No registration, no manifests, no YAML configs.
+
+```
+~/gt/molecules/code-review/
+├── discovery/
+│   ├── file-census/
+│   │   └── CLAUDE.md          # Instructions for this step
+│   ├── dep-graph/
+│   │   └── CLAUDE.md
+│   └── duplication-scan/
+│       └── CLAUDE.md
+├── structural/
+│   ├── architecture-review/
+│   │   └── CLAUDE.md
+│   └── missing-abstractions/
+│       └── CLAUDE.md
+└── tactical/
+    ├── security-scan/
+    │   └── CLAUDE.md
+    ├── performance-review/
+    │   └── CLAUDE.md
+    └── test-gaps/
+        └── CLAUDE.md
+```
+
+**Add a dimension**: Create a directory with CLAUDE.md
+**Remove a dimension**: Delete the directory
+**Customize a dimension**: Edit its CLAUDE.md
+
+#### Plugin Metadata
+
+Each plugin can include frontmatter in its CLAUDE.md:
+
+```markdown
+---
+phase: tactical
+needs: [structural-complete]
+tier: sonnet
+---
+
+# Security Scan
+
+Look for OWASP Top 10 vulnerabilities...
+```
+
+| Field | Description |
+|-------|-------------|
+| `phase` | Grouping for ordering (discovery, structural, tactical, synthesis) |
+| `needs` | Dependencies - other steps that must complete first |
+| `tier` | Model hint (haiku, sonnet, opus) |
+
+#### Dynamic Molecule Assembly
+
+When instantiating a pluggable molecule, the system:
+
+1. **Scans** plugin directories under the molecule root
+2. **Parses** metadata from each plugin's CLAUDE.md
+3. **Assembles** the DAG based on phase groupings and dependencies
+4. **Creates** beads for each discovered step
+
+```bash
+gt molecule instantiate code-review --parent=gt-xyz --scope=src/auth
+```
+
+Creates beads like:
+```
+gt-xyz.discovery-file-census
+gt-xyz.discovery-dep-graph
+gt-xyz.structural-architecture-review    # blocks tactical
+gt-xyz.tactical-src-auth-security
+gt-xyz.tactical-src-auth-performance
+gt-xyz.synthesis
+```
+
+The **directory structure IS the molecule**. No separate molecule definition needed.
+
+#### Phases and Ordering
+
+Pluggable molecules support phased execution:
+
+| Phase | Purpose | Parallelism |
+|-------|---------|-------------|
+| `discovery` | Inventory, analyze codebase | Fully parallel |
+| `structural` | Big picture issues (architecture, abstractions) | Sequential for coherence |
+| `tactical` | Per-component detailed review | Parallel per component |
+| `synthesis` | Aggregate, dedupe, prioritize | Single coordinator |
+
+Steps in earlier phases block steps in later phases. Within a phase, parallelism depends on explicit `needs` declarations.
+
+#### Code Review Molecule
+
+The **code-review** molecule is the reference implementation:
+
+**Discovery Phase** (parallel scouts):
+- `file-census` - Inventory: sizes, ages, churn rates
+- `dep-graph` - Dependencies, cycles, inversions
+- `coverage-map` - Test coverage, dead code
+- `duplication-scan` - Near-duplicate files, copy-paste debt
+
+**Structural Phase** (sequential):
+- `architecture-review` - Does structure match domain?
+- `abstraction-analysis` - Wrangling at wrong layers?
+- `consolidation-planner` - What should be unified?
+
+**Tactical Phase** (parallel per hotspot):
+- `security-scan` - OWASP Top 10, injection, auth bypass
+- `performance-review` - N+1 queries, missing caching
+- `complexity-analysis` - Cyclomatic > 10, deep nesting
+- `test-gaps` - Untested branches, missing edge cases
+- `elegance-review` - Magic numbers, unclear names
+
+**Synthesis Phase** (single coordinator):
+- Deduplicate findings
+- Establish dependencies between fix-beads
+- Prioritize by impact
+- Sequence recommendations
+
+#### Findings Become Beads
+
+Each review step generates findings as beads:
+
+```
+gt-sec-001  SQL injection in login()     discovered-from: gt-xyz.tactical-security
+gt-sec-002  Missing CSRF token           discovered-from: gt-xyz.tactical-security
+gt-perf-001 N+1 query in dashboard       discovered-from: gt-xyz.tactical-performance
+```
+
+These are the work that "feeds the beast" - the review molecule generates fix beads.
+
+#### Iteration Without Built-In Loops
+
+You don't need convergence built into the molecule. Just run it again:
+
+1. Run `gt molecule instantiate code-review`
+2. Swarm closes all review beads, generates fix beads
+3. Fix beads get closed
+4. Run `gt molecule instantiate code-review` again
+5. Fewer findings this time
+6. Repeat until noise floor
+
+Each instantiation is independent. The ledger shows all runs, enabling comparison.
+
+#### Static vs Pluggable
+
+| Aspect | Static Molecule | Pluggable Molecule |
+|--------|-----------------|-------------------|
+| Definition | Steps in issue description | Steps from directory scan |
+| Add step | Edit molecule bead | Create directory |
+| Remove step | Edit molecule bead | Delete directory |
+| Customization | Edit description | Edit plugin CLAUDE.md |
+| Use case | Fixed workflows | Extensible workflows |
+
+Both patterns are valid. Use static molecules for well-defined workflows (engineer-in-box, polecat-work). Use pluggable molecules when dimensions should be customizable (code-review, migration-analysis).
+
 ## Directory Structure

 ### Harness Level
@@ -0,0 +1,486 @@
+# Molecules: Composable Workflow Templates
+
+This document covers the molecule system in depth, including pluggable molecules
+and the code-review molecule design.
+
+For an overview, see [architecture.md](architecture.md#molecules-composable-workflow-templates).
+
+## Core Concepts
+
+A **molecule** is a crystallized workflow pattern stored as a beads issue. When
+instantiated on a parent issue, it creates child beads forming a DAG of steps.
+
+| Concept | Description |
+|---------|-------------|
+| Molecule | Read-only workflow template (type=molecule in beads) |
+| Step | Individual unit of work within a molecule |
+| Bond | Dependency between steps (Needs: directive) |
+| Instance | Concrete beads created when molecule is instantiated |
+
+## Two Molecule Types
+
+### Static Molecules
+
+Steps are embedded in the molecule's description field:
+
+```markdown
+## Step: design
+Think carefully about architecture...
+
+## Step: implement
+Write the code...
+Needs: design
+
+## Step: test
+Run tests...
+Needs: implement
+```
+
+**Use case**: Well-defined, fixed workflows (engineer-in-box, polecat-work).
+
+**Commands**:
+```bash
+bd create --type=molecule --title="My Workflow" --description="..."
+gt molecule instantiate mol-xyz --parent=issue-123
+```
+
+### Pluggable Molecules
+
+Steps are discovered from directories. Each directory is a plugin:
+
+```
+~/gt/molecules/code-review/
+├── discovery/
+│   ├── file-census/CLAUDE.md
+│   └── dep-graph/CLAUDE.md
+├── structural/
+│   └── architecture-review/CLAUDE.md
+└── tactical/
+    ├── security-scan/CLAUDE.md
+    └── performance-review/CLAUDE.md
+```
+
+**Use case**: Extensible workflows where dimensions can be added/removed.
+
+**Commands**:
+```bash
+gt molecule instantiate code-review --parent=issue-123 --scope=src/
+```
+
+## Plugin CLAUDE.md Format
+
+Each plugin directory contains a CLAUDE.md with optional frontmatter:
+
+```markdown
+---
+phase: tactical
+needs: [structural-complete]
+tier: sonnet
+---
+
+# Security Scan
+
+## Objective
+
+Identify security vulnerabilities in the target code.
+
+## Focus Areas
+
+- OWASP Top 10 vulnerabilities
+- Injection attacks (SQL, command, LDAP)
+- Authentication/authorization bypasses
+- Hardcoded secrets
+- Insecure deserialization
+
+## Output
+
+For each finding, create a bead with:
+- Clear title describing the vulnerability
+- File path and line numbers
+- Severity (P0-P4)
+- Suggested remediation
+
+Tag findings with `label: security`.
+```
+
+### Frontmatter Fields
+
+| Field | Type | Description |
+|-------|------|-------------|
+| `phase` | string | Execution phase: discovery, structural, tactical, synthesis |
+| `needs` | list | Step references that must complete first |
+| `tier` | string | Model hint: haiku, sonnet, opus |
+
+### Phase Semantics
+
+| Phase | Blocks | Parallelism | Purpose |
+|-------|--------|-------------|---------|
+| discovery | nothing | full | Inventory, gather data |
+| structural | discovery | sequential | Big picture analysis |
+| tactical | structural | per-component | Detailed review |
+| synthesis | tactical | single | Aggregate results |
+
+## Code Review Molecule
+
+The code-review molecule is the reference implementation for pluggable molecules.
+
+### Directory Structure
+
+```
+~/gt/molecules/code-review/
+├── README.md                    # Molecule overview
+│
+├── discovery/                   # Phase 1: Parallel scouts
+│   ├── file-census/
+│   │   └── CLAUDE.md           # Inventory: sizes, ages, churn
+│   ├── dep-graph/
+│   │   └── CLAUDE.md           # Dependencies, cycles, inversions
+│   ├── coverage-map/
+│   │   └── CLAUDE.md           # Test coverage, dead code
+│   └── duplication-scan/
+│       └── CLAUDE.md           # Near-duplicates, copy-paste
+│
+├── structural/                  # Phase 2: Sequential for coherence
+│   ├── architecture-review/
+│   │   └── CLAUDE.md           # Structure vs domain alignment
+│   ├── abstraction-analysis/
+│   │   └── CLAUDE.md           # Wrong-layer wrangling
+│   └── consolidation-planner/
+│       └── CLAUDE.md           # What should be unified
+│
+├── tactical/                    # Phase 3: Parallel per hotspot
+│   ├── security-scan/
+│   │   └── CLAUDE.md           # OWASP, injection, auth
+│   ├── performance-review/
+│   │   └── CLAUDE.md           # N+1, caching, memory
+│   ├── complexity-analysis/
+│   │   └── CLAUDE.md           # Cyclomatic, nesting
+│   ├── test-gaps/
+│   │   └── CLAUDE.md           # Untested paths, edge cases
+│   └── elegance-review/
+│       └── CLAUDE.md           # Magic numbers, naming
+│
+└── synthesis/                   # Phase 4: Single coordinator
+    └── aggregate/
+        └── CLAUDE.md           # Dedupe, prioritize, sequence
+```
+
+### Discovery Phase Plugins
+
+#### file-census
+
+**Purpose**: Build inventory of what we're reviewing.
+
+**Output**:
+- Total files, lines, and size
+- Files by age (old = potential legacy)
+- Files by churn (high churn = hotspots)
+- Largest files (candidates for splitting)
+
+#### dep-graph
+
+**Purpose**: Map dependencies and structure.
+
+**Output**:
+- Dependency graph (imports, requires)
+- Circular dependencies
+- Orphaned code (unreachable)
+- Inverted dependencies (high-level depending on low-level)
+
+#### coverage-map
+
+**Purpose**: Understand test coverage.
+
+**Output**:
+- Overall coverage percentage
+- Untested files/functions
+- Coverage by component
+- Dead code (never executed)
+
+#### duplication-scan
+
+**Purpose**: Find duplicated logic.
+
+**Output**:
+- Near-duplicate files
+- Copy-paste code blocks
+- Redundant implementations of same concept
+
+### Structural Phase Plugins
+
+#### architecture-review
+
+**Purpose**: Assess high-level structure.
+
+**Questions**:
+- Does directory structure match domain concepts?
+- Are boundaries clean between components?
+- Is there a clear layering strategy?
+- Are cross-cutting concerns (logging, auth) handled consistently?
+
+**Output**: Structural findings as beads, with refactoring recommendations.
+
+#### abstraction-analysis
+
+**Purpose**: Find missing or wrong abstractions.
+
+**Signs of problems**:
+- Same boilerplate repeated
+- Business logic mixed with infrastructure
+- Leaky abstractions (implementation details exposed)
+- Primitive obsession (should be domain types)
+
+**Output**: Abstraction issues as beads.
+
+#### consolidation-planner
+
+**Purpose**: Identify what should be unified.
+
+**Looks for**:
+- Multiple implementations of same concept
+- Similar code in different places
+- Parallel hierarchies
+- Scattered handling of same concern
+
+**Output**: Consolidation recommendations as beads.
+
+### Tactical Phase Plugins
+
+These run in parallel, each agent reviewing assigned files/components.
+
+#### security-scan
+
+**Focus**:
+- OWASP Top 10
+- Injection vulnerabilities
+- Authentication/authorization issues
+- Secrets in code
+- Insecure configurations
+
+#### performance-review
+
+**Focus**:
+- N+1 queries
+- Missing caching opportunities
+- Memory leaks
+- Unnecessary computation
+- Blocking operations in hot paths
+
+#### complexity-analysis
+
+**Focus**:
+- Cyclomatic complexity > 10
+- Deep nesting (> 4 levels)
+- Long functions (> 50 lines)
+- God classes/files
+- Complex conditionals
+
+#### test-gaps
+
+**Focus**:
+- Untested public APIs
+- Missing edge cases
+- No error path testing
+- Brittle tests (mock-heavy, order-dependent)
+
+#### elegance-review
+
+**Focus**:
+- Magic numbers/strings
+- Unclear naming
+- Inconsistent style
+- Missing documentation for complex logic
+- Overly clever code
+
+### Synthesis Phase
+
+#### aggregate
+
+**Purpose**: Combine all findings into actionable backlog.
+
+**Tasks**:
+1. Deduplicate similar findings
+2. Group related issues
+3. Establish fix dependencies (fix X before Y)
+4. Prioritize by impact
+5. Sequence for efficient fixing
+
+**Output**: Prioritized backlog ready for swarming.
+
+## Implementation Plan
+
+### Phase 1: Pluggable Molecule Infrastructure
+
+1. **Directory scanner** (`internal/molecule/scanner.go`)
+   - Scan molecule directories for plugins
+   - Parse CLAUDE.md frontmatter
+   - Build plugin registry
+
+2. **DAG builder** (`internal/molecule/dag.go`)
+   - Assemble dependency graph from plugins
+   - Respect phase ordering
+   - Validate no cycles
+
+3. **Instantiation** (`internal/molecule/instantiate.go`)
+   - Create beads for each step
+   - Wire dependencies
+   - Support scope parameter
+
+### Phase 2: Code Review Molecule
+
+1. **Plugin directory structure**
+   - Create ~/gt/molecules/code-review/
+   - Write CLAUDE.md for each dimension
+
+2. **Discovery plugins** (4)
+   - file-census, dep-graph, coverage-map, duplication-scan
+
+3. **Structural plugins** (3)
+   - architecture-review, abstraction-analysis, consolidation-planner
+
+4. **Tactical plugins** (5)
+   - security-scan, performance-review, complexity-analysis, test-gaps, elegance-review
+
+5. **Synthesis plugin** (1)
+   - aggregate
+
+### Phase 3: CLI Integration
+
+1. **gt molecule scan** - Show discovered plugins
+2. **gt molecule validate** - Validate plugin structure
+3. **gt molecule instantiate** - Create beads from plugins
+4. **gt review** - Convenience wrapper for code-review molecule
+
+## Usage Examples
+
+### Basic Code Review
+
+```bash
+# Run full code review on project
+gt molecule instantiate code-review --parent=gt-review-001
+
+# Check what's ready to work
+bd ready
+
+# Swarm it
+gt swarm --parent=gt-review-001
+```
+
+### Scoped Review
+
+```bash
+# Review only src/auth/
+gt molecule instantiate code-review --parent=gt-review-002 --scope=src/auth
+
+# Review only tactical dimensions
+gt molecule instantiate code-review --parent=gt-review-003 --phases=tactical
+```
+
+### Adding a Custom Dimension
+
+```bash
+# Create plugin directory
+mkdir -p ~/gt/molecules/code-review/tactical/accessibility-review
+
+# Add CLAUDE.md
+cat > ~/gt/molecules/code-review/tactical/accessibility-review/CLAUDE.md << 'EOF'
+---
+phase: tactical
+needs: [structural-complete]
+tier: sonnet
+---
+
+# Accessibility Review
+
+Check for WCAG 2.1 compliance issues...
+EOF
+
+# Now it's automatically included in code-review
+gt molecule scan code-review
+```
+
+### Iteration
+
+```bash
+# First review pass
+gt molecule instantiate code-review --parent=gt-review-001
+# ... fix issues ...
+
+# Second pass (fewer findings expected)
+gt molecule instantiate code-review --parent=gt-review-002
+# ... fix remaining issues ...
+
+# Third pass (should be at noise floor)
+gt molecule instantiate code-review --parent=gt-review-003
+```
+
+## Beads Generated by Reviews
+
+Each review step generates findings as beads:
+
+```
+gt-sec-001  SQL injection in login()     type=bug  priority=1  label=security
+gt-sec-002  Missing CSRF protection       type=bug  priority=2  label=security
+gt-perf-001 N+1 query in user list       type=bug  priority=2  label=performance
+gt-arch-001 Auth logic in controller     type=task priority=3  label=refactor
+```
+
+Findings link back to the review:
+```
+discovered-from: gt-review-001.tactical-security-scan
+```
+
+This enables querying: "What did the security scan find?"
+
+## Feed the Beast Pattern
+
+Code review is a **work generator**:
+
+```
+Low on beads?
+     │
+     ▼
+gt molecule instantiate code-review
+     │
+     ▼
+Generates 50-200 fix beads
+     │
+     ▼
+Prioritize and swarm
+     │
+     ▼
+Codebase improves overnight
+     │
+     ▼
+Repeat weekly
+```
+
+## Future Extensions
+
+### Custom Molecule Types
+
+Beyond code-review, pluggable molecules could support:
+
+- **migration-analysis**: Database migrations, API versioning
+- **onboarding-review**: New hire documentation gaps
+- **compliance-audit**: Regulatory requirements check
+- **dependency-audit**: Outdated/vulnerable dependencies
+
+### Scheduled Reviews
+
+```yaml
+# In rig config
+scheduled_molecules:
+  - molecule: code-review
+    scope: "**/*.go"
+    schedule: "0 0 * * 0"  # Weekly Sunday midnight
+    priority: 3
+```
+
+### Review Trends
+
+Track findings over time:
+```bash
+gt review history --molecule=code-review
+# Shows: findings per run, categories, fix rate
+```