docs: reorganize documentation into concepts, design, and examples
Move documentation files into a clearer structure: - concepts/: core ideas (convoy, identity, molecules, polecat-lifecycle, propulsion) - design/: architecture and protocols (architecture, escalation, federation, mail, etc.) - examples/: demos and tutorials (hanoi-demo) - overview.md: renamed from understanding-gas-town.md Remove outdated/superseded docs and update reference.md. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
committed by
Steve Yegge
parent
8ed31e9634
commit
88f784a9aa
283
docs/concepts/polecat-lifecycle.md
Normal file
283
docs/concepts/polecat-lifecycle.md
Normal file
@@ -0,0 +1,283 @@
|
||||
# Polecat Lifecycle
|
||||
|
||||
> Understanding the three-layer architecture of polecat workers
|
||||
|
||||
## Overview
|
||||
|
||||
Polecats have three distinct lifecycle layers that operate independently. Confusing
|
||||
these layers leads to bugs like "idle polecats" and misunderstanding when
|
||||
recycling occurs.
|
||||
|
||||
## The Self-Cleaning Polecat Model
|
||||
|
||||
**Polecats are responsible for their own cleanup.** When a polecat completes its
|
||||
work unit, it:
|
||||
|
||||
1. Signals completion via `gt done`
|
||||
2. Exits its session immediately (no idle waiting)
|
||||
3. Requests its own nuke (self-delete)
|
||||
|
||||
This removes dependency on the Witness/Deacon for cleanup and ensures polecats
|
||||
never sit idle. The simple model: **sandbox dies with session**.
|
||||
|
||||
### Why Self-Cleaning?
|
||||
|
||||
- **No idle polecats** - There's no state where a polecat exists without work
|
||||
- **Reduced watchdog overhead** - Deacon doesn't need to patrol for zombies
|
||||
- **Faster turnover** - Resources freed immediately on completion
|
||||
- **Simpler mental model** - Done means gone
|
||||
|
||||
### What About Pending Merges?
|
||||
|
||||
The Refinery owns the merge queue. Once `gt done` submits work:
|
||||
- The branch is pushed to origin
|
||||
- Work exists in the MQ, not in the polecat
|
||||
- If rebase fails, Refinery re-implements on new baseline (fresh polecat)
|
||||
- The original polecat is already gone - no sending work "back"
|
||||
|
||||
## The Three Layers
|
||||
|
||||
| Layer | Component | Lifecycle | Persistence |
|
||||
|-------|-----------|-----------|-------------|
|
||||
| **Session** | Claude (tmux pane) | Ephemeral | Cycles per step/handoff |
|
||||
| **Sandbox** | Git worktree | Persistent | Until nuke |
|
||||
| **Slot** | Name from pool | Persistent | Until nuke |
|
||||
|
||||
### Session Layer
|
||||
|
||||
The Claude session is **ephemeral**. It cycles frequently:
|
||||
|
||||
- After each molecule step (via `gt handoff`)
|
||||
- On context compaction
|
||||
- On crash/timeout
|
||||
- After extended work periods
|
||||
|
||||
**Key insight:** Session cycling is **normal operation**, not failure. The polecat
|
||||
continues working—only the Claude context refreshes.
|
||||
|
||||
```
|
||||
Session 1: Steps 1-2 → handoff
|
||||
Session 2: Steps 3-4 → handoff
|
||||
Session 3: Step 5 → gt done
|
||||
```
|
||||
|
||||
All three sessions are the **same polecat**. The sandbox and slot persist throughout.
|
||||
|
||||
### Sandbox Layer
|
||||
|
||||
The sandbox is the **git worktree**—the polecat's working directory:
|
||||
|
||||
```
|
||||
~/gt/gastown/polecats/Toast/
|
||||
```
|
||||
|
||||
This worktree:
|
||||
- Exists from `gt sling` until `gt polecat nuke`
|
||||
- Survives all session cycles
|
||||
- Contains uncommitted work, staged changes, branch state
|
||||
- Is independent of other polecat sandboxes
|
||||
|
||||
The Witness never destroys sandboxes mid-work. Only `nuke` removes them.
|
||||
|
||||
### Slot Layer
|
||||
|
||||
The slot is the **name allocation** from the polecat pool:
|
||||
|
||||
```bash
|
||||
# Pool: [Toast, Shadow, Copper, Ash, Storm...]
|
||||
# Toast is allocated to work gt-abc
|
||||
```
|
||||
|
||||
The slot:
|
||||
- Determines the sandbox path (`polecats/Toast/`)
|
||||
- Maps to a tmux session (`gt-gastown-Toast`)
|
||||
- Appears in attribution (`gastown/polecats/Toast`)
|
||||
- Is released only on nuke
|
||||
|
||||
## Correct Lifecycle
|
||||
|
||||
```
|
||||
┌─────────────────────────────────────────────────────────────┐
|
||||
│ gt sling │
|
||||
│ → Allocate slot from pool (Toast) │
|
||||
│ → Create sandbox (worktree on new branch) │
|
||||
│ → Start session (Claude in tmux) │
|
||||
│ → Hook molecule to polecat │
|
||||
└─────────────────────────────────────────────────────────────┘
|
||||
│
|
||||
▼
|
||||
┌─────────────────────────────────────────────────────────────┐
|
||||
│ Work Happens │
|
||||
│ │
|
||||
│ Session cycles happen here: │
|
||||
│ - gt handoff between steps │
|
||||
│ - Compaction triggers respawn │
|
||||
│ - Crash → Witness respawns │
|
||||
│ │
|
||||
│ Sandbox persists through ALL session cycles │
|
||||
└─────────────────────────────────────────────────────────────┘
|
||||
│
|
||||
▼
|
||||
┌─────────────────────────────────────────────────────────────┐
|
||||
│ gt done (self-cleaning) │
|
||||
│ → Push branch to origin │
|
||||
│ → Submit work to merge queue (MR bead) │
|
||||
│ → Request self-nuke (sandbox + session cleanup) │
|
||||
│ → Exit immediately │
|
||||
│ │
|
||||
│ Work now lives in MQ, not in polecat. │
|
||||
│ Polecat is GONE. No idle state. │
|
||||
└─────────────────────────────────────────────────────────────┘
|
||||
│
|
||||
▼
|
||||
┌─────────────────────────────────────────────────────────────┐
|
||||
│ Refinery: merge queue │
|
||||
│ → Rebase and merge to main │
|
||||
│ → Close the issue │
|
||||
│ → If conflict: spawn FRESH polecat to re-implement │
|
||||
│ (never send work back to original polecat - it's gone) │
|
||||
└─────────────────────────────────────────────────────────────┘
|
||||
```
|
||||
|
||||
## What "Recycle" Means
|
||||
|
||||
**Session cycling**: Normal. Claude restarts, sandbox stays, slot stays.
|
||||
|
||||
```bash
|
||||
gt handoff # Session cycles, polecat continues
|
||||
```
|
||||
|
||||
**Sandbox recreation**: Repair only. Should be rare.
|
||||
|
||||
```bash
|
||||
gt polecat repair Toast # Emergency: recreate corrupted worktree
|
||||
```
|
||||
|
||||
Session cycling happens constantly. Sandbox recreation should almost never happen
|
||||
during normal operation.
|
||||
|
||||
## Anti-Patterns
|
||||
|
||||
### Idle Polecats
|
||||
|
||||
**Myth:** Polecats wait between tasks in an idle state.
|
||||
|
||||
**Reality:** Polecats don't exist without work. The lifecycle is:
|
||||
1. Work assigned → polecat spawned
|
||||
2. Work done → polecat nuked
|
||||
3. There is no idle state
|
||||
|
||||
If you see a polecat without work, something is broken. Either:
|
||||
- The hook was lost (bug)
|
||||
- The session crashed before loading context
|
||||
- Manual intervention corrupted state
|
||||
|
||||
### Manual State Transitions
|
||||
|
||||
**Anti-pattern:**
|
||||
```bash
|
||||
gt polecat done Toast # DON'T: external state manipulation
|
||||
gt polecat reset Toast # DON'T: manual lifecycle control
|
||||
```
|
||||
|
||||
**Correct:**
|
||||
```bash
|
||||
# Polecat signals its own completion:
|
||||
gt done # (from inside the polecat session)
|
||||
|
||||
# Only Witness nukes polecats:
|
||||
gt polecat nuke Toast # (from Witness, after verification)
|
||||
```
|
||||
|
||||
Polecats manage their own session lifecycle. The Witness manages sandbox lifecycle.
|
||||
External manipulation bypasses verification.
|
||||
|
||||
### Sandboxes Without Work
|
||||
|
||||
**Anti-pattern:** A sandbox exists but no molecule is hooked.
|
||||
|
||||
This means:
|
||||
- The polecat was spawned incorrectly
|
||||
- The hook was lost during crash
|
||||
- State corruption occurred
|
||||
|
||||
**Recovery:**
|
||||
```bash
|
||||
# From Witness:
|
||||
gt polecat nuke Toast # Clean slate
|
||||
gt sling gt-abc gastown # Respawn with work
|
||||
```
|
||||
|
||||
### Confusing Session with Sandbox
|
||||
|
||||
**Anti-pattern:** Thinking session restart = losing work.
|
||||
|
||||
```bash
|
||||
# Session ends (handoff, crash, compaction)
|
||||
# Work is NOT lost because:
|
||||
# - Git commits persist in sandbox
|
||||
# - Staged changes persist in sandbox
|
||||
# - Molecule state persists in beads
|
||||
# - Hook persists across sessions
|
||||
```
|
||||
|
||||
The new session picks up where the old one left off via `gt prime`.
|
||||
|
||||
## Session Lifecycle Details
|
||||
|
||||
Sessions cycle for these reasons:
|
||||
|
||||
| Trigger | Action | Result |
|
||||
|---------|--------|--------|
|
||||
| `gt handoff` | Voluntary | Clean cycle to fresh context |
|
||||
| Context compaction | Automatic | Forced by Claude Code |
|
||||
| Crash/timeout | Failure | Witness respawns |
|
||||
| `gt done` | Completion | Session exits, Witness takes over |
|
||||
|
||||
All except `gt done` result in continued work. Only `gt done` signals completion.
|
||||
|
||||
## Witness Responsibilities
|
||||
|
||||
The Witness monitors polecats but does NOT:
|
||||
- Force session cycles (polecats self-manage via handoff)
|
||||
- Interrupt mid-step (unless truly stuck)
|
||||
- Nuke polecats (polecats self-nuke via `gt done`)
|
||||
|
||||
The Witness DOES:
|
||||
- Respawn crashed sessions
|
||||
- Nudge stuck polecats
|
||||
- Handle escalations
|
||||
- Clean up orphaned polecats (crash before `gt done`)
|
||||
|
||||
## Polecat Identity
|
||||
|
||||
**Key insight:** Polecat *identity* is long-lived; only sessions and sandboxes are ephemeral.
|
||||
|
||||
In the HOP model, every entity has a chain (CV) that tracks:
|
||||
- What work they've done
|
||||
- Success/failure rates
|
||||
- Skills demonstrated
|
||||
- Quality metrics
|
||||
|
||||
The polecat *name* (Toast, Shadow, etc.) is a slot from a pool - truly ephemeral.
|
||||
But the *agent identity* that executes as that polecat accumulates a work history.
|
||||
|
||||
```
|
||||
POLECAT IDENTITY (persistent) SESSION (ephemeral) SANDBOX (ephemeral)
|
||||
├── CV chain ├── Claude instance ├── Git worktree
|
||||
├── Work history ├── Context window ├── Branch
|
||||
├── Skills demonstrated └── Dies on handoff └── Dies on gt done
|
||||
└── Credit for work or gt done
|
||||
```
|
||||
|
||||
This distinction matters for:
|
||||
- **Attribution** - Who gets credit for the work?
|
||||
- **Skill routing** - Which agent is best for this task?
|
||||
- **Cost accounting** - Who pays for inference?
|
||||
- **Federation** - Agents having their own chains in a distributed world
|
||||
|
||||
## Related Documentation
|
||||
|
||||
- [Overview](../overview.md) - Role taxonomy and architecture
|
||||
- [Molecules](molecules.md) - Molecule execution and polecat workflow
|
||||
- [Propulsion Principle](propulsion-principle.md) - Why work triggers immediate execution
|
||||
Reference in New Issue
Block a user