Polecats now self-clean when done: - gt done always exits session (no more --exit flag needed) - gt done requests self-nuke (sandbox cleanup) - No idle polecats - done means gone - Refinery re-implements on conflict (never sends work back) New formulas: - mol-polecat-review-pr: review external PRs, approve/reject - mol-polecat-code-review: review code, file beads for findings Docs updated: - polecat-lifecycle.md: self-cleaning model, identity vs session - polecat-CLAUDE.md: updated contract and completion protocol - mol-polecat-work: updated for self-cleaning Implementation beads filed: - gt-yrz4k: gt done always exits - gt-fqcst: polecat self-nuke mechanism - gt-zdmde: abstract work unit completion Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
284 lines
10 KiB
Markdown
284 lines
10 KiB
Markdown
# Polecat Lifecycle
|
|
|
|
> Understanding the three-layer architecture of polecat workers
|
|
|
|
## Overview
|
|
|
|
Polecats have three distinct lifecycle layers that operate independently. Confusing
|
|
these layers leads to bugs like "idle polecats" and misunderstanding when
|
|
recycling occurs.
|
|
|
|
## The Self-Cleaning Polecat Model
|
|
|
|
**Polecats are responsible for their own cleanup.** When a polecat completes its
|
|
work unit, it:
|
|
|
|
1. Signals completion via `gt done`
|
|
2. Exits its session immediately (no idle waiting)
|
|
3. Requests its own nuke (self-delete)
|
|
|
|
This removes dependency on the Witness/Deacon for cleanup and ensures polecats
|
|
never sit idle. The simple model: **sandbox dies with session**.
|
|
|
|
### Why Self-Cleaning?
|
|
|
|
- **No idle polecats** - There's no state where a polecat exists without work
|
|
- **Reduced watchdog overhead** - Deacon doesn't need to patrol for zombies
|
|
- **Faster turnover** - Resources freed immediately on completion
|
|
- **Simpler mental model** - Done means gone
|
|
|
|
### What About Pending Merges?
|
|
|
|
The Refinery owns the merge queue. Once `gt done` submits work:
|
|
- The branch is pushed to origin
|
|
- Work exists in the MQ, not in the polecat
|
|
- If rebase fails, Refinery re-implements on new baseline (fresh polecat)
|
|
- The original polecat is already gone - no sending work "back"
|
|
|
|
## The Three Layers
|
|
|
|
| Layer | Component | Lifecycle | Persistence |
|
|
|-------|-----------|-----------|-------------|
|
|
| **Session** | Claude (tmux pane) | Ephemeral | Cycles per step/handoff |
|
|
| **Sandbox** | Git worktree | Persistent | Until nuke |
|
|
| **Slot** | Name from pool | Persistent | Until nuke |
|
|
|
|
### Session Layer
|
|
|
|
The Claude session is **ephemeral**. It cycles frequently:
|
|
|
|
- After each molecule step (via `gt handoff`)
|
|
- On context compaction
|
|
- On crash/timeout
|
|
- After extended work periods
|
|
|
|
**Key insight:** Session cycling is **normal operation**, not failure. The polecat
|
|
continues working—only the Claude context refreshes.
|
|
|
|
```
|
|
Session 1: Steps 1-2 → handoff
|
|
Session 2: Steps 3-4 → handoff
|
|
Session 3: Step 5 → gt done
|
|
```
|
|
|
|
All three sessions are the **same polecat**. The sandbox and slot persist throughout.
|
|
|
|
### Sandbox Layer
|
|
|
|
The sandbox is the **git worktree**—the polecat's working directory:
|
|
|
|
```
|
|
~/gt/gastown/polecats/Toast/
|
|
```
|
|
|
|
This worktree:
|
|
- Exists from `gt sling` until `gt polecat nuke`
|
|
- Survives all session cycles
|
|
- Contains uncommitted work, staged changes, branch state
|
|
- Is independent of other polecat sandboxes
|
|
|
|
The Witness never destroys sandboxes mid-work. Only `nuke` removes them.
|
|
|
|
### Slot Layer
|
|
|
|
The slot is the **name allocation** from the polecat pool:
|
|
|
|
```bash
|
|
# Pool: [Toast, Shadow, Copper, Ash, Storm...]
|
|
# Toast is allocated to work gt-abc
|
|
```
|
|
|
|
The slot:
|
|
- Determines the sandbox path (`polecats/Toast/`)
|
|
- Maps to a tmux session (`gt-gastown-Toast`)
|
|
- Appears in attribution (`gastown/polecats/Toast`)
|
|
- Is released only on nuke
|
|
|
|
## Correct Lifecycle
|
|
|
|
```
|
|
┌─────────────────────────────────────────────────────────────┐
|
|
│ gt sling │
|
|
│ → Allocate slot from pool (Toast) │
|
|
│ → Create sandbox (worktree on new branch) │
|
|
│ → Start session (Claude in tmux) │
|
|
│ → Hook molecule to polecat │
|
|
└─────────────────────────────────────────────────────────────┘
|
|
│
|
|
▼
|
|
┌─────────────────────────────────────────────────────────────┐
|
|
│ Work Happens │
|
|
│ │
|
|
│ Session cycles happen here: │
|
|
│ - gt handoff between steps │
|
|
│ - Compaction triggers respawn │
|
|
│ - Crash → Witness respawns │
|
|
│ │
|
|
│ Sandbox persists through ALL session cycles │
|
|
└─────────────────────────────────────────────────────────────┘
|
|
│
|
|
▼
|
|
┌─────────────────────────────────────────────────────────────┐
|
|
│ gt done (self-cleaning) │
|
|
│ → Push branch to origin │
|
|
│ → Submit work to merge queue (MR bead) │
|
|
│ → Request self-nuke (sandbox + session cleanup) │
|
|
│ → Exit immediately │
|
|
│ │
|
|
│ Work now lives in MQ, not in polecat. │
|
|
│ Polecat is GONE. No idle state. │
|
|
└─────────────────────────────────────────────────────────────┘
|
|
│
|
|
▼
|
|
┌─────────────────────────────────────────────────────────────┐
|
|
│ Refinery: merge queue │
|
|
│ → Rebase and merge to main │
|
|
│ → Close the issue │
|
|
│ → If conflict: spawn FRESH polecat to re-implement │
|
|
│ (never send work back to original polecat - it's gone) │
|
|
└─────────────────────────────────────────────────────────────┘
|
|
```
|
|
|
|
## What "Recycle" Means
|
|
|
|
**Session cycling**: Normal. Claude restarts, sandbox stays, slot stays.
|
|
|
|
```bash
|
|
gt handoff # Session cycles, polecat continues
|
|
```
|
|
|
|
**Sandbox recreation**: Repair only. Should be rare.
|
|
|
|
```bash
|
|
gt polecat repair Toast # Emergency: recreate corrupted worktree
|
|
```
|
|
|
|
Session cycling happens constantly. Sandbox recreation should almost never happen
|
|
during normal operation.
|
|
|
|
## Anti-Patterns
|
|
|
|
### Idle Polecats
|
|
|
|
**Myth:** Polecats wait between tasks in an idle state.
|
|
|
|
**Reality:** Polecats don't exist without work. The lifecycle is:
|
|
1. Work assigned → polecat spawned
|
|
2. Work done → polecat nuked
|
|
3. There is no idle state
|
|
|
|
If you see a polecat without work, something is broken. Either:
|
|
- The hook was lost (bug)
|
|
- The session crashed before loading context
|
|
- Manual intervention corrupted state
|
|
|
|
### Manual State Transitions
|
|
|
|
**Anti-pattern:**
|
|
```bash
|
|
gt polecat done Toast # DON'T: external state manipulation
|
|
gt polecat reset Toast # DON'T: manual lifecycle control
|
|
```
|
|
|
|
**Correct:**
|
|
```bash
|
|
# Polecat signals its own completion:
|
|
gt done # (from inside the polecat session)
|
|
|
|
# Only Witness nukes polecats:
|
|
gt polecat nuke Toast # (from Witness, after verification)
|
|
```
|
|
|
|
Polecats manage their own session lifecycle. The Witness manages sandbox lifecycle.
|
|
External manipulation bypasses verification.
|
|
|
|
### Sandboxes Without Work
|
|
|
|
**Anti-pattern:** A sandbox exists but no molecule is hooked.
|
|
|
|
This means:
|
|
- The polecat was spawned incorrectly
|
|
- The hook was lost during crash
|
|
- State corruption occurred
|
|
|
|
**Recovery:**
|
|
```bash
|
|
# From Witness:
|
|
gt polecat nuke Toast # Clean slate
|
|
gt sling gt-abc gastown # Respawn with work
|
|
```
|
|
|
|
### Confusing Session with Sandbox
|
|
|
|
**Anti-pattern:** Thinking session restart = losing work.
|
|
|
|
```bash
|
|
# Session ends (handoff, crash, compaction)
|
|
# Work is NOT lost because:
|
|
# - Git commits persist in sandbox
|
|
# - Staged changes persist in sandbox
|
|
# - Molecule state persists in beads
|
|
# - Hook persists across sessions
|
|
```
|
|
|
|
The new session picks up where the old one left off via `gt prime`.
|
|
|
|
## Session Lifecycle Details
|
|
|
|
Sessions cycle for these reasons:
|
|
|
|
| Trigger | Action | Result |
|
|
|---------|--------|--------|
|
|
| `gt handoff` | Voluntary | Clean cycle to fresh context |
|
|
| Context compaction | Automatic | Forced by Claude Code |
|
|
| Crash/timeout | Failure | Witness respawns |
|
|
| `gt done` | Completion | Session exits, Witness takes over |
|
|
|
|
All except `gt done` result in continued work. Only `gt done` signals completion.
|
|
|
|
## Witness Responsibilities
|
|
|
|
The Witness monitors polecats but does NOT:
|
|
- Force session cycles (polecats self-manage via handoff)
|
|
- Interrupt mid-step (unless truly stuck)
|
|
- Nuke polecats (polecats self-nuke via `gt done`)
|
|
|
|
The Witness DOES:
|
|
- Respawn crashed sessions
|
|
- Nudge stuck polecats
|
|
- Handle escalations
|
|
- Clean up orphaned polecats (crash before `gt done`)
|
|
|
|
## Polecat Identity
|
|
|
|
**Key insight:** Polecat *identity* is long-lived; only sessions and sandboxes are ephemeral.
|
|
|
|
In the HOP model, every entity has a chain (CV) that tracks:
|
|
- What work they've done
|
|
- Success/failure rates
|
|
- Skills demonstrated
|
|
- Quality metrics
|
|
|
|
The polecat *name* (Toast, Shadow, etc.) is a slot from a pool - truly ephemeral.
|
|
But the *agent identity* that executes as that polecat accumulates a work history.
|
|
|
|
```
|
|
POLECAT IDENTITY (persistent) SESSION (ephemeral) SANDBOX (ephemeral)
|
|
├── CV chain ├── Claude instance ├── Git worktree
|
|
├── Work history ├── Context window ├── Branch
|
|
├── Skills demonstrated └── Dies on handoff └── Dies on gt done
|
|
└── Credit for work or gt done
|
|
```
|
|
|
|
This distinction matters for:
|
|
- **Attribution** - Who gets credit for the work?
|
|
- **Skill routing** - Which agent is best for this task?
|
|
- **Cost accounting** - Who pays for inference?
|
|
- **Federation** - Agents having their own chains in a distributed world
|
|
|
|
## Related Documentation
|
|
|
|
- [Understanding Gas Town](understanding-gas-town.md) - Role taxonomy and architecture
|
|
- [Polecat Wisp Architecture](polecat-wisp-architecture.md) - Molecule execution
|
|
- [Propulsion Principle](propulsion-principle.md) - Why work triggers immediate execution
|