Skip to content

Top-level Workflows

Two slash commands in this plugin are orchestrator-only: they do not implement, review, or merge anything themselves — they sequence other skills and agents through their phases, holding the human gates between them.

  • /ship — spec → plan → build → review → PR, end-to-end.
  • /test-modernize — five-phase legacy-test modernization for CD, from assessment to quality-target convergence.

Both follow the same orchestrator contract: delegate every phase to the owning skill or agent, honor human gates, surface ambiguous inputs in one batch up front, and report outcomes concisely.


/ship

File: skills/ship/SKILL.md Role: orchestrator. Use when: the user says "ship this", "take this feature end to end", or wants the spec → plan → build → PR flow without re-assembling it each time.

Steps

# Step Delegates to Human gate after?
1 Approach contract — screen request against knowledge/decision-defaults.md; resolve ambiguous high-reversal-cost axes in one batch. (orchestrator only) yes, if a blocker remains
2 Spec (skipped with --skip-spec) — produce Intent, Architecture, Acceptance Criteria. /specs yes — operator approves the spec
3 Plan — decompose into vertical slices with Gherkin scenarios; a tier-scaled set of plan-review personas (1–5, by plan complexity) runs in parallel before the gate. /plan yes — operator approves the plan
4 Build — RED-GREEN-REFACTOR per slice, inline review checkpoints, verification evidence. Do not proceed until the suite is green. /build no
5 Review — run quality-review agents and let the auto-fix loop converge. Only judgment-call findings escalate to the operator. /code-review no
6 PR — pre-PR quality gate, open PR, arm auto-merge by default (--no-auto-merge to opt out). /pr yes — the PR is the final review artifact
7 Report — PR URL, quality-gate result, whether auto-merge is armed. (orchestrator only)

Agents involved (dispatched by the delegated skills)

/build's inline review checkpoints dispatch the review agents listed in team-structure.md → Review Agent Dispatch. /plan dispatches a tier-scaled subset of the five plan-review personas (prompts/plan-review-*.md) — the Acceptance Test Critic always runs; the rest are added as the plan's tier (trivial/standard/complex) warrants — and the progress-guardian gate-keeper. /code-review re-runs the same review agents over the full changeset.

Arguments

/ship <feature-description> [--skip-spec] [--no-auto-merge]

Notes

  • Sequencing only — every gate, fix loop, and evidence requirement comes from the underlying skills. If any phase stops at a gate, /ship stops with it.
  • For a plan-only pass, use /plan; for build-only, use /build.
  • Resume across sessions with /continue.

/test-modernize

File: skills/test-modernize/SKILL.md Role: orchestrator. Use when: a legacy repository needs its tests modernized for CD — coverage ≥ 90%, zero surviving mutants, full determinism, fastest achievable pre-merge wall-clock.

Each phase writes a progress file to memory/test-modernize/<repo-slug>/phase-<n>.md so /continue and --from-phase <n> can resume.

Phases

0. Approach contract

Orchestrator only. Resolve <repo-path>, the parent issue URL (and the tracker CLI it maps to: gh / az boards / glab / acli, or local-files mode when empty), CI config, external test sources, quality targets, and the test-binding mode for the approved Gherkin (bdd-runner or xunit-with-annotations). Record the choices in memory/test-modernize/<slug>/phase-0.md.

1. Analyze

Assess the repo against CD test-architecture criteria and emit phase issues.

Step Delegates to
1.1 Assessment /cd-test-architecture
1.2 Parent + Phase-1/2/5 child issues /issues-from-assessment
1.3 Phase gate review Agent dev-team:test-modernization-review with --phase 1

Human gate — approval before specifying the public interface.

2. Specify public interface (two-pass)

Two-pass design so Stories never bind to un-reviewed scenarios.

Step Delegates to
2.A Author .feature files for every public surface (API endpoint, UI flow, batch-job entry point, library export, event type) /gherkin-public
2.A.2 Phase gate review Agent dev-team:test-modernization-review with --phase 2
Human gate — operator validates the Gherkin (hard stop; .feature files may be edited in place)
2.B --create-stories — one [Component tests] <component> · <surface> Story per approved surface, scenario→Story map written to gherkin-bindings.json; backfill Phase-1 placeholders /gherkin-public

3. Audit + baseline coverage

Step Delegates to
3.1 Disable cannot-fail tests (skip + tag, never delete) /test-audit-disable
3.2 Capture baseline coverage and post to parent issue / FEATURE.md /coverage-baseline
3.3 Phase gate review Agent dev-team:test-modernization-review with --phase 3

Human gate — baseline accepted before adding tests.

4. Fix disabled tests + add no-refactor tests

For each Phase-4 child issue in dependency order:

Step Delegates to
4.1 Drive RED-GREEN-REFACTOR per Story. [Component tests] Stories bind tests to the cited <feature-file>::<scenario-name> pairs using the Phase-0 binding mode; [Baseline] Stories lock in current behavior at existing seams. /build
4.2 After each Story closes, post Δ vs. baseline AND run scoped mutation against the Story's --story-files (production-code files from /build's commit diff; tests filtered) /coverage-delta --story <id> --story-files <files>
4.2b On status: net_new_survivors, surface halt prompt with three operator actions ([s] strengthen / [f] follow-up — drafts Phase-5 [Strengthen assertions] Story / [w] waive). On status: tool_unavailable, triage with [i] install via /init-dev-team / [k] skip — proceed advisory / [q] quit. (orchestrator — owns the halt; worker exit code is always 0)
4.3 After all Phase-4 Stories close, verify scenario→Story-id map against submitted test code Agent dev-team:test-modernization-review with --phase 4
4.4 End-of-phase test review: dispatch /test-design --since <phase-4-base-sha> AND /code-review --since <phase-4-base-sha>. On error/warning findings, dispatch /apply-fixes; re-run /code-review; loop max 2 iterations before escalating with [r]/[w]/[q]. Evidence persisted to phase-4-review.json. /test-design + /code-review + /apply-fixes

Human gate — Δ-coverage AND Phase-4 mutation results AND phase-4-review.json (any waivers explicit) accepted before any production-code refactor.

5. Refactor-for-testability + converge

For each Phase-5 child issue in dependency order:

Step Delegates to
5.1 Confirm matching [Baseline] Story is closed and green (precondition) (orchestrator only)
5.2 Minimum behavior-preserving refactor + the test the new seam unblocks /build
5.3 Loop until coverage / mutants / determinism / pre-merge wall-clock targets are met (or explicitly waived with reason recorded). Reads mutation-history.json (written by Phase 4) and reuses per-file survivor counts when the file's last commit pre-dates the history entry — re-measures only the gaps. /quality-targets-converge
5.4 Phase gate review Agent dev-team:test-modernization-review with --phase 5
5.5 End-of-phase test review: same loop as 4.4, scoped to <phase-5-base-sha>. Evidence persisted to phase-5-review.json. /test-design + /code-review + /apply-fixes

Human gate — final metrics AND phase-5-review.json accepted (or each gap waived with reason).

6. Report

Final coverage %, surviving mutants, determinism status, pre-merge wall-clock, the parent issue URL (or ./plans/test-modernize/FEATURE.md), the list of PRs /build opened in Phases 4 and 5, and any waived targets with reasons.

Agents involved

  • dev-team:test-modernization-review — phase gate-keeper. Reads each phase deliverable from memory/test-modernize/<repo>/phase-<n>.md and verifies it matches the phase's acceptance criteria before the workflow advances. Process gate-keeper, not a code reviewer — not in the standard review-dispatch fan-out.
  • The review agents dispatched by /build's inline checkpoints (same set /ship uses) — see team-structure.md → Review Agent Dispatch.

Arguments

/test-modernize <repo-path> [--parent <issue-url>] [--ci <path>] [--external-tests <loc>] [--from-phase <n>]

Notes

  • Sequencing only — every gate, fix loop, and evidence requirement comes from the underlying skills.
  • For Phase-1-only analysis without committing to the full workflow, invoke /cd-test-architecture directly.
  • The workflow is identical whether or not a tracker CLI is installed — only the destination of the issues changes (tracker vs. ./plans/test-modernize/).
  • The operator-facing diagram lives at diagrams/test-modernize-flow.svg and is embedded in agent-architecture.md.

Why these are documented together

/ship and /test-modernize are the only two multi-phase pipelines with inter-phase human gates in the plugin. Every other slash command is either a single-step worker (e.g. /coverage-baseline, /triage) or a one-shot orchestrator that returns in a single pass (e.g. /code-review, /test-design). Knowing the phase order, the owning skill or agent for each step, and where the human gates fall is the difference between operating these workflows confidently and re-reading every SKILL.md each time.