Top-level Workflows¶

Two slash commands in this plugin are orchestrator-only: they do not implement, review, or merge anything themselves — they sequence other skills and agents through their phases, holding the human gates between them.

/ship — spec → plan → build → review → PR, end-to-end.
/test-modernize — five-phase legacy-test modernization for CD, from assessment to quality-target convergence.

Both follow the same orchestrator contract: delegate every phase to the owning skill or agent, honor human gates, surface ambiguous inputs in one batch up front, and report outcomes concisely.

`/ship`¶

File: skills/ship/SKILL.md Role: orchestrator. Use when: the user says "ship this", "take this feature end to end", or wants the spec → plan → build → PR flow without re-assembling it each time.

Steps¶

#	Step	Delegates to	Human gate after?
1	Approach contract — screen request against `knowledge/decision-defaults.md`; resolve ambiguous high-reversal-cost axes in one batch.	(orchestrator only)	yes, if a blocker remains
2	Spec (skipped with `--skip-spec`) — produce Intent, Architecture, Acceptance Criteria.	`/specs`	yes — operator approves the spec
3	Plan — decompose into vertical slices with Gherkin scenarios; a tier-scaled set of plan-review personas (1–5, by plan complexity) runs in parallel before the gate.	`/plan`	yes — operator approves the plan
4	Build — RED-GREEN-REFACTOR per slice, inline review checkpoints, verification evidence. Do not proceed until the suite is green.	`/build`	no
5	Review — run quality-review agents and let the auto-fix loop converge. Only judgment-call findings escalate to the operator.	`/code-review`	no
6	PR — pre-PR quality gate, open PR, arm auto-merge by default (`--no-auto-merge` to opt out).	`/pr`	yes — the PR is the final review artifact
7	Report — PR URL, quality-gate result, whether auto-merge is armed.	(orchestrator only)	—

Agents involved (dispatched by the delegated skills)¶

/build's inline review checkpoints dispatch the review agents listed in team-structure.md → Review Agent Dispatch. /plan dispatches a tier-scaled subset of the five plan-review personas (prompts/plan-review-*.md) — the Acceptance Test Critic always runs; the rest are added as the plan's tier (trivial/standard/complex) warrants — and the progress-guardian gate-keeper. /code-review re-runs the same review agents over the full changeset.

Arguments¶

/ship <feature-description> [--skip-spec] [--no-auto-merge]

Notes¶

Sequencing only — every gate, fix loop, and evidence requirement comes from the underlying skills. If any phase stops at a gate, /ship stops with it.
For a plan-only pass, use /plan; for build-only, use /build.
Resume across sessions with /continue.

`/test-modernize`¶

File: skills/test-modernize/SKILL.md Role: orchestrator. Use when: a legacy repository needs its tests modernized for CD — coverage ≥ 90%, zero surviving mutants, full determinism, fastest achievable pre-merge wall-clock.

Each phase writes a progress file to memory/test-modernize/<repo-slug>/phase-<n>.md so /continue and --from-phase <n> can resume.

Phases¶

0. Approach contract¶

Orchestrator only. Resolve <repo-path>, the parent issue URL (and the tracker CLI it maps to: gh / az boards / glab / acli, or local-files mode when empty), CI config, external test sources, quality targets, and the test-binding mode for the approved Gherkin (bdd-runner or xunit-with-annotations). Record the choices in memory/test-modernize/<slug>/phase-0.md.

1. Analyze¶

Assess the repo against CD test-architecture criteria and emit phase issues.

Step	Delegates to
1.1 Assessment	`/cd-test-architecture`
1.2 Parent + Phase-1/2/5 child issues	`/issues-from-assessment`
1.3 Phase gate review	Agent `dev-team:test-modernization-review` with `--phase 1`

Human gate — approval before specifying the public interface.

2. Specify public interface (two-pass)¶

Two-pass design so Stories never bind to un-reviewed scenarios.

Step	Delegates to
2.A Author `.feature` files for every public surface (API endpoint, UI flow, batch-job entry point, library export, event type)	`/gherkin-public`
2.A.2 Phase gate review	Agent `dev-team:test-modernization-review` with `--phase 2`
Human gate — operator validates the Gherkin (hard stop; `.feature` files may be edited in place)
2.B `--create-stories` — one `[Component tests] <component> · <surface>` Story per approved surface, scenario→Story map written to `gherkin-bindings.json`; backfill Phase-1 placeholders	`/gherkin-public`

3. Audit + baseline coverage¶

Step	Delegates to
3.1 Disable cannot-fail tests (skip + tag, never delete)	`/test-audit-disable`
3.2 Capture baseline coverage and post to parent issue / `FEATURE.md`	`/coverage-baseline`
3.3 Phase gate review	Agent `dev-team:test-modernization-review` with `--phase 3`

Human gate — baseline accepted before adding tests.

4. Fix disabled tests + add no-refactor tests¶

For each Phase-4 child issue in dependency order:

Step	Delegates to
4.1 Drive RED-GREEN-REFACTOR per Story. `[Component tests]` Stories bind tests to the cited `<feature-file>::<scenario-name>` pairs using the Phase-0 binding mode; `[Baseline]` Stories lock in current behavior at existing seams.	`/build`
4.2 After each Story closes, post Δ vs. baseline AND run scoped mutation against the Story's `--story-files` (production-code files from `/build`'s commit diff; tests filtered)	`/coverage-delta` `--story <id> --story-files <files>`
4.2b On `status: net_new_survivors`, surface halt prompt with three operator actions (`[s]` strengthen / `[f]` follow-up — drafts Phase-5 `[Strengthen assertions]` Story / `[w]` waive). On `status: tool_unavailable`, triage with `[i]` install via `/init-dev-team` / `[k]` skip — proceed advisory / `[q]` quit.	(orchestrator — owns the halt; worker exit code is always 0)
4.3 After all Phase-4 Stories close, verify scenario→Story-id map against submitted test code	Agent `dev-team:test-modernization-review` with `--phase 4`
4.4 End-of-phase test review: dispatch `/test-design --since <phase-4-base-sha>` AND `/code-review --since <phase-4-base-sha>`. On error/warning findings, dispatch `/apply-fixes`; re-run `/code-review`; loop max 2 iterations before escalating with `[r]`/`[w]`/`[q]`. Evidence persisted to `phase-4-review.json`.	`/test-design` + `/code-review` + `/apply-fixes`

Human gate — Δ-coverage AND Phase-4 mutation results AND phase-4-review.json (any waivers explicit) accepted before any production-code refactor.

5. Refactor-for-testability + converge¶

For each Phase-5 child issue in dependency order:

Step	Delegates to
5.1 Confirm matching `[Baseline]` Story is closed and green (precondition)	(orchestrator only)
5.2 Minimum behavior-preserving refactor + the test the new seam unblocks	`/build`
5.3 Loop until coverage / mutants / determinism / pre-merge wall-clock targets are met (or explicitly waived with reason recorded). Reads `mutation-history.json` (written by Phase 4) and reuses per-file survivor counts when the file's last commit pre-dates the history entry — re-measures only the gaps.	`/quality-targets-converge`
5.4 Phase gate review	Agent `dev-team:test-modernization-review` with `--phase 5`
5.5 End-of-phase test review: same loop as 4.4, scoped to `<phase-5-base-sha>`. Evidence persisted to `phase-5-review.json`.	`/test-design` + `/code-review` + `/apply-fixes`

Human gate — final metrics AND phase-5-review.json accepted (or each gap waived with reason).

6. Report¶

Final coverage %, surviving mutants, determinism status, pre-merge wall-clock, the parent issue URL (or ./plans/test-modernize/FEATURE.md), the list of PRs /build opened in Phases 4 and 5, and any waived targets with reasons.

Agents involved¶

dev-team:test-modernization-review — phase gate-keeper. Reads each phase deliverable from memory/test-modernize/<repo>/phase-<n>.md and verifies it matches the phase's acceptance criteria before the workflow advances. Process gate-keeper, not a code reviewer — not in the standard review-dispatch fan-out.
The review agents dispatched by /build's inline checkpoints (same set /ship uses) — see team-structure.md → Review Agent Dispatch.

Arguments¶

/test-modernize <repo-path> [--parent <issue-url>] [--ci <path>] [--external-tests <loc>] [--from-phase <n>]

Notes¶

Sequencing only — every gate, fix loop, and evidence requirement comes from the underlying skills.
For Phase-1-only analysis without committing to the full workflow, invoke /cd-test-architecture directly.
The workflow is identical whether or not a tracker CLI is installed — only the destination of the issues changes (tracker vs. ./plans/test-modernize/).
The operator-facing diagram lives at diagrams/test-modernize-flow.svg and is embedded in agent-architecture.md.

Why these are documented together¶

/ship and /test-modernize are the only two multi-phase pipelines with inter-phase human gates in the plugin. Every other slash command is either a single-step worker (e.g. /coverage-baseline, /triage) or a one-shot orchestrator that returns in a single pass (e.g. /code-review, /test-design). Knowing the phase order, the owning skill or agent for each step, and where the human gates fall is the difference between operating these workflows confidently and re-reading every SKILL.md each time.

Top-level Workflows¶

/ship¶

Steps¶

Agents involved (dispatched by the delegated skills)¶

Arguments¶

Notes¶

/test-modernize¶

Phases¶

0. Approach contract¶

1. Analyze¶

2. Specify public interface (two-pass)¶

3. Audit + baseline coverage¶

4. Fix disabled tests + add no-refactor tests¶

5. Refactor-for-testability + converge¶

6. Report¶

Agents involved¶

Arguments¶

Notes¶

Why these are documented together¶

`/ship`¶

`/test-modernize`¶