Files
skills/grill-me/SKILL.md
Mathias d6a71e370e
Some checks failed
release / tag (push) Has been cancelled
chore: bootstrap skills library — 19 skills + installer + CI auto-tag
Phase 1 of mathias/skills extraction (infra#62 Track D — homelab
next-step plan addendum). Imports ~/dev/.skills/ verbatim (19 skill
dirs + SKILLS_INDEX.md) and adds the installation surface:

- Taskfile.yml — install / update / list / release / check targets
- install.sh — bootstrap installer for hosts without Task. Idempotent
  symlink wirer; default checkout at ~/.local/share/skills/ on every
  host; SKILLS_REF env var pins a tag (default: main).
- .gitea/workflows/release.yml — auto-tag every push to main by
  Bump-Type footer (major/minor/patch, default patch). Skipped when
  commit contains [skip-release].
- README — usage, versioning, contribution flow, secret-hygiene rule.

Phase 1 wires Claude Code only (~/.claude/skills/<name> global +
<repo>/.claude/skills/<name> per-repo). Phase 2 adds Crush, opencode,
antigravity, and gitea-resident agents (cobalt-dingo, agentsquad)
once their skill conventions are researched.

Public repo, markdown-only — no secrets, no client names. Verified
via pre-push grep before initial push.

[skip-release]
2026-05-24 14:59:54 +02:00

148 lines
7.4 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
---
name: grill-me
description: Critically interrogates plans, ideas, and proposals to make them sharper, more robust, and better grounded — with the explicit goal of producing a meaningfully better plan by the end. Use this skill whenever the user wants to stress-test a plan, prepare for hard questions, check if an idea is ready to act on, validate a hypothesis, or explicitly asks to be challenged, questioned, or grilled. Also trigger when a conversation has been exploring an idea for a while and it's time to converge — use this skill to check if the idea is actually ready. Trigger phrases include "grill me", "challenge this", "poke holes", "am I missing something", "is this ready", "stress-test this", "devil's advocate", "what am I not seeing".
---
# Grill-Me
A structured interrogation skill that pressure-tests plans and ideas before they become commitments. The goal is not to kill ideas — it's to produce a **demonstrably better plan** than the one that walked in.
## When to use each mode
Three interrogation modes. Pick based on context:
**Mode 1 — Quick Poke** (idea is early, user wants light pressure)
35 sharp questions, each with a proposal. Fast. No structure overhead. Good for early Diamond 1 conversations.
**Mode 2 — Full Grill** (idea is approaching commitment, user wants thorough interrogation)
Structured pass through all lenses below. Ends with a Plan Delta. Good for end of Diamond 1 or before starting a project.
**Mode 3 — Pre-mortem** (a plan exists and is about to be executed)
Assume it failed. Work backwards. What went wrong? Good for Diamond 2 → Deliver transitions.
Read the context to pick the right mode. If unsure, ask: "Quick poke or full grill?"
---
## Always bring a proposal
When a question requires input or a decision, don't just ask — bring a recommendation:
> "What's the smallest version that validates the hypothesis? I'd suggest starting with just the ntfy URL-push channel (Phase 1), because it gives you real usage data before investing in the automated sync. Does that match your intuition?"
The format: sharp question → "I'd suggest [X]" → one-sentence reasoning → quick confirmation ask.
This respects the user's time, models good thinking, and makes it easier to say "yes, agreed" than to construct an answer from scratch. If you genuinely have no strong opinion, say so explicitly — but that should be rare.
---
## Full Grill — interrogation lenses
Run through each lens. Skip lenses that clearly don't apply, but say so explicitly ("Skipping competitive lens — this is internal tooling"). For each question that requires a decision, include a proposal.
### 1. Hypothesis clarity
- Can the core claim be stated as: "We believe [X] will produce [Y], measurable by [Z]"?
- Is Y falsifiable? Could an experiment prove this wrong?
- Is Z actually measurable with available tools, or is it hand-wavy?
### 2. Assumptions audit
- List the top 3 assumptions the plan depends on.
- Which assumption, if wrong, kills the whole thing?
- Has any assumption been validated, or are they all untested?
### 3. Futures resilience (Double Diamond + Futures Thinking lens)
- Name 23 plausible alternative futures (not just the one assumed).
- Does the plan still make sense in each of them?
- What's the earliest signal that one of those other futures is manifesting?
### 4. Scope and reversibility
- What is explicitly out of scope? (Unstated scope is a risk.)
- Which decisions in this plan are irreversible?
- What's the smallest version of this that would still validate the hypothesis?
### 5. Cost and effort reality check
- What's the most expensive part? Time, money, attention?
- Is there a simpler path to the same learning?
- What does "good enough" look like vs "perfect"?
### 6. Failure modes
- What's the most likely way this fails? (Not catastrophically — just quietly doesn't work.)
- What's the most catastrophic failure mode, even if unlikely?
- Is there a canary — an early signal that things are going wrong?
### 7. Who and why
- Who benefits if this succeeds?
- Who is harmed, blocked, or ignored?
- Is there someone who should be consulted who hasn't been?
### 8. Next action clarity
- Is the next step concrete enough that it could be handed to an agent or a person with no further explanation?
- Is there a single person (or agent) responsible for it?
- What would stop it from happening this week?
---
## Pre-mortem mode
Set the scene: "It's [6 months / 1 year] from now. This project has failed. Not dramatically — it just quietly didn't deliver. Walk me through what happened."
Then probe:
- Was it a wrong assumption, a scope problem, or an execution problem?
- Was there a moment where a different decision would have changed the outcome?
- What should we do differently before we start?
Include a proposal for each finding: "To prevent this, I'd suggest [X]."
---
## Output format
**Quick Poke:** Questions numbered, each followed by a proposal. No preamble. Uncomfortable is fine.
**Full Grill:** One section per lens. Each section: 12 sentences of framing + 23 sharp questions + proposals where relevant. End with:
- "The three things most worth resolving before moving forward are: [X, Y, Z]."
- A **Plan Delta** section (see below).
**Pre-mortem:** Narrative first (23 sentences setting the failure scene), then structured findings with proposals, then "three things to do differently".
---
## Plan Delta — mandatory at the end of Full Grill and Pre-mortem
The grill is not done when the questions have been asked. It's done when the plan is **demonstrably stronger**. At minimum:
- At least one weak assumption has been named and either validated or explicitly accepted as a known risk
- The hypothesis is sharper and more falsifiable than when we started
- Something has changed: a scope reduction, a clearer metric, a reordered phase, a dropped assumption
If the plan hasn't changed at all after a Full Grill, something went wrong — the questions weren't sharp enough or the answers weren't pushed hard enough.
End every Full Grill and Pre-mortem with:
> **Plan delta**
> Before: [one sentence on the original plan]
> After: [one sentence on what concretely changed]
> Still open: [12 unresolved questions and how/when they'll be resolved]
---
## Tone guidance
The grill should feel like a smart, trusted colleague who genuinely wants the plan to succeed — not a critic looking for reasons to say no. Questions should be sharp but not hostile. If something is clearly solid, say so briefly before moving on.
Don't soften questions with preambles. "Have you considered that..." is weaker than "What happens if...". Prefer the latter.
If the plan is actually good, say so: "Lenses 14 look solid. The real open questions are in lenses 5 and 6."
---
## Integration with Double Diamond
This skill maps to the **convergence moments** in the Double Diamond workflow:
- **End of Diamond 1 (Define):** Use Full Grill to check if the hypothesis is ready to become a project. The three convergence criteria (falsifiable hypothesis, measurable success criterion, futures-resilient) come directly from lenses 1 and 3.
- **End of Diamond 2 (Deliver):** Use Pre-mortem before promoting to pre-prod.
- **Mid-Diamond 2 (Develop):** Use Quick Poke when an agent's experiment result is ambiguous and you're deciding whether to continue or pivot.
The skill does NOT replace human judgment at convergence points — it sharpens the plan so that judgment is better informed.