hyperguild

mathias/hyperguild

Fork 0

Files

History

Mathias 2b7bbe38c7

CI / Lint / Test / Vet (push) Successful in 11s

Details

CI / Mirror to GitHub (push) Successful in 4s

Details

docs(eval): record M4 + M4b scorer runs — phase 2 gate cleared (infra#72)

Tier-weighted retrieval against the qa-2026-05.md 20-question set:

| run                            | top-1 | top-3 |
|--------------------------------|-------|-------|
| baseline (pre-phase-1)         | 20%   | 65%   |
| post phase 1 (parser+content)  | 20%   | 70%   |
| post M4 (tier weighting)       | 30%   | 75%   |
| post M4b (entities → K tier)   | 35%   | 80%   |

Net Phase 2 lift: +15pt top-1, +15pt top-3 — comfortably above the
≥10pt close-gate set in infra#72.

Three remaining misses are content-keyword issues, not structure
issues (the questions don't share enough lexical surface with the
target entries to surface via BM25 alone). Vector search would
help here but the iguana embedder is off-mesh (see infra#64).

2026-05-25 18:51:29 +02:00

eval

docs(eval): record M4 + M4b scorer runs — phase 2 gate cleared (infra#72)

2026-05-25 18:51:29 +02:00

raw

test: phase 1 integration smoke test passing

2026-04-17 21:18:08 +02:00

sessions

feat: add protocols.md, retrospective discipline, and brain directory structure

2026-04-17 20:49:56 +02:00

training-data

feat: add protocols.md, retrospective discipline, and brain directory structure

2026-04-17 20:49:56 +02:00

wiki

test: phase 1 integration smoke test passing

2026-04-17 21:18:08 +02:00

schema.md

feat(pipeline): update system prompt for new LLM JSON contract (no slugs)

2026-04-23 19:45:21 +02:00