hyperguild/brain/eval/qa-2026-05.md

# Brain retrieval eval set — 2026-05-24

20 hand-authored Q→expected-top-1-slug pairs. Used by `score.sh` to
measure brain_query top-1 + top-3 hit rate against the live brain.

Authoring rules:
- Each question maps to **one** clear-best entry. Avoid ambiguous
  questions where multiple slugs could be the right answer.
- Questions are phrased the way a future-me would actually ask, not
  the way the entry's title reads. Some lexical distance is the point.
- `expected` is the slug as stored in `brain_entities.slug`. Update
  if the slug renames.

## Pairs

```
q: how do I stop dex from logging users out on every pod restart?
expected: dex-in-memory-storage-wipes-oauth-tokens-on-every-pod-restart

q: my postgres-exporter broke after revoking PUBLIC CONNECT — why?
expected: postgres-least-privilege-migration-tenant-grant-bypass-2026-05

q: when is a NodePort acceptable vs needing a public ingress with bearer gate?
expected: homelab-network-perimeter-model

q: what does container exit code 255 with reason Unknown mean?
expected: exit-255-unknown-reason-not-oom

q: can gitea push-mirror create the github repo automatically?
expected: gitea-push-mirror-cannot-create-remote-repo-needs-pre-existing-github-repo

q: a flux kustomization is stuck after I removed a resource — why?
expected: flux-healthcheck-stale-on-resource-removal

q: the bytes buffer aliasing trap with Reset in a loop — what's the bug?
expected: go-bytes-buffer-bytes-reset-aliasing-trap

q: what are the homelab architecture principles from may 2026?
expected: homelab-architecture-principles-2026-05

q: where does the sops age private key live in the cluster?
expected: 2026-05-04-sops-age-key-from-flux-cluster

q: why do my grafana dashboards disappear after a pod restart?
expected: grafana-dashboards-as-code-not-ui-state

q: what is the double diamond methodology?
expected: double-diamond-methodology

q: my MCP server works from claude code but fails on claude.ai — what's different?
expected: 2026-05-04-mcp-transport-version-claude-ai-strict

q: how should I rate security findings — isolated bugs or exploit chains?
expected: homelab-security-chains-not-bugs

q: how should canonical context files relate to derived adapter files?
expected: 2026-05-03-canonical-vs-derived-context-flow

q: what is the homelab core vocabulary glossary?
expected: homelab-core-glossary

q: which models on koala llama-swap actually emit native tool_calls correctly?
expected: koala-llama-swap-native-tool-calls-survey-2026-05

q: what is qwen35-9b-fast and what's it used for?
expected: qwen35-9b-fast

q: in go, how do I prevent defer body close from silently dropping errors?
expected: go-defer-errcheck-body-close

q: what was the level 3 rewrite of hyperguild's ingestion pipeline?
expected: hyperguild-level3-pipeline-rewrite

q: what's the new-project ADR — is it gitea-first or github-first?
expected: adr-new-project-gitea-first-github-mirror
```