Snider
5d297daa35
feat: grammar scorer (v3) — deterministic uplift/sycophancy detection
Add lem-scorer binary that imports go-i18n grammar reversal engine to
score JSONL benchmark files. Measures conversational uplift (input vs
output grammar imprint), echo (sycophancy), and enrichment.
Key findings added to paper Section 8:
- LEK-1B: 100% positive uplift, 0% sycophancy (base: 90%, 5%)
- 1B-beats-27B holds in grammar space (79.12 > 77.12)
- LEK training aligns two independent scorers (corr -0.11 → 0.64)
- Delta analysis costs zero compute vs LLM-as-judge
Co-Authored-By: Virgil <virgil@lethean.io>