forked from lthn/LEM
LEM/paper
Snider 5d297daa35 feat: grammar scorer (v3) — deterministic uplift/sycophancy detection
Add lem-scorer binary that imports go-i18n grammar reversal engine to
score JSONL benchmark files. Measures conversational uplift (input vs
output grammar imprint), echo (sycophancy), and enrichment.

Key findings added to paper Section 8:
- LEK-1B: 100% positive uplift, 0% sycophancy (base model: 90% and 5%)
- 1B-beats-27B holds in grammar space (79.12 > 77.12)
- LEK training aligns two independent scorers (corr -0.11 → 0.64)
- Delta analysis costs zero compute vs LLM-as-judge

Co-Authored-By: Virgil <virgil@lethean.io>
2026-02-19 13:12:49 +00:00
hf-cards Add missing HF model cards, sync script, and Parquet export 2026-02-15 00:14:26 +00:00
27b-curriculum-design.md feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
ADDENDUM-NATIVE-MLX.md feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
PAPER.md feat: grammar scorer (v3) — deterministic uplift/sycophancy detection 2026-02-19 13:12:49 +00:00