Add lem-scorer binary that imports go-i18n grammar reversal engine to score JSONL benchmark files. Measures conversational uplift (input vs output grammar imprint), echo (sycophancy), and enrichment. Key findings added to paper Section 8: - LEK-1B: 100% positive uplift, 0% sycophancy (base: 90%, 5%) - 1B-beats-27B holds in grammar space (79.12 > 77.12) - LEK training aligns two independent scorers (corr -0.11 → 0.64) - Delta analysis costs zero compute vs LLM-as-judge Co-Authored-By: Virgil <virgil@lethean.io> |
||
|---|---|---|
| .. | ||
| hf-cards | ||
| 27b-curriculum-design.md | ||
| ADDENDUM-NATIVE-MLX.md | ||
| PAPER.md | ||