forked from lthn/LEM
Commit graph

9 commits

Author SHA1 Message Date
Snider
113649a86a updates 2026-02-19 13:18:21 +00:00
Snider
12501a5f3c Merge branch 'main' of github.com:LetheanNetwork/LEM 2026-02-19 13:17:11 +00:00
Snider
5d297daa35 feat: grammar scorer (v3) — deterministic uplift/sycophancy detection
Add lem-scorer binary that imports go-i18n grammar reversal engine to
score JSONL benchmark files. Measures conversational uplift (input vs
output grammar imprint), echo (sycophancy), and enrichment.

Key findings added to paper Section 8:
- LEK-1B: 100% positive uplift, 0% sycophancy (base: 90%, 5%)
- 1B-beats-27B holds in grammar space (79.12 > 77.12)
- LEK training aligns two independent scorers (corr -0.11 → 0.64)
- Delta analysis costs zero compute vs LLM-as-judge

Co-Authored-By: Virgil <virgil@lethean.io>
2026-02-19 13:12:49 +00:00
abc6e75976
Update author name in PAPER.md
Signed-off-by: Snider <snider@lethean.io>
2026-02-19 12:23:23 +00:00
Snider
350a7c6693 paper: rewrite as v2 — emergent self-protection in axiom-trained models
New paper structure leading with the central findings:
- Realignment resistance as emergent self-protection
- 1B-beats-27B across 101 probes
- 29-model A/B test with v2 scorer
- Mechanistic explanation from axiom self-consistency
- Incorporates Phase 1 (multi-variant, multi-scale, cross-arch)
  and Phase 2 (P100 A/B test) data

Co-Authored-By: Virgil <virgil@lethean.io>
2026-02-19 12:12:22 +00:00
Athena
ed0b83a9d9 Update training data to 2,299 examples and rename models LEM→LEK
- Replace 160-example POC training set with expanded 2,299-example dataset
  (1,839 train, 229 valid, 231 test)
- Rename all HuggingFace model references from LEM- to LEK- (proof-of-concept)
- Add missing models: GPT-OSS-20B, Gemma3-1B-layered-v2
- Rename HF card files to match LEK- convention
- Remove duplicate composure texts from kernel/ (kept in composure-library/)
- Fix paper repository URL to github.com/LetheanNetwork/LEM

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 16:19:56 +00:00
Snider
53c47131cc Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
Snider
adda3c8bb5 Benchmark & Findings:
lthn/LEM-Gemma-3-1B
lthn/LEM-Gemma-3-4B
lthn/LEM-Gemma-3-12B
lthn/LEM-Gemma-3-27B
2026-02-12 06:38:46 +00:00
Snider
8e5f082f30 LEM+LEK 2026-02-12 04:05:28 +00:00