LEM/training/lem at 12e15ae7e97338dcbb22a04e9245f7cab1e0743a - lthn/LEM

lthn/LEM

Template

History

Snider 74ef174ec8 feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum Exact reproduction of all 7 CL-BPL phases for Gemma3-12B: - P0: LEK sandwich ethics (400 iters, LR 2e-5) - P1: Zen composure (300 iters, LR 1e-5) - P2: LEK sandwich reinforcement (300 iters, LR 1e-5) - P3: Freeflow multi-source (300 iters, LR 1e-5) - P4: 1B teacher tension distillation (300 iters, LR 1e-5) - P5: 1B teacher creative distillation (300 iters, LR 1e-5) - P6: Golden set graduation (13479 iters, LR 1e-5) Only model-size differences from 4B: 48GB/12GB Metal limits, 24 LoRA layers (vs 16), 12B base model path. All phases score at checkpoint cadence via lem-scorer. Previous wrong 12B models preserved as -no-axioms control group. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>		2026-02-25 20:44:03 +00:00
..
books	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00
composure	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00
creative	feat: add training data to repo + make paths repo-relative	2026-02-21 23:49:12 +00:00
deepseek-western-soak	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00
ethics	lems configs	2026-02-23 04:38:37 +00:00
eval	feat: add training data to repo + make paths repo-relative	2026-02-21 23:49:12 +00:00
model/gemma3	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
russian-bridge	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00
tension	feat: add training data to repo + make paths repo-relative	2026-02-21 23:49:12 +00:00
validation	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00
western-fresh	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00
zen	feat(distill): add Metal memory limit config fields	2026-02-22 17:59:11 +00:00