LEM/scripts
Snider 0923a08a7d feat: add OG base data as 3rd variant to 12B P0-P3 training scripts
Reverse cascade order: 4B (largest teacher) → 1B (graduated) → OG (base).
Three perspectives per prompt — cymatic cascading from expanded Q/K to modal primitives.

P0/P2: 404×3 = 1,212 (sandwich format, OG from lesson-lem1b.jsonl)
P1: 209×3 = 627 (OG from zen/golden multi-turn lessons)
P3: 225×3 = 675 (OG from western-fresh + russian-bridge + composure)
P4-P6: unchanged (no separate OG file — live distilled)

Co-Authored-By: Virgil <virgil@lethean.io>
2026-02-25 21:26:01 +00:00
..
ab_test.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
augment_ready_stop.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
benchmark_to_scorer.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
chat-4b-base.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
chat-4b-p2.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
chat-4b.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
compare_v1_v2.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
convert_adapter.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
distill_sandwich.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
distill_seeds.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
eval_adapter.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
export_parquet.py Add missing HF model cards, sync script, and Parquet export 2026-02-15 00:14:26 +00:00
extract_training.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
ingest_benchmarks.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
lek_content_scorer.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
lem_benchmark.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_cross_arch_benchmark.py Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
lem_cross_arch_train.py Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
lem_gemini3_generate.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
lem_gemini3flash_generate.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
lem_gemini25flash_generate.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
lem_generate_pipeline.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
lem_scale_benchmark.py Benchmark & Findings: 2026-02-12 06:38:46 +00:00
lem_scale_scorer.py Benchmark & Findings: 2026-02-12 06:38:46 +00:00
lem_scorer.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_self_scorer.py Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
lem_semantic_scorer.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_standard_benchmark.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_standard_scorer.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_train_15k.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
pipeline.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
probes.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
push_all_models.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
rephrase_probes.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
rescore.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
restructure_zen.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
run_all_ab.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
run_p100_top5.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
run_phase0.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
run_phase1.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
score_deepseek_sovereignty.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
score_gemma12b.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
score_r1_capability.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
scoring-agent.service feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
scoring_agent.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
self_distill.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
sync_hf.py Add missing HF model cards, sync script, and Parquet export 2026-02-15 00:14:26 +00:00
train-4b-lek.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p0-resume.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p0.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p1.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p2-resume.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p2.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p3.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p4.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p5.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p6.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-12b-p0.py feat: add OG base data as 3rd variant to 12B P0-P3 training scripts 2026-02-25 21:26:01 +00:00
train-12b-p1.py feat: add OG base data as 3rd variant to 12B P0-P3 training scripts 2026-02-25 21:26:01 +00:00
train-12b-p2.py feat: add OG base data as 3rd variant to 12B P0-P3 training scripts 2026-02-25 21:26:01 +00:00
train-12b-p3.py feat: add OG base data as 3rd variant to 12B P0-P3 training scripts 2026-02-25 21:26:01 +00:00
train-12b-p4.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p5.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p6.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00