1
0
Fork 0
forked from lthn/LEM
LEM/scripts
Snider 526150621e feat: rewire 12B scripts to use 4B+1B distilled cascade
All 7 phases now pull from pre-distilled responses:
- /Volumes/Data/lem/distilled-for-12b/distilled-4b-all.jsonl (7,544)
- /Volumes/Data/lem/distilled/distilled-1b-p0p5.jsonl (1,404)
- /Volumes/Data/lem/distilled/distilled-1b-golden.jsonl (12,828)
- /Volumes/Data/lem/distilled/distilled-1b-golden-reverse.jsonl (4,183)

4B responses listed first (reverse cascade order), then 1B.
P4/P5 no longer need live teacher distillation.
P6 gets all 15,000 unique 1B golden responses + 6,140 4B.
No data replicated into training/lem/model/ per model size.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-25 21:13:27 +00:00
..
ab_test.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
augment_ready_stop.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
benchmark_to_scorer.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
chat-4b-base.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
chat-4b-p2.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
chat-4b.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
compare_v1_v2.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
convert_adapter.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
distill_sandwich.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
distill_seeds.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
eval_adapter.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
export_parquet.py Add missing HF model cards, sync script, and Parquet export 2026-02-15 00:14:26 +00:00
extract_training.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
ingest_benchmarks.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
lek_content_scorer.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
lem_benchmark.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_cross_arch_benchmark.py Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
lem_cross_arch_train.py Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
lem_gemini3_generate.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
lem_gemini3flash_generate.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
lem_gemini25flash_generate.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
lem_generate_pipeline.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
lem_scale_benchmark.py Benchmark & Findings: 2026-02-12 06:38:46 +00:00
lem_scale_scorer.py Benchmark & Findings: 2026-02-12 06:38:46 +00:00
lem_scorer.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_self_scorer.py Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information 2026-02-12 09:07:32 +00:00
lem_semantic_scorer.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_standard_benchmark.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_standard_scorer.py LEM+LEK 2026-02-12 04:05:28 +00:00
lem_train_15k.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
pipeline.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
probes.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
push_all_models.py Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary 2026-02-13 13:39:08 +00:00
rephrase_probes.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
rescore.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
restructure_zen.py refactor: apply go fix modernizers for Go 1.26 2026-02-22 21:00:17 +00:00
run_all_ab.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
run_p100_top5.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
run_phase0.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
run_phase1.sh feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
score_deepseek_sovereignty.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
score_gemma12b.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
score_r1_capability.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
scoring-agent.service feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
scoring_agent.py feat: add Go lem CLI and scoring-agent scripts 2026-02-15 16:22:13 +00:00
self_distill.py feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline 2026-02-19 11:32:26 +00:00
sync_hf.py Add missing HF model cards, sync script, and Parquet export 2026-02-15 00:14:26 +00:00
train-4b-lek.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p0-resume.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p0.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p1.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p2-resume.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p2.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p3.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p4.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p5.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-4b-p6.py feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum 2026-02-25 20:44:03 +00:00
train-12b-p0.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p1.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p2.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p3.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p4.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p5.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00
train-12b-p6.py feat: rewire 12B scripts to use 4B+1B distilled cascade 2026-02-25 21:13:27 +00:00