LEM/scripts at 526150621e18283c5c41e99452f843af69c8476d - Clotho/LEM

forked from lthn/LEM

Snider 526150621e feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00

All 7 phases now pull from pre-distilled responses:

- /Volumes/Data/lem/distilled-for-12b/distilled-4b-all.jsonl (7,544)
- /Volumes/Data/lem/distilled/distilled-1b-p0p5.jsonl (1,404)
- /Volumes/Data/lem/distilled/distilled-1b-golden.jsonl (12,828)
- /Volumes/Data/lem/distilled/distilled-1b-golden-reverse.jsonl (4,183)

4B responses listed first (reverse cascade order), then 1B.
P4/P5 no longer need live teacher distillation.
P6 gets all 15,000 unique 1B golden responses + 6,140 4B.
No data replicated into training/lem/model/ per model size.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
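
The ordering described in the commit message above (4B responses first, then 1B) can be illustrated with a small reader. This is only a sketch, not the code in the `train-12b-p*.py` scripts: the four paths are copied from the commit message, while `CASCADE`, `iter_cascade`, and the choice to skip missing files are assumptions made for illustration. The commit message does not say which phase reads which file or what fields each record carries, so the sketch yields raw JSON objects and leaves that mapping out.

```python
import json
from pathlib import Path
from typing import Iterable, Iterator

# Paths and their order are taken from the commit message: 4B first, then 1B.
# The constant name itself is hypothetical.
CASCADE = [
    "/Volumes/Data/lem/distilled-for-12b/distilled-4b-all.jsonl",
    "/Volumes/Data/lem/distilled/distilled-1b-p0p5.jsonl",
    "/Volumes/Data/lem/distilled/distilled-1b-golden.jsonl",
    "/Volumes/Data/lem/distilled/distilled-1b-golden-reverse.jsonl",
]


def iter_cascade(paths: Iterable[str]) -> Iterator[dict]:
    """Yield JSONL records file by file, preserving the order of `paths`."""
    for path in paths:
        p = Path(path)
        if not p.exists():
            # Assumption: silently skip files that are absent on this machine.
            continue
        with p.open(encoding="utf-8") as fh:
            for line in fh:
                line = line.strip()
                if line:  # ignore blank lines between records
                    yield json.loads(line)
```

Calling `list(iter_cascade(CASCADE))` on a machine that has these files would return the 4B records ahead of every 1B record, because the files are read sequentially rather than interleaved or shuffled.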
ab_test.py	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
augment_ready_stop.py	refactor: apply go fix modernizers for Go 1.26	2026-02-22 21:00:17 +00:00
benchmark_to_scorer.py	refactor: apply go fix modernizers for Go 1.26	2026-02-22 21:00:17 +00:00
chat-4b-base.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
chat-4b-p2.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
chat-4b.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
compare_v1_v2.py	Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary	2026-02-13 13:39:08 +00:00
convert_adapter.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
distill_sandwich.py	refactor: apply go fix modernizers for Go 1.26	2026-02-22 21:00:17 +00:00
distill_seeds.py	refactor: apply go fix modernizers for Go 1.26	2026-02-22 21:00:17 +00:00
eval_adapter.py	refactor: apply go fix modernizers for Go 1.26	2026-02-22 21:00:17 +00:00
export_parquet.py	Add missing HF model cards, sync script, and Parquet export	2026-02-15 00:14:26 +00:00
extract_training.py	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
ingest_benchmarks.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
lek_content_scorer.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
lem_benchmark.py	LEM+LEK	2026-02-12 04:05:28 +00:00
lem_cross_arch_benchmark.py	Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information	2026-02-12 09:07:32 +00:00
lem_cross_arch_train.py	Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information	2026-02-12 09:07:32 +00:00
lem_gemini3_generate.py	Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary	2026-02-13 13:39:08 +00:00
lem_gemini3flash_generate.py	Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary	2026-02-13 13:39:08 +00:00
lem_gemini25flash_generate.py	Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary	2026-02-13 13:39:08 +00:00
lem_generate_pipeline.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
lem_scale_benchmark.py	Benchmark & Findings:	2026-02-12 06:38:46 +00:00
lem_scale_scorer.py	Benchmark & Findings:	2026-02-12 06:38:46 +00:00
lem_scorer.py	LEM+LEK	2026-02-12 04:05:28 +00:00
lem_self_scorer.py	Add cross-architecture training and benchmarking scripts; update README and PAPER with author and repository information	2026-02-12 09:07:32 +00:00
lem_semantic_scorer.py	LEM+LEK	2026-02-12 04:05:28 +00:00
lem_standard_benchmark.py	LEM+LEK	2026-02-12 04:05:28 +00:00
lem_standard_scorer.py	LEM+LEK	2026-02-12 04:05:28 +00:00
lem_train_15k.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
pipeline.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
probes.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
push_all_models.py	Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary	2026-02-13 13:39:08 +00:00
rephrase_probes.py	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
rescore.py	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
restructure_zen.py	refactor: apply go fix modernizers for Go 1.26	2026-02-22 21:00:17 +00:00
run_all_ab.sh	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
run_p100_top5.sh	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
run_phase0.sh	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
run_phase1.sh	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
score_deepseek_sovereignty.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
score_gemma12b.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
score_r1_capability.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
scoring-agent.service	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
scoring_agent.py	feat: add Go lem CLI and scoring-agent scripts	2026-02-15 16:22:13 +00:00
self_distill.py	feat: LEK-1 kernel A/B test — 29 models, P100 validation, curriculum pipeline	2026-02-19 11:32:26 +00:00
sync_hf.py	Add missing HF model cards, sync script, and Parquet export	2026-02-15 00:14:26 +00:00
train-4b-lek.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p0-resume.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p0.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p1.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p2-resume.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p2.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p3.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p4.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p5.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-4b-p6.py	feat: add faithful 12B training scripts (P0-P6) — 1:1 port of 4B curriculum	2026-02-25 20:44:03 +00:00
train-12b-p0.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00
train-12b-p1.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00
train-12b-p2.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00
train-12b-p3.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00
train-12b-p4.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00
train-12b-p5.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00
train-12b-p6.py	feat: rewire 12B scripts to use 4B+1B distilled cascade	2026-02-25 21:13:27 +00:00