1
0
Fork 0
forked from lthn/LEM
Commit graph

3 commits

Author SHA1 Message Date
b8f9191b05
Add missing HF model cards, sync script, and Parquet export
- Add 4 missing model cards: Gemma3-1B-layered (v1+v2), Gemma3-27B, GPT-OSS-20B
- All 9 HF models now have cards in paper/hf-cards/
- sync_hf.py: push cards + benchmarks + training data to HuggingFace
- export_parquet.py: convert JSONL training splits to Parquet (HF dataset format)
- Parquet schema: prompt, response, system, messages (JSON)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-14 23:50:18 +00:00
Charon
e021b6beb0
Add generation worker: gold (15K) + expansion (46K) with InfluxDB coordination
Includes both generation scripts, prompts data, setup script, and worker
instructions in README. Workers auto-coordinate via InfluxDB so multiple
machines can generate in parallel without duplicating work.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-14 22:46:51 +00:00
Snider
8e5f082f30 LEM+LEK 2026-02-12 04:05:28 +00:00