Charon/LEM

forked from lthn/LEM

Commit graph

Author

SHA1

Message

Date

Charon

b8f9191b05

Add missing HF model cards, sync script, and Parquet export

- Add 4 missing model cards: Gemma3-1B-layered (v1+v2), Gemma3-27B, GPT-OSS-20B
- All 9 HF models now have cards in paper/hf-cards/
- sync_hf.py: push cards + benchmarks + training data to HuggingFace
- export_parquet.py: convert JSONL training splits to Parquet (HF dataset format)
- Parquet schema: prompt, response, system, messages (JSON)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-14 23:50:18 +00:00
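The Parquet schema named in this commit (prompt, response, system, messages as JSON) can be illustrated with a minimal sketch. It is not the contents of export_parquet.py, which this log does not show. It assumes each JSONL line is an object carrying a chat-style `messages` list of `{role, content}` entries; the helper names `to_row`, `jsonl_to_rows`, and `write_parquet` are hypothetical, and the write step assumes `pyarrow` is installed.

```python
import json

# Column order taken from the commit message: prompt, response, system, messages (JSON).
COLUMNS = ("prompt", "response", "system", "messages")


def to_row(record: dict) -> dict:
    """Flatten one JSONL record into the four-column schema.

    Assumption: the record holds a "messages" list of {"role", "content"} dicts.
    Top-level "prompt"/"response"/"system" keys, if present, take precedence.
    """
    messages = record.get("messages", [])

    def first(role: str) -> str:
        # First message content for the given role, or "" if there is none.
        return next((m.get("content", "") for m in messages if m.get("role") == role), "")

    return {
        "prompt": record.get("prompt") or first("user"),
        "response": record.get("response") or first("assistant"),
        "system": record.get("system") or first("system"),
        # The full conversation is stored as a JSON string, as the schema says.
        "messages": json.dumps(messages, ensure_ascii=False),
    }


def jsonl_to_rows(lines) -> list:
    """Parse an iterable of JSONL lines, skipping blank ones."""
    return [to_row(json.loads(line)) for line in lines if line.strip()]


def write_parquet(rows: list, path: str) -> None:
    """Write the rows to a Parquet file (requires pyarrow; imported lazily)."""
    import pyarrow as pa
    import pyarrow.parquet as pq

    pq.write_table(pa.Table.from_pylist(rows), path)
```

A typical call would be `write_parquet(jsonl_to_rows(open("train.jsonl")), "train.parquet")`, where both file names are placeholders. Keeping `messages` as a JSON string rather than a nested column keeps the schema flat, at the cost of a `json.loads` on read.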

Charon

e021b6beb0

Add generation worker: gold (15K) + expansion (46K) with InfluxDB coordination

Includes both generation scripts, prompts data, setup script, and worker
instructions in README. Workers auto-coordinate via InfluxDB so multiple
machines can generate in parallel without duplicating work.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-14 22:46:51 +00:00

Snider

8e5f082f30

LEM+LEK

2026-02-12 04:05:28 +00:00

3 commits