LEM/training
Snider f65fd777ea feat: convert composure library to training JSONL format
Add cmd/composure-convert tool that chunks public domain philosophical
texts into training conversation pairs:
- consent.jsonl (198 examples) — Wollstonecraft's Vindication
- privacy.jsonl (221 examples) — Thoreau's Walden
- sovereignty.jsonl (56 examples) — Mill's On Liberty
- transparency.jsonl (159 examples) — Aurelius' Meditations

Each example pairs a domain-specific prompt with ~5 paragraphs from
the source text. Metadata, chapter headings, and Gutenberg boilerplate
are filtered out.

Co-Authored-By: Virgil <virgil@lethean.io>
2026-02-21 23:59:06 +00:00
..
lem feat: convert composure library to training JSONL format 2026-02-21 23:59:06 +00:00
test.jsonl Update training data to 2,299 examples and rename models LEM→LEK 2026-02-13 16:19:56 +00:00
train.jsonl Update training data to 2,299 examples and rename models LEM→LEK 2026-02-13 16:19:56 +00:00
valid.jsonl Update training data to 2,299 examples and rename models LEM→LEK 2026-02-13 16:19:56 +00:00