{{template "head" "Dataset"}} {{template "nav" "dataset"}}
| Table | Rows | Size |
|---|---|---|
{{.Name}} |
{{fmtInt .Rows}} |
No golden set data available.
| Worker | Generations |
|---|---|
{{.Worker}} |
{{fmtInt .Count}} |
dataset_statsSeed browser coming soon. Use lem export --seeds to explore locally.
| Domain | Count | Avg Gen Time | Coverage |
|---|---|---|---|
{{.Domain}} |
{{.Count}} | {{pct .AvgGenTime}}s |
No domain data available.
| Voice | Count | Avg Chars | Avg Gen Time |
|---|---|---|---|
{{.Voice}} |
{{.Count}} | {{pct .AvgChars}} | {{pct .AvgGenTime}}s |
No voice data available.
dataset_statsExpansion pipeline: use lem expand to generate responses from trained models, then lem score to filter by quality.
Export formats:
| Format | Command | Use |
|---|---|---|
JSONL (MLX) |
lem export --format jsonl |
MLX LoRA training (train/valid/test splits) |
Parquet |
lem export --format parquet |
HuggingFace dataset upload |
CSV |
lem export --format csv |
Spreadsheet analysis |