Runs the same prompts through the baseline and fine-tuned models, scores both with the heuristic scorer, and outputs a comparison report with LEK score deltas and improvement/regression counts. Uses the built-in content probes by default, or a custom prompts file.

Co-Authored-By: Virgil <virgil@lethean.io>
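The comparison step described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code in this commit: the `Report` type, the `Compare` function, and the sample scores are all hypothetical, and the real heuristic scorer and LEK score computation are not shown.

```go
package main

import (
	"errors"
	"fmt"
)

// Report summarises how fine-tuned scores differ from baseline scores.
// Hypothetical type for illustration; not taken from the ml package.
type Report struct {
	Improved  int     // prompts where the fine-tuned score is higher
	Regressed int     // prompts where the fine-tuned score is lower
	Unchanged int     // prompts where both scores are equal
	MeanDelta float64 // average of (tuned - baseline) over all prompts
}

// Compare pairs per-prompt scores by index and tallies the deltas.
// It assumes both slices were produced from the same prompts in the same order.
func Compare(baseline, tuned []float64) (Report, error) {
	if len(baseline) != len(tuned) {
		return Report{}, errors.New("score slices differ in length")
	}
	var r Report
	var sum float64
	for i := range baseline {
		delta := tuned[i] - baseline[i]
		sum += delta
		switch {
		case delta > 0:
			r.Improved++
		case delta < 0:
			r.Regressed++
		default:
			r.Unchanged++
		}
	}
	if len(baseline) > 0 {
		r.MeanDelta = sum / float64(len(baseline))
	}
	return r, nil
}

func main() {
	// Placeholder scores; a real run would get these from the heuristic scorer.
	r, err := Compare([]float64{1, 2, 3, 4}, []float64{3, 2, 2, 5})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("improved=%d regressed=%d unchanged=%d mean delta=%+.2f\n",
		r.Improved, r.Regressed, r.Unchanged, r.MeanDelta)
}
```

Pairing scores by index keeps the sketch short. A report keyed by prompt ID would be more robust if the two runs could return results in different orders.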
7 lines
88 B
Go
//go:build darwin && arm64

package ml

func init() {
	mlCmd.AddCommand(benchmarkCmd)
}