Benchmark suite (testing.B), parallel slots via go-inference, flash attention manual comparison. Co-Authored-By: Virgil <virgil@lethean.io> |
||
|---|---|---|
| .. | ||
| plans | ||
Benchmark suite (testing.B), parallel slots via go-inference, flash attention manual comparison. Co-Authored-By: Virgil <virgil@lethean.io> |
||
|---|---|---|
| .. | ||
| plans | ||