5 tasks: go-inference ParallelSlots, wire --parallel, benchmark suite, flash attention comparison, documentation. Co-Authored-By: Virgil <virgil@lethean.io> |
||
|---|---|---|
| .. | ||
| plans | ||
5 tasks: go-inference ParallelSlots, wire --parallel, benchmark suite, flash attention comparison, documentation. Co-Authored-By: Virgil <virgil@lethean.io> |
||
|---|---|---|
| .. | ||
| plans | ||