- This PR is to make it on path for truncating by tokens. This path will be initially used by unified exec and context manager (responsible for MCP calls mainly). - We are exposing new config `calls_output_max_tokens` - Use `tokens` as the main budget unit but truncate based on the model family by Introducing `TruncationPolicy`. - Introduce `truncate_text` as a router for truncation based on the mode. In next PRs: - remove truncate_with_line_bytes_budget - Add the ability to the model to override the token budget. |
||
|---|---|---|
| .. | ||
| cache | ||
| git | ||
| image | ||
| json-to-toml | ||
| pty | ||
| readiness | ||
| string | ||
| tokenizer | ||