v0.35.0

🎉 Overview

v0.35.0 adds support for OpenAI-compatible APIs as model providers, enabling the use of OpenAI, Ollama, vLLM, LM Studio, Together AI, RunPod, and any other service that exposes an OpenAI-compatible chat completions endpoint. Truncation strategies now preserve the first user message across summarization to retain the original task instructions, and the truncation headroom has been doubled to reduce the chance of hitting context limits immediately after truncation.

✨ New Features

OpenAIVlmProvider — VLM provider for any OpenAI-compatible API (OpenAI, vLLM, LM Studio, Together AI, etc.) by @philipph-askui in #268
OpenAIImageQAProvider — image Q&A provider for any OpenAI-compatible API by @philipph-askui in #268
OllamaVlmProvider — convenience wrapper for local Ollama instances with sensible defaults (base_url=http://localhost:11434/v1, model_id=qwen3.5) by @philipph-askui in #268
OllamaImageQAProvider — image Q&A via local Ollama instances by @philipph-askui in #268
OpenAICompatibleVlmProvider — VLM provider for endpoints that require an exact URL (e.g., RunPod, custom proxies) where the OpenAI SDK's automatic path appending would break the request by @philipph-askui in #268
OpenAIMessagesApi — full translation layer between the internal MessageParam format and OpenAI's chat completions API, handling tool calls, image content, thinking blocks, and role alternation by @philipph-askui in #268
OpenAIGetModel — GetModel implementation for OpenAI-compatible APIs with structured output support by @philipph-askui in #268
Built-in pricing data for gpt-5.4, gpt-5.4-mini, and gpt-5.4-nano models by @philipph-askui in #268

🔧 Improvements

Truncation strategies now preserve the first user message across summarization, ensuring the original task instructions are never lost when the conversation is truncated by @philipph-askui in #280
MAX_INPUT_TOKENS increased from 100k to 200k and TRUNCATION_THRESHOLD lowered from 0.7 to 0.56, roughly doubling the headroom after truncation to reduce the chance of re-triggering truncation immediately by @philipph-askui in #280
process_id parameter in list_process_windows tool is now auto-converted to int, preventing tool errors when the agent passes it as a string by @philipph-askui in #279

🐛 Bug Fixes

AgentSpeaker now handles the case where the model returns stop_reason='tool_use' but no actual tool call blocks in the content, preventing stopped executions by prompting the model to retry with a valid tool call by @philipph-askui in #278

Full Changelog: v0.34.0...v0.35.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.35.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

v0.35.0

🎉 Overview

✨ New Features

🔧 Improvements

🐛 Bug Fixes

Contributors

Uh oh!