Skip to content

v0.35.0

Latest

Choose a tag to compare

@philipph-askui philipph-askui released this 02 Jun 08:53
89a8cec

v0.35.0

🎉 Overview

v0.35.0 adds support for OpenAI-compatible APIs as model providers, enabling the use of OpenAI, Ollama, vLLM, LM Studio, Together AI, RunPod, and any other service that exposes an OpenAI-compatible chat completions endpoint. Truncation strategies now preserve the first user message across summarization to retain the original task instructions, and the truncation headroom has been doubled to reduce the chance of hitting context limits immediately after truncation.

✨ New Features

  • OpenAIVlmProvider — VLM provider for any OpenAI-compatible API (OpenAI, vLLM, LM Studio, Together AI, etc.) by @philipph-askui in #268
  • OpenAIImageQAProvider — image Q&A provider for any OpenAI-compatible API by @philipph-askui in #268
  • OllamaVlmProvider — convenience wrapper for local Ollama instances with sensible defaults (base_url=http://localhost:11434/v1, model_id=qwen3.5) by @philipph-askui in #268
  • OllamaImageQAProvider — image Q&A via local Ollama instances by @philipph-askui in #268
  • OpenAICompatibleVlmProvider — VLM provider for endpoints that require an exact URL (e.g., RunPod, custom proxies) where the OpenAI SDK's automatic path appending would break the request by @philipph-askui in #268
  • OpenAIMessagesApi — full translation layer between the internal MessageParam format and OpenAI's chat completions API, handling tool calls, image content, thinking blocks, and role alternation by @philipph-askui in #268
  • OpenAIGetModelGetModel implementation for OpenAI-compatible APIs with structured output support by @philipph-askui in #268
  • Built-in pricing data for gpt-5.4, gpt-5.4-mini, and gpt-5.4-nano models by @philipph-askui in #268

🔧 Improvements

  • Truncation strategies now preserve the first user message across summarization, ensuring the original task instructions are never lost when the conversation is truncated by @philipph-askui in #280
  • MAX_INPUT_TOKENS increased from 100k to 200k and TRUNCATION_THRESHOLD lowered from 0.7 to 0.56, roughly doubling the headroom after truncation to reduce the chance of re-triggering truncation immediately by @philipph-askui in #280
  • process_id parameter in list_process_windows tool is now auto-converted to int, preventing tool errors when the agent passes it as a string by @philipph-askui in #279

🐛 Bug Fixes

  • AgentSpeaker now handles the case where the model returns stop_reason='tool_use' but no actual tool call blocks in the content, preventing stopped executions by prompting the model to retry with a valid tool call by @philipph-askui in #278

Full Changelog: v0.34.0...v0.35.0