llm-anthropic 0.25 adds the claude-opus-4.7 model and a thinking_effort: xhigh setting, introduces thinking_display and thinking_adaptive options, raises the default max_tokens to each model’s limit, and removes an obsolete structured-outputs beta header.
AI Quick Take
- New model claude-opus-4.7, plus a thinking_effort: xhigh setting and thinking_display/thinking_adaptive flags (thinking summaries are currently JSON-only).
- Default max_tokens raised to each model’s limit, and the older structured-outputs-2025-11-13 beta header used for legacy models has been removed.
Simon Willison released llm-anthropic 0.25, which adds a new model, claude-opus-4.7, and introduces support for a higher internal effort setting via thinking_effort: xhigh.
The package also adds two boolean flags, thinking_display and thinking_adaptive. Summarized output produced by thinking_display is currently exposed only in JSON output or JSON logs. Separately, the release increases the default max_tokens setting to each model’s maximum allowed value and removes the older structured-outputs-2025-11-13 beta header previously used for legacy models.
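As an illustration, these options would be set through llm’s standard `-o` mechanism; the model ID and option names below follow the release notes above, while the prompt text is arbitrary:

```shell
# Request the new model with the higher internal effort setting
llm -m claude-opus-4.7 -o thinking_effort xhigh 'Outline a migration plan'

# Enable the two new boolean thinking flags
llm -m claude-opus-4.7 -o thinking_display true -o thinking_adaptive true \
  'Outline a migration plan'

# Thinking summaries are currently surfaced only in JSON, e.g. via the logs:
llm logs -n 1 --json
```

This is a sketch of expected usage, not a definitive reference; consult the plugin’s README for the authoritative option spellings.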
Operationally, these are configuration and compatibility changes: the new thinking flags formalize how callers can request and receive internal "thinking" summaries, though those summaries are limited to JSON for now. The raised max_tokens default simplifies use cases that expect full-length responses, and dropping the obsolete header retires beta-era legacy behavior. Teams should watch for further SDK documentation and any expansion of thinking_display beyond JSON outputs.