Search: Release | CurrentLens.com

Models & Launches

Authors Release OpenEval and Demand Item-Level Benchmark Standards

CurrentLens
May 25, 2026

A position paper argues AI evaluation must publish item-level benchmark responses and ships OpenEval - 10M model responses across 155k items - to prove the point.

Open Source & Research

Civitai Launches High-Fidelity Studious Scout LoRA for Fortnite

CurrentLens
Apr 26, 2026

Civitai releases the Studious Scout 🎒 LoRA for Fortnite, designed for flexibility and character consistency.

AI in Coding

llm-openai-via-codex 0.1a0 Integrates LLM API with Codex CLI for Developers

CurrentLens
Apr 24, 2026

The release of llm-openai-via-codex 0.1a0 simplifies API calls for developers using Codex CLI.

Open Source & Research

Hugging Face Releases ml-intern to Automate LLM Post‑Training Workflows

CurrentLens
Apr 23, 2026

ml-intern is an open-source agent that automates literature review, dataset discovery, training script runs, and iterative evaluation for LLM post-training work.

Chips & Infrastructure

Gas-Powered Data Centers May Emit More GHG Than Nations

CurrentLens
Apr 23, 2026

Emerging gas-powered data centers linked to major tech firms could release over 129 million tons of greenhouse gases annually.

Models & Launches

Xiaomi Launches MiMo-V2.5-Pro and MiMo-V2.5 at Lower Costs

CurrentLens
Apr 23, 2026

Xiaomi's new MiMo models achieve frontier benchmarks while reducing token costs significantly.

Models & Launches

OpenAI Releases ChatGPT Images 2.0

CurrentLens
Apr 21, 2026

OpenAI published ChatGPT Images 2.0; Simon Willison ran a Where's‑Waldo‑style prompt to compare it with gpt-image-1 and rival models.

Models & Launches

Anthropic Ships Claude Opus 4.7 for Agentic Coding and High‑Res Vision

CurrentLens
Apr 19, 2026

Anthropic released Claude Opus 4.7, a focused successor to Opus 4.6 that emphasizes agentic software engineering, high-resolution vision and long-horizon autonomy.

Agents & Automation

AWS launches Spring AI SDK for Amazon Bedrock AgentCore

CurrentLens
Apr 17, 2026

AWS has released an open-source Spring AI AgentCore SDK that embeds Bedrock AgentCore capabilities into Spring AI and targets production-ready agent workflows.

Chips & Infrastructure

NVIDIA releases NVbandwidth to profile GPU interconnect and memory throughput

CurrentLens
Apr 17, 2026

NVIDIA published NVbandwidth, a developer tool for measuring data-transfer and memory performance in CUDA-powered single- and multi-GPU systems.

Models & Launches

Anthropic ships Claude Opus 4.7 as its most powerful generally available model

CurrentLens
Apr 17, 2026

Opus 4.7 arrives as Anthropic’s strongest generally available Claude release, claiming upgrades for advanced coding, image analysis and instruction following.

AI in Coding

Datasette 1.0a28 fixes alpha breakages, adds shutdown and test-cleanup APIs

CurrentLens
Apr 17, 2026

Release 1.0a28 repairs compatibility regressions from 1.0a27, adds datasette.close and database.close behavior, and ships a pytest plugin to avoid fd leaks.

Models & Launches

Qwen3.6-35B-A3B bests Claude Opus 4.7 on Willison's pelican test

CurrentLens
Apr 16, 2026

Simon Willison reports that a local, quantized Qwen3.6-35B-A3B run produced better pelican and flamingo illustrations than Anthropic's Claude Opus 4.

Science & Healthcare

EVE Releases Open-Source 24B Earth-Intelligence LLM and Benchmarks

CurrentLens
Apr 16, 2026

EVE publishes EVE-Instruct, a 24B Mistral-based model and a suite of Earth-science datasets, benchmarks, and tooling for domain-specific LLM deployment.

Models & Launches

llm-anthropic 0.25 Adds Claude-Opus-4.7 with xhigh thinking_effort

CurrentLens
Apr 16, 2026

Simon Willison released llm-anthropic 0.25, which ships claude-opus-4.7 supporting thinking_effort: xhigh and new thinking flags.

Models & Launches

DeepMind Ships Gemini Robotics‑ER 1.6 for Physical Robot Reasoning

CurrentLens
Apr 15, 2026

Gemini Robotics‑ER 1.6 adds instrument-reading plus improved visual, spatial and planning skills to DeepMind's embodied-reasoning model for robots.

Latest
Trending

Science & Healthcare

Africa CDC and WHO launch $518M continental Ebola response plan

CurrentLens
Jun 6, 2026

A six-month 'One Response' plan targets the Bundibugyo Ebola outbreak with unified coordination, surveillance, clinical care and community engagement across affected countries.

Policy & Safety

HASC adds right-to-repair language to FY27 defense policy bill

CurrentLens
Jun 6, 2026

The House Armed Services Committee inserted right-to-repair provisions into its FY27 defense policy draft, aiming to ease barriers that limit troops' ability to fix equipment.

AI Creative

Startups Pull Users Off Phones With In-Person Games and DIY Cyberdecks

CurrentLens
Jun 6, 2026

TechCrunch highlights founders building physical social products: Board raised funding for in-person games, and cyberdeck DIYs are going viral.

Agents & Automation

MicroPython WASM Sandbox Enables Safer Datasette Plugin Execution

CurrentLens
Jun 6, 2026

Simon Willison published an alpha MicroPython-in-WASM sandbox (micropython-wasm) and a Datasette plugin (datasette-agent-micropython) to run plugin code with constrained access.

Models & Launches

DKPS method cuts model-evaluation queries using cached responses

CurrentLens
Jun 6, 2026

An arXiv paper introduces a DKPS-based approach that uses cached model outputs to predict benchmark scores while substantially reducing the number of queries.