Models & Launches - AI News | CurrentLens.com

Major model releases, flagship updates, launches, benchmarks and product unveilings.

Models & Launches

DKPS method cuts model-evaluation queries using cached responses

CurrentLens
Jun 6, 2026

An arXiv paper introduces a DKPS-based approach that uses cached model outputs to predict benchmark scores while substantially reducing the number of queries.

Models & Launches

PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans

CurrentLens
Jun 2, 2026

A physics-informed foundation model called PIGMENT learns a universal microstructure prior and adapts zero-shot to individual diffusion MRI scans, enabling reliable maps from sparse and heterogeneous data.

Models & Launches

ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025

CurrentLens
May 27, 2026

A new ATOM analysis of about 1,500 open language models maps downloads, derivatives, inference share and performance, and reports Chinese models surpassed U.S.

Models & Launches

Authors Release OpenEval and Demand Item-Level Benchmark Standards

CurrentLens
May 25, 2026

A position paper argues AI evaluation must publish item-level benchmark responses and ships OpenEval - 10M model responses across 155k items - to prove the point.

Models & Launches

New Study Reveals Limits of Model-Level Evaluations in Alignment Assessments

CurrentLens
May 8, 2026

A recent paper argues that alignment evaluation cannot solely rely on model-level assessments.

Models & Launches

Aymara AI Launches Safety Evaluation System for 20 Language Models

CurrentLens
May 1, 2026

Aymara AI unveils a platform for custom safety evaluations of large language models, revealing performance gaps.

Models & Launches

Goodfire Launches Silico, a New Tool for Debugging LLMs

CurrentLens
Apr 30, 2026

Silico allows developers to fine-tune AI model parameters during training, enhancing control.

Models & Launches

Investors Fund Skye's AI Home Screen App Ahead of iPhone Launch

CurrentLens
Apr 28, 2026

Skye's AI home screen application secures investor backing pre-launch, highlighting interest in smarter iPhones.

Models & Launches

Microsoft Launches VibeVoice, a New Speech-to-Text Model

CurrentLens
Apr 28, 2026

Microsoft introduces VibeVoice, a Whisper-style speech-to-text model with speaker diarization.

Models & Launches

Test-Time Matching Enhances Compositional Reasoning in Multimodal Models

CurrentLens
Apr 27, 2026

A new test-time matching method improves compositional reasoning in AI models, achieving state-of-the-art results.

Models & Launches

OpenAI Introduces Parameter Golf in Model Craft Initiative

CurrentLens
Apr 26, 2026

OpenAI's latest initiative, Parameter Golf, aims to refine model performance metrics.

Models & Launches

DenoiseRank Introduces Generative Approach to Learning to Rank

CurrentLens
Apr 26, 2026

DenoiseRank leverages diffusion models for a fresh generative angle on learning to rank tasks.

Models & Launches

Nemobot Introduces Strategic AI Agents for Interactive Gaming

CurrentLens
Apr 26, 2026

Nemobot leverages large language models to create customizable AI agents for strategic games.

Models & Launches

AI Models Show Risks for Biological Misuse Amid Evolving Safeguards

CurrentLens
Apr 24, 2026

Recent benchmarks reveal AI models may enable biological weaponization by low-expertise users, raising urgent policy concerns.

Models & Launches

Xiaomi Launches MiMo-V2.5-Pro and MiMo-V2.5 at Lower Costs

CurrentLens
Apr 23, 2026

Xiaomi's new MiMo models achieve frontier benchmarks while reducing token costs significantly.

Models & Launches

OpenAI Makes ChatGPT Free for Verified U.S. Healthcare Professionals

CurrentLens
Apr 23, 2026

OpenAI has announced that verified U.S. physicians, nurse practitioners, and pharmacists can now access ChatGPT for Clinicians at no charge.

Models & Launches

RepIt Framework Enables Concept-Specific Refusal in Language Models

CurrentLens
Apr 23, 2026

A new framework exposes vulnerabilities in language model safety evaluations through concept-specific manipulations.

Models & Launches

OpenAI Adds Codex-Powered Workspace Agents to ChatGPT

CurrentLens
Apr 22, 2026

OpenAI introduced workspace agents in ChatGPT: Codex-powered cloud agents designed to automate complex workflows and scale team work across tools securely.

Models & Launches

Firefox 150 Fixes 271 Vulnerabilities Found Using Claude Mythos Preview

CurrentLens
Apr 22, 2026

Mozilla patched 271 vulnerabilities after an initial security evaluation that used an early Claude Mythos Preview in collaboration with Anthropic.

Models & Launches

Full fine-tuning concentrates LLM attribution in code-compliance models

CurrentLens
Apr 21, 2026

An arXiv study uses perturbation-based attribution to compare FFT, LoRA, and quantized LoRA across model sizes and finds FFT yields more focused interpretive patterns.

Models & Launches

OpenAI Releases ChatGPT Images 2.0

CurrentLens
Apr 21, 2026

OpenAI published ChatGPT Images 2.0; Simon Willison ran a Where's‑Waldo‑style prompt to compare it with gpt-image-1 and rival models.

Models & Launches

AllenAI launches vla-eval to unify Vision-Language-Action benchmarking

CurrentLens
Apr 21, 2026

vla-eval decouples model inference from simulator execution with a WebSocket+msgpack protocol and Docker isolation, supporting 14 benchmarks and six model servers.

Models & Launches

Anthropic updates Claude Opus 4.7 system prompt with new tools and tighter safety guidance

CurrentLens
Apr 19, 2026

Anthropic revised the Claude Opus 4.7 system prompt to add a PowerPoint agent, expand child-safety rules, and change interaction guidance.

Models & Launches

Anthropic Ships Claude Opus 4.7 for Agentic Coding and High‑Res Vision

CurrentLens
Apr 19, 2026

Anthropic released Claude Opus 4.7, a focused successor to Opus 4.6 that emphasizes agentic software engineering, high-resolution vision and long-horizon autonomy.

Models & Launches

Anthropic ships Claude Opus 4.7 as its most powerful generally available model

CurrentLens
Apr 17, 2026

Opus 4.7 arrives as Anthropic’s strongest generally available Claude release, claiming upgrades for advanced coding, image analysis and instruction following.

Models & Launches

OpenAI Debuts GPT-Rosalind for Drug Discovery and Genomics

CurrentLens
Apr 17, 2026

OpenAI launched GPT-Rosalind, its first life‑sciences model aimed at accelerating drug discovery and genomic analysis and cutting long development timelines.

Models & Launches

Qwen3.6-35B-A3B bests Claude Opus 4.7 on Willison's pelican test

CurrentLens
Apr 16, 2026

Simon Willison reports that a local, quantized Qwen3.6-35B-A3B run produced better pelican and flamingo illustrations than Anthropic's Claude Opus 4.

Models & Launches

llm-anthropic 0.25 Adds Claude-Opus-4.7 with xhigh thinking_effort

CurrentLens
Apr 16, 2026

Simon Willison released llm-anthropic 0.25, which ships claude-opus-4.7 supporting thinking_effort: xhigh and new thinking flags.

Models & Launches

Google Launches Gemini 3.1 Flash TTS with 70+ Language, Multi‑Speaker Support

CurrentLens
Apr 16, 2026

Gemini 3.1 Flash TTS is a preview that refocuses Google’s speech work on expressive control, natural‑language audio tags, and native multilingual, multi‑speaker output.

Models & Launches

DeepMind Ships Gemini Robotics‑ER 1.6 for Physical Robot Reasoning

CurrentLens
Apr 15, 2026

Gemini Robotics‑ER 1.6 adds instrument-reading plus improved visual, spatial and planning skills to DeepMind's embodied-reasoning model for robots.

Models & Launches

NVIDIA Launches Ising AI Models to Tackle Noisy Qubits

CurrentLens
Apr 14, 2026

NVIDIA unveiled Ising, an open family of AI models with Calibration and Decoding domains designed to help build fault-tolerant quantum processors.

Models & Launches

OpenAI pushes to lock users and expand enterprise in internal memo

CurrentLens
Apr 14, 2026

CRO Denise Dresser told staff to prioritize user retention and enterprise sales and to build a product 'moat' as users easily switch between top models.