Search: metrics | CurrentLens.com

Open Source & Research

New Audit Reveals Flaws in Shapley Value Benchmarks for Explainable AI

CurrentLens
Apr 28, 2026

A recent study critiques Shapley values, finding misalignment in evaluation metrics and human utility.

Models & Launches

Test-Time Matching Enhances Compositional Reasoning in Multimodal Models

CurrentLens
Apr 27, 2026

A new test-time matching method improves compositional reasoning in AI models, achieving state-of-the-art results.

Models & Launches

OpenAI Introduces Parameter Golf in Model Craft Initiative

CurrentLens
Apr 26, 2026

OpenAI's latest initiative, Parameter Golf, aims to refine model performance metrics.

AI in Education

Researchers Build an Index to Measure the Human Relationship with Nature

CurrentLens
Apr 16, 2026

Conservationists are moving from exclusionary models toward metrics that count human stewardship alongside ecological health.

Latest
Trending

Science & Healthcare

Africa CDC and WHO launch $518M continental Ebola response plan

CurrentLens
Jun 6, 2026

A six-month 'One Response' plan targets the Bundibugyo Ebola outbreak with unified coordination, surveillance, clinical care and community engagement across affected countries.

Policy & Safety

HASC adds right-to-repair language to FY27 defense policy bill

CurrentLens
Jun 6, 2026

The House Armed Services Committee inserted right-to-repair provisions into its FY27 defense policy draft, aiming to ease barriers that limit troops' ability to fix equipment.

AI Creative

Startups Pull Users Off Phones With In-Person Games and DIY Cyberdecks

CurrentLens
Jun 6, 2026

TechCrunch highlights founders building physical social products: Board raised funding for in-person games, and cyberdeck DIYs are going viral.

Agents & Automation

MicroPython WASM Sandbox Enables Safer Datasette Plugin Execution

CurrentLens
Jun 6, 2026

Simon Willison published an alpha MicroPython-in-WASM sandbox (micropython-wasm) and a Datasette plugin (datasette-agent-micropython) to run plugin code with constrained access.

Models & Launches

DKPS method cuts model-evaluation queries using cached responses

CurrentLens
Jun 6, 2026

An arXiv paper introduces a DKPS-based approach that uses cached model outputs to predict benchmark scores while substantially reducing the number of queries.