Search: Red | CurrentLens.com

Open Source & Research

DeepTrap uncovers contextual vulnerabilities in OpenClaw agents

CurrentLens
Jun 16, 2026

A new arXiv paper introduces DeepTrap, a black-box framework that finds execution-context attacks against OpenClaw and publishes a 42-case benchmark and code.

Models & Launches

Google Releases Gemini-SQL2; Gemini 3.1 Pro Scores 80.04% on BIRD

CurrentLens
Jun 13, 2026

Google Research announced Gemini-SQL2, a Gemini 3.1 Pro-powered text-to-SQL capability that posted 80.04% execution accuracy on the BIRD single-model leaderboard.

Science & Healthcare

Africa CDC and WHO launch $518M continental Ebola response plan

CurrentLens
Jun 6, 2026

A six-month 'One Response' plan targets the Bundibugyo Ebola outbreak with unified coordination, surveillance, clinical care and community engagement across affected countries.

Models & Launches

DKPS method cuts model-evaluation queries using cached responses

CurrentLens
Jun 6, 2026

An arXiv paper introduces a DKPS-based approach that uses cached model outputs to predict benchmark scores while substantially reducing the number of queries.

Open Source & Research

MPMMine standardizes benchmarks for constraint-acquisition research

CurrentLens
May 27, 2026

An arXiv preprint introduces MPMMine, a benchmark suite built to supply the domain artifacts and structured data constraint-acquisition methods need for reproducible evaluation.

Agents & Automation

OpenAI, Thrive and Crete Build Self‑Improving Tax Agent Using Codex

CurrentLens
May 27, 2026

OpenAI and partners built a Codex-powered tax agent they say automates filings, improves accuracy, and accelerates tax workflows for developers and operators.

Models & Launches

New Study Reveals Limits of Model-Level Evaluations in Alignment Assessments

CurrentLens
May 8, 2026

A recent paper argues that alignment evaluation cannot rely solely on model-level assessments.

AI Defense & Warfare

NATO Calls for Governance Standards on AI-Enhanced Geospatial Intelligence

CurrentLens
May 8, 2026

NATO emphasizes the need for policies governing the sharing of AI-powered geospatial intelligence to enhance allied operations.

AI in Coding

OpenAI-Microsoft AGI Clause Ends, Changing IP Landscape

CurrentLens
Apr 28, 2026

The unique AGI clause between OpenAI and Microsoft has been redefined, impacting IP rights.

Agents & Automation

Sierra Acquires YC-Backed AI Startup Fragment to Enhance Customer Service

CurrentLens
Apr 26, 2026

Sierra, founded by Bret Taylor, has acquired French AI startup Fragment, bolstering its customer service capabilities.

Chips & Infrastructure

AI's Growth Demands Robust Data Fabric for Business Impact

CurrentLens
Apr 23, 2026

As AI technologies proliferate in enterprises, the need for a strong data fabric becomes crucial.

Chips & Infrastructure

Gas-Powered Data Centers May Emit More GHG Than Nations

CurrentLens
Apr 23, 2026

Emerging gas-powered data centers linked to major tech firms could release over 129 million tons of greenhouse gases annually.

Models & Launches

Xiaomi Launches MiMo-V2.5-Pro and MiMo-V2.5 at Lower Costs

CurrentLens
Apr 23, 2026

Xiaomi's new MiMo models reach frontier-level benchmark scores while significantly reducing token costs.

AI in Coding

Qwen 3.6-27B Model Surpasses Previous Coding Benchmarks

CurrentLens
Apr 23, 2026

The new Qwen 3.6-27B model delivers superior coding performance with a significantly reduced size.

Open Source & Research

RARE Introduces Framework for Evaluating High-Similarity Document Retrieval

CurrentLens
Apr 23, 2026

The RARE framework addresses evaluation flaws in redundancy-heavy document retrieval, particularly in legal and financial sectors.

Models & Launches

OpenAI Adds Codex-Powered Workspace Agents to ChatGPT

CurrentLens
Apr 22, 2026

OpenAI introduced workspace agents in ChatGPT: Codex-powered cloud agents designed to automate complex workflows and scale team work across tools securely.

Chips & Infrastructure

NVIDIA releases NVbandwidth to profile GPU interconnect and memory throughput

CurrentLens
Apr 17, 2026

NVIDIA published NVbandwidth, a developer tool for measuring data-transfer and memory performance in CUDA-powered single- and multi-GPU systems.

Open Source & Research

Merge GNN Predictions with LLM Reasoning in GLOW for Open-World QA

CurrentLens
Apr 16, 2026

GLOW pairs a pre-trained GNN with an LLM to answer questions over incomplete knowledge graphs and ships GLOW-BENCH, a 1,000-question evaluation.

Latest

Enterprise AI

HPE Expands AI Factory With NVIDIA for Agentic Deployments

CurrentLens
Jun 16, 2026

HPE and NVIDIA expanded the HPE AI Factory to include NVIDIA Vera CPU and the NVIDIA Agent Toolkit, positioning the offering for agent-first production use.

Chips & Infrastructure

NVIDIA Blackwell Sweeps MLPerf Training v6.0, Tops Per‑GPU and Scale

CurrentLens
Jun 16, 2026

NVIDIA reported a clean sweep of MLPerf Training v6.0.

AI in Coding

Z.ai Ships GLM-5.2 with Usable 1M-Token Context

CurrentLens
Jun 16, 2026

GLM-5.2 arrives across GLM Coding tiers with a 1M-token context, two effort modes, Anthropic-compatible endpoints and no benchmarks at launch.

Agents & Automation

datasette-agent 0.3a0 adds execute_write_sql tool to request approval before DB writes

CurrentLens
Jun 16, 2026

datasette-agent 0.3a0 introduces execute_write_sql to prompt for user approval and apply DB permissions, plus chat CLI approval flags.

Models & Launches

Extend Vision-Language-Action Policies to New Tasks via Retrieval

CurrentLens
Jun 16, 2026

An arXiv paper shows frozen vision-language-action policies can absorb new tasks at test time by retrieving pool-side demonstrations instead of per-task fine-tuning.