Saturday, June 13, 2026
  • x
  • facebook
  • instagram

CurrentLens.com

Insight Today. Impact Tomorrow.

  • Home
  • Models
  • Agents
  • Coding
  • Creative
  • Policy
  • Infrastructure
  • Topics
    • Enterprise
    • Open Source
    • Science
    • Education
    • AI & Warfare
Latest News
  • OpenAI Launches Three Academy Courses on Agents and Workflows
  • Google Releases Gemini-SQL2; Gemini 3.1 Pro Scores 80.04% on BIRD
  • Africa CDC and WHO launch $518M continental Ebola response plan
  • HASC adds right-to-repair language to FY27 defense policy bill
  • Startups Pull Users Off Phones With In-Person Games and DIY Cyberdecks
  • MicroPython WASM Sandbox Enables Safer Datasette Plugin Execution
  • OpenAI Launches Three Academy Courses on Agents and Workflows
  • Google Releases Gemini-SQL2; Gemini 3.1 Pro Scores 80.04% on BIRD
  • Africa CDC and WHO launch $518M continental Ebola response plan
  • HASC adds right-to-repair language to FY27 defense policy bill
  • Startups Pull Users Off Phones With In-Person Games and DIY Cyberdecks
  • MicroPython WASM Sandbox Enables Safer Datasette Plugin Execution
  • Home
  • Models & Launches
  • Google Releases Gemini-SQL2; Gemini 3.1 Pro Scores 80.04% on BIRD

Google Releases Gemini-SQL2; Gemini 3.1 Pro Scores 80.04% on BIRD

Posted on Jun 13, 2026 by CurrentLens in Models
Google Releases Gemini-SQL2; Gemini 3.1 Pro Scores 80.04% on BIRD

Photo by Andrey Matveev on Unsplash

AI Quick Take

  • Gemini-SQL2, powered by Gemini 3.1 Pro, achieved 80.04% execution accuracy on the BIRD single-model leaderboard.
  • Google has not published full implementation or evaluation details; readers should watch for released configs, APIs, or multi-model comparisons.

Google Research announced Gemini-SQL2 on June 12, 2026: a text-to-SQL capability powered by Gemini 3.1 Pro that posted 80.04% execution accuracy on the BIRD single-model leaderboard. The announcement highlights that execution-accuracy number as the core performance metric and frames Gemini-SQL2 as a schema-grounded text-to-SQL approach intended for practical database query generation.

The reported 80.04% figure is the BIRD benchmark’s execution-accuracy result for Gemini 3.1 Pro on the single-model leaderboard; the source material explains what that metric measures and how leaderboard placement is assessed. At the same time, Google has not yet published full implementation or evaluation details needed to reproduce the benchmark or immediately integrate the capability into production systems. The release also discusses use cases and a schema-grounded implementation pattern, which suggests recommended integration approaches even in the absence of full technical disclosure.

For engineers and product teams, the immediate takeaway is that Gemini-SQL2 establishes a public performance point for Gemini 3.1 Pro on a recognized text-to-SQL benchmark, but adoption decisions will depend on further details. Expect to watch for follow-up releases from Google that provide evaluation configurations, prompts or templates, APIs or SDKs, and multi-model comparisons that enable reproducible testing and practical deployment planning.

Posted in Models & Launches | Tags: google, gemini, text-to-sql, benchmarks, gemini-3.1-pro, bird, models-launches, Gemini
  • Latest
  • Trending
DKPS method cuts model-evaluation queries using cached responses
  • Models & Launches

DKPS method cuts model-evaluation queries using cached responses

  • CurrentLens
  • Jun 6, 2026

An arXiv paper introduces a DKPS-based approach that uses cached model outputs to predict benchmark scores while substantially reducing the number of queries.

Read More: DKPS method cuts model-evaluation queries using cached responses
PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans
  • Models & Launches

PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans

  • CurrentLens
  • Jun 2, 2026

A physics-informed foundation model called PIGMENT learns a universal microstructure prior and adapts zero-shot to individual diffusion MRI scans, enabling reliable maps from sparse and heterogeneous data.

Read More: PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans
ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025
  • Models & Launches

ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025

  • CurrentLens
  • May 27, 2026

A new ATOM analysis of about 1,500 open language models maps downloads, derivatives, inference share and performance, and reports Chinese models surpassed U.S.

Read More: ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025
Authors Release OpenEval and Demand Item-Level Benchmark Standards
  • Models & Launches

Authors Release OpenEval and Demand Item-Level Benchmark Standards

  • CurrentLens
  • May 25, 2026

A position paper argues AI evaluation must publish item-level benchmark responses and ships OpenEval - 10M model responses across 155k items - to prove the point.

Read More: Authors Release OpenEval and Demand Item-Level Benchmark Standards
Authors Release OpenEval and Demand Item-Level Benchmark Standards
  • Models & Launches

Authors Release OpenEval and Demand Item-Level Benchmark Standards

  • CurrentLens
  • May 25, 2026

A position paper argues AI evaluation must publish item-level benchmark responses and ships OpenEval - 10M model responses across 155k items - to prove the point.

Read More: Authors Release OpenEval and Demand Item-Level Benchmark Standards
ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025
  • Models & Launches

ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025

  • CurrentLens
  • May 27, 2026

A new ATOM analysis of about 1,500 open language models maps downloads, derivatives, inference share and performance, and reports Chinese models surpassed U.S.

Read More: ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025
PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans
  • Models & Launches

PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans

  • CurrentLens
  • Jun 2, 2026

A physics-informed foundation model called PIGMENT learns a universal microstructure prior and adapts zero-shot to individual diffusion MRI scans, enabling reliable maps from sparse and heterogeneous data.

Read More: PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans
DKPS method cuts model-evaluation queries using cached responses
  • Models & Launches

DKPS method cuts model-evaluation queries using cached responses

  • CurrentLens
  • Jun 6, 2026

An arXiv paper introduces a DKPS-based approach that uses cached model outputs to predict benchmark scores while substantially reducing the number of queries.

Read More: DKPS method cuts model-evaluation queries using cached responses

Categories

  • Models & Launches›
  • Agents & Automation›
  • AI in Coding›
  • AI Creative›
  • Policy & Safety›
  • Chips & Infrastructure›
  • Enterprise AI›
  • Open Source & Research›
  • Science & Healthcare›
  • AI in Education›
  • AI Defense & Warfare›
CurrentLens.com

Navigate

  • Home
  • Topics
  • About
  • Contact
  • Privacy Policy
  • Terms of Use

Coverage

  • Models & Launches
  • Agents & Automation
  • AI in Coding
  • AI Creative
  • Policy & Safety
  • Chips & Infrastructure

Newsletter

AI news that matters, straight to your inbox.

© 2026 CurrentLens.comAll rights reserved