NVIDIA reported a clean sweep of MLPerf Training v6.
33 results for: data
Adds execute_write_sql tool to request approval before DB writes
datasette-agent 0.3a0 introduces execute_write_sql to prompt for user approval and apply DB permissions, plus chat CLI approval flags.
Blackwell Ultra Tops AgentPerf, Runs 20x More Agents per MW
Artificial Analysis's AgentPerf benchmark names NVIDIA's Blackwell Ultra NVL72 the leader in early agentic-AI infrastructure tests, citing a 20x agents-per-megawatt figure in published results.
WHO data show voluntary blood donations top 85%, but access gaps remain
WHO reports sustained gains in blood safety as voluntary donations exceed 85%, while uneven access and weak governance continue to limit coverage.
MicroPython WASM Sandbox Enables Safer Datasette Plugin Execution
Simon Willison published an alpha MicroPython-in-WASM sandbox (micropython-wasm) and a Datasette plugin (datasette-agent-micropython) to run plugin code with constrained access.
Pentagon Seeks JWCC Follow-On to Build Three-Tier Cloud Marketplace
A draft solicitation proposes a three-tier cloud ecosystem for AI, tactical edge operations and secure data sharing across the Defense Department.
Amazon Bedrock AgentCore Adds Policy and Lambda Interceptors for Secure Agents
AWS demonstrates layering deterministic Policy checks with Lambda interceptors in the Bedrock AgentCore gateway using a lakehouse data agent to enforce geography-based controls.
PIGMENT extends quantitative diffusion MRI to sparse, multi-site and low-field scans
A physics-informed foundation model called PIGMENT learns a universal microstructure prior and adapts zero-shot to individual diffusion MRI scans, enabling reliable maps from sparse and heterogeneous data.
MPMMine standardizes benchmarks for constraint-acquisition research
An arXiv preprint introduces MPMMine, a benchmark suite built to supply the domain artifacts and structured data constraint-acquisition methods need for reproducible evaluation.
NVIDIA Vera CPU Runs Fast and Sustained in Early Phoronix Tests
Initial Phoronix benchmarks published on NVIDIA's blog show the Vera CPU delivers the fast cores, memory bandwidth and full-core throughput targeted at agentic AI workloads.
Paper Proposes Three-Step Framework for Knowledge-Work Benchmarks
An arXiv paper argues that LLM evaluation still mirrors traditional NLP tasks and offers a three-step method to align benchmarks with real workplace activity.
Datasette Adds Extensible 'Jump to' Menu in 1.0a30
Datasette 1.0a30 introduces a customizable, searchable 'Jump to...' menu and a plugin hook for adding entries to its index.
Authors Release OpenEval and Demand Item-Level Benchmark Standards
A position paper argues AI evaluation must publish item-level benchmark responses and ships OpenEval (10M model responses across 155k items) to prove the point.
UK Invests in AI Firm Developing Knowledge-Discovery Technology
The UK government is backing a company focused on AI aimed at automating knowledge discovery.
NVIDIA Unveils Framework for In-Vehicle AI Systems from Cloud to Car
NVIDIA details a cloud-to-car framework for in-vehicle AI that is intended to change automotive interfaces.
OpenClassGen Provides Extensive Python Classes for LLM Research
OpenClassGen introduces a comprehensive dataset of Python classes, enhancing LLM evaluation.
Meta Establishes HSM-based Backup Vault for Encrypted Messaging Data
Meta unveils a hardware security module (HSM)-based Backup Key Vault to enhance encryption for user data.
Research Proposes MedCheck Framework to Enhance Medical AI Benchmarks
A new framework aims to improve the assessment of medical AI benchmarks, addressing key shortcomings.
Experts Assess LLM Performance on Japanese Bar Exam's Open-Ended Tasks
A new study evaluates LLMs' legal reasoning using the Japanese bar exam's writing component.
New Framework Streamlines Adaptive Medical Image Processing for Clinical Settings
A novel artifact-based agent framework enhances adaptability and reproducibility in medical imaging.
Unauthorized Access to Anthropic's Mythos Highlights Security Risks in AI
Discord sleuths gain unauthorized access to Anthropic's Mythos, revealing vulnerabilities in AI security.
NVIDIA Advances Federated Learning with New FLARE Capabilities
NVIDIA enhances federated learning, streamlining processes for managing valuable yet immovable data.
AI and GPUs Accelerate Cosmic Data Analysis This Spring Astronomy Day
AI technologies and GPUs are streamlining the analysis of vast cosmic datasets for astronomers.
AI's Growth Demands Robust Data Fabric for Business Impact
As AI technologies proliferate in enterprises, the need for a strong data fabric becomes crucial.
Hugging Face Releases ml-intern to Automate LLM Post‑Training Workflows
ml-intern is an open-source agent that automates literature review, dataset discovery, training script runs, and iterative evaluation for LLM post-training work.
Gas-Powered Data Centers May Emit More GHG Than Nations
Emerging gas-powered data centers linked to major tech firms could release over 129 million tons of greenhouse gases annually.
Amazon Invests $5B in Anthropic; Anthropic Commits $100B to AWS
Amazon is investing $5 billion in Anthropic while Anthropic has pledged $100 billion in AWS spending, linking the startup’s compute demand directly to Amazon.
AWS launches G7e SageMaker instances with NVIDIA RTX PRO 6000 Blackwell GPUs
AWS added G7e instances to SageMaker AI using NVIDIA RTX PRO 6000 Blackwell GPUs, offering 96 GB of GDDR7 per GPU and node sizes of 1, 2, 4, or 8 GPUs to simplify hosting large open-source foundation models.
Cerebras Files for IPO After AWS Capacity Deal and Reported $10B OpenAI Contract
The AI chipmaker has moved toward a public listing after securing AWS data-center placement and a reported multibillion-dollar agreement with OpenAI.
Capcom’s PRAGMATA Launches on GeForce NOW Day One
Capcom’s sci‑fi action PRAGMATA is available on NVIDIA’s GeForce NOW the same day it launches, letting players stream the game to many devices without a console.
NVIDIA releases NVbandwidth to profile GPU interconnect and memory throughput
NVIDIA published NVbandwidth, a developer tool for measuring data-transfer and memory performance in CUDA-powered single- and multi-GPU systems.
Datasette 1.0a28 fixes alpha breakages, adds shutdown and test-cleanup APIs
Release 1.0a28 repairs compatibility regressions from 1.0a27, adds datasette.close and database.close behavior, and ships a pytest plugin to avoid file descriptor leaks.
EVE Releases Open-Source 24B Earth-Intelligence LLM and Benchmarks
EVE publishes EVE-Instruct, a 24B Mistral-based model and a suite of Earth-science datasets, benchmarks, and tooling for domain-specific LLM deployment.