NVIDIA announced new software, open-source models and platform partnerships to build autonomous AI agents for engineering, healthcare, development and business operations.
44 results for: New
Pope Leo XIV Declares AI 'Not a Purely Technical Matter' in New Encyclical
Pope Leo XIV published Magnifica Humanitas, framing AI decisions as moral and social; Anthropic's Christopher Olah attended the unveiling and reactions in tech were mixed.
ATOM Report Finds Chinese Open Models Overtook Western Peers in 2025
A new ATOM analysis of about 1,500 open language models maps downloads, derivatives, inference share and performance, and reports Chinese models surpassed U.S.
Grimoire - Prompt Builder with AI Generation for SD / ComfyUI (Other)
What is new here is that grimoire - Prompt Builder with AI Generation for SD / ComfyUI (Other).
Inside Anduril and Meta’s quest to make smart glasses for warfare
What is new here is that inside Anduril and Meta’s quest to make smart glasses for warfare.
Musk v. Altman proved that AI is led by the wrong people
What is new here is that musk v. Altman proved that AI is led by the wrong people.
Turkey’s STM debuts new unmanned systems, is ‘really open’ to Gulf collaboration
What is new here is that turkey’s STM debuts new unmanned systems, is ‘really open’ to Gulf collaboration.
Here’s what Mira Murati’s AI company is up to
What is new here is that here’s what Mira Murati’s AI company is up to. Image, video, music, design, audio and creator-facing generative AI.
Multimodal LLMs Underperform in Real-World Dermatology Evaluation
A new study reveals that multimodal large language models struggle with clinical dermatology tasks.
New Study Reveals Limits of Model-Level Evaluations in Alignment Assessments
A recent paper argues that alignment evaluation cannot solely rely on model-level assessments.
NSA Tests Anthropic's Mythos Preview for Vulnerability Assessment
The NSA's new initiative leverages Anthropic's AI tooling to enhance cybersecurity measures against vulnerabilities.
Microsoft's New AI Agent for Word Aims to Transform Legal Workflow
Microsoft unveils a dedicated AI agent in Word designed for legal teams, enhancing contract management.
RPC-Bench Introduces Fine-Grained Benchmark for Research Paper Comprehension
RPC-Bench addresses gaps in understanding academic papers for AI models with a new benchmark.
Research Proposes MedCheck Framework to Enhance Medical AI Benchmarks
A new framework aims to improve the assessment of medical AI benchmarks, addressing key shortcomings.
ATBench Introduces New Safety Evaluation Benchmarks for OpenClaw and Codex
ATBench unveils domain-specific benchmarks, ATBench-Claw and ATBench-Codex, enhancing trajectory safety evaluation.
NVIDIA Empowers AI Factories with New Enterprise Reference Architectures
NVIDIA announces its Enterprise Reference Architectures to support AI factories, enhancing productivity.
Goodfire Launches Silico, a New Tool for Debugging LLMs
Silico allows developers to fine-tune AI model parameters during training, enhancing control.
Experts Assess LLM Performance on Japanese Bar Exam's Open-Ended Tasks
A new study evaluates LLMs' legal reasoning using the Japanese bar exam's writing component.
New LLM Framework Enhances Mathematical Reasoning Evaluation
A novel LLM-based framework provides flexible evaluation of mathematical reasoning, addressing limitations of symbolic methods.
New Audit Reveals Flaws in Shapley Value Benchmarks for Explainable AI
A recent study critiques Shapley values, finding misalignment in evaluation metrics and human utility.
Microsoft Launches VibeVoice, a New Speech-to-Text Model
Microsoft introduces VibeVoice, a Whisper-style speech-to-text model with speaker diarization.
New Framework Streamlines Adaptive Medical Image Processing for Clinical Settings
A novel artifact-based agent framework enhances adaptability and reproducibility in medical imaging.
Anthropic Tests Marketplace for AI Agent Commerce
Anthropic's new marketplace allows AI agents to facilitate real transactions between buyers and sellers.
Test-Time Matching Enhances Compositional Reasoning in Multimodal Models
A new test-time matching method improves compositional reasoning in AI models, achieving state-of-the-art results.
NATO Report Highlights Readiness Divide Among Eastern Flank Countries
A new report reveals critical gaps in military readiness for NATO's eastern flank nations, particularly in sustainment capabilities.
WHO Prequalifies First-Ever Malaria Treatment for Newborns and Infants
The WHO has prequalified the first specialized malaria treatment for newborns and young infants, addressing a critical healthcare gap.
NVIDIA Advances Federated Learning with New FLARE Capabilities
NVIDIA enhances federated learning, streamlining processes for managing valuable yet immovable data.
OpenCLAW-P2P v6.0 Enhances Decentralized AI Peer Review with New Features
OpenCLAW-P2P v6.0 introduces advanced subsystems for decentralized AI peer review, improving paper resilience and retrieval.
Anthropic Restructures OpenClaw, Imposing New Costs for Users
Anthropic enforces restrictions on OpenClaw, impacting AI agent tool users amid profit pressures.
Amazon Bedrock AgentCore Accelerates Agent Development Process
Amazon introduces new features in Bedrock AgentCore, enhancing agent development speed and efficiency.
NVIDIA Advances Optimizers to Speed Up LLM Training
NVIDIA introduces new higher-order optimizers to enhance training efficiency for large language models.
Google Launches Two New Tensor TPUs for AI Inference and Training
Google has unveiled two new specialized TPUs for distinct AI functions, enhancing its Tensor chip lineup.
Xiaomi Launches MiMo-V2.5-Pro and MiMo-V2.5 at Lower Costs
Xiaomi's new MiMo models achieve frontier benchmarks while reducing token costs significantly.
GitHub Copilot Tightens Pricing and Usage Limits for Individual Plans
GitHub Copilot imposes new usage limits and pauses signups for individual plans amid rising demand.
Qwen 3.6-27B Model Surpasses Previous Coding Benchmarks
The new Qwen 3.6-27B model delivers superior coding performance with a significantly reduced size.
RepIt Framework Enables Concept-Specific Refusal in Language Models
A new framework exposes vulnerabilities in language model safety evaluations through concept-specific manipulations.
Build Agent-First Governance to Secure a Growing Non‑Human Identity Footprint
As agentic AI proliferates, enterprises face a new attack surface: insecure agents and exploding non‑human identities that can be manipulated to reach sensitive systems.
Anthropic updates Claude Opus 4.7 system prompt with new tools and tighter safety guidance
Anthropic revised the Claude Opus 4.7 system prompt to add a PowerPoint agent, expand child-safety rules, and change interaction guidance.
Making AI operational in constrained public sector environments
What is new here is that making AI operational in constrained public sector environments.
Commission approves €211 million Italian State aid measure to support photonic chips development
What is new here is that commission approves €211 million Italian State aid measure to support photonic chips development.
Second GPAI Signatory Taskforce meeting - Copyright Chapter
What is new here is that second GPAI Signatory Taskforce meeting - Copyright Chapter.
llm-anthropic 0.25 Adds Claude-Opus-4.7 with xhigh thinking_effort
Simon Willison released llm-anthropic 0.25, which ships claude-opus-4.7 supporting thinking_effort: xhigh and new thinking flags.
NVIDIA Accelerates Adobe Premiere Color Grading Mode on RTX GPUs
NVIDIA is showcasing a new Adobe Premiere color grading mode accelerated on its GPUs at NAB Show 2026, targeting pro editors and live workflows.