11 results for: llm
NVIDIA introduces new higher-order optimizers to enhance training efficiency for large language models.
Qwen 3.6-27B Model Surpasses Previous Coding Benchmarks
The new Qwen 3.6-27B model delivers superior coding performance with a significantly reduced size.
Run Claude Cowork and Claude Code Desktop in Amazon Bedrock
AWS now supports Claude Cowork and Claude Code Desktop inside Amazon Bedrock, available either directly or via an LLM gateway to broaden use beyond individual developer desktops.
Hugging Face Releases ml-intern to Automate LLM Post‑Training Workflows
ml-intern is an open-source agent that automates literature review, dataset discovery, training script runs, and iterative evaluation for LLM post-training work.
Firefox 150 Fixes 271 Vulnerabilities Found Using Claude Mythos Preview
Mozilla patched 271 vulnerabilities after an initial security evaluation that used an early Claude Mythos Preview in collaboration with Anthropic.
Dual-aspect framework evaluates LLMs on Vietnamese legal text
An arXiv paper introduces a benchmark for Vietnamese legal text that pairs quantitative scoring with error analysis, comparing GPT-4o, Claude 3 Opus, Gemini 1.5 Pro, and Grok-1.
Full fine-tuning concentrates LLM attribution in code-compliance models
An arXiv study uses perturbation-based attribution to compare full fine-tuning (FFT), LoRA, and quantized LoRA across model sizes, finding that FFT yields more concentrated attribution patterns.
Qwen3.6-35B-A3B bests Claude Opus 4.7 on Willison's pelican test
Simon Willison reports that a local, quantized Qwen3.6-35B-A3B run produced better pelican and flamingo illustrations than Anthropic's Claude Opus 4.7.
EVE Releases Open-Source 24B Earth-Intelligence LLM and Benchmarks
EVE publishes EVE-Instruct, a 24B Mistral-based model and a suite of Earth-science datasets, benchmarks, and tooling for domain-specific LLM deployment.
GLOW Merges GNN Predictions with LLM Reasoning for Open-World QA
GLOW pairs a pre-trained GNN with an LLM to answer questions over incomplete knowledge graphs and ships GLOW-BENCH, a 1,000-question evaluation.
llm-anthropic 0.25 Adds Claude-Opus-4.7 with xhigh thinking_effort
Simon Willison released llm-anthropic 0.25, which adds the claude-opus-4.7 model with support for thinking_effort: xhigh and new thinking options.