A new study evaluates LLMs' legal reasoning using the Japanese bar exam's writing component.
5 results for: dataset
AI and GPUs Accelerate Cosmic Data Analysis This Spring Astronomy Day
AI technologies and GPUs are streamlining the analysis of vast cosmic datasets for astronomers.
Hugging Face Releases ml-intern to Automate LLM Post‑Training Workflows
ml-intern is an open-source agent that automates literature review, dataset discovery, training script runs, and iterative evaluation for LLM post-training work.
Datasette 1.0a28 fixes alpha breakages, adds shutdown and test-cleanup APIs
Release 1.0a28 repairs compatibility regressions from 1.0a27, adds datasette.close and database.close behavior, and ships a pytest plugin to avoid fd leaks.
EVE Releases Open-Source 24B Earth-Intelligence LLM and Benchmarks
EVE publishes EVE-Instruct, a 24B Mistral-based model and a suite of Earth-science datasets, benchmarks, and tooling for domain-specific LLM deployment.