June 1, 2026

Strike Force heroes4

Connecting the World with Advanced Technology

New AGI benchmark indicates whether a future AI model could cause ‘catastrophic harm’

New AGI benchmark indicates whether a future AI model could cause ‘catastrophic harm’

Scientists have designed a new set of tests that measure whether artificial intelligence (AI) agents can modify their own code and improve its capabilities without human instruction.

The benchmark, dubbed “MLE-bench,” is a compilation of 75 Kaggle tests, each one a challenge that tests machine learning engineering. This work involves training AI models, preparing datasets, and running scientific experiments, and the Kaggle tests measure how well the machine learning algorithms perform at specific tasks.

link

Copyright © All rights reserved. | Newsphere by AF themes.