Back to news
Ethics Policy

New Test Platform Measures How Dangerous AI Models Can Be in Biological Threats

Researchers have developed a systematic way to build a test dataset that can assess how much advanced AI models can facilitate the design of biological weapons or bioterrorism.

The concern is that large language models — AIs that produce and understand text like humans — could provide detailed advice on handling or exploiting dangerous bacteria. Model developers and policymakers need a reliable way to measure such risks so they can also be mitigated.

The recent work describes how the so-called Bacterial Biothreat Benchmark (B3) dataset is constructed as part of a broader Biothreat Benchmark Generation framework. The aim is to create a collection of precisely defined questions and tasks to test whether AI provides answers that could practically facilitate the realization of a biological threat.

The dataset was compiled in three complementary ways. Firstly, researchers utilized an online question and task bank where various biological threat scenarios are converted into prompts for AI. Secondly, a so-called red team method was used, where experts intentionally try to find weaknesses in security restrictions. The third approach complemented these, but the article's summary only mentions it at the headline level.

The research does not yet focus on the results of individual AI models but on the production of the dataset itself. The goal is that such standardized tests will help in the future to compare the biological risk potential of different models and support legislators and developers in setting boundaries.

Source: Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models II: Benchmark Generation Process, ArXiv (AI).

This text was generated with AI assistance and may contain errors. Please verify details from the original source.

Original research: Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models II: Benchmark Generation Process
Publisher: ArXiv (AI)
Authors: Gary Ackerman, Zachary Kallenborn, Anna Wetzel, Hayley Peterson, Jenna LaTourette, Olivia Shoemaker, Brandon Behlendorf, Sheriff Almakki, Doug Clifford, Noah Sheinbaum
December 28, 2025
Read original →