an open source tool for automated behavioral evaluations

other other 2026-02-16 raw
Summary
We&#x27;re releasing <strong>Bloom</strong>, an open source agentic framework for generating behavioral evaluations of frontier AI models. <strong>Bloom</strong> takes a researcher-specified behavior and quantifies its frequency and severity across automatically generated scenarios. <strong>Bloom</strong>&#x27;s evaluations correlate strongly with our ...
View original source →