• Campaign Progress Update: Cognition AI releases new “Acceptable Usage Policy”

    Campaign Progress Update: Cognition AI releases new “Acceptable Usage Policy”

    Over 500 people have signed our petition against Cognition since we launched it this summer. In that time, we’ve been calling the company out for having never once publicly discussed safety and responsible usage. But yesterday, that changed. Cognition released an acceptable usage policy that details what obligations users have to them in using their…

  • Cognition

    Released an acceptable usage policy, along with a reporting email for security vulnerabilities.

  • OpenAI

    Removed author from GPT-4o system card.

  • Following the trendlines: The pace of AI progress

    Following the trendlines: The pace of AI progress

    If there’s one thing to know about the current state of AI development, it’s this: Things are moving faster than anyone anticipated. For a long time, there was uncertainty about whether the set of methods known as machine learning would ever be able to achieve human-level general intelligence, let alone surpass it. In the late…

  • OpenAI

    Adjusted authorship for a two-year-old article on their approach to alignment (with no substantive changes to the content)

  • Join “Red Teaming In Public”

    Join “Red Teaming In Public”

    “Red Teaming in Public” is a project, originally started by Nathan Labenz and Pablo Eder in June 2024. The goal is to catalyze a shift toward higher standards for AI developers. Labenz shared the following details in the project’s announcement on X: For context, we are pro-technology “AI Scouts” who believe in the immense potential…

  • Incentive gradients and The Midas Project’s theory of change

    Incentive gradients and The Midas Project’s theory of change

    Why start an industry watchdog organization calling out irresponsible AI developers? Companies move along incentive gradients. Imagine this as a 3D landscape with peaks and valleys, downward slopes and upward climbs.  Companies move along this landscape. They want to follow the path of least resistance. They’re constantly moving in the easiest, cheapest direction, just as…

  • OpenAI

    Released the preparedness scorecard for GPT-4o (many months behind promised schedule)

  • Which tech companies are taking AI risk seriously?

    Which tech companies are taking AI risk seriously?

    Tech companies are locked in an all-out race to develop and deploy advanced AI systems. There’s a lot of money to be made, and indeed, plenty of opportunities to improve the world. But there are also serious risks — and racing to move as quickly as possible can make detecting and averting these risks a…

  • Magic.dev has finally released a risk evaluation policy. How does it measure up?

    Magic.dev has finally released a risk evaluation policy. How does it measure up?

    Big news: the AI coding startup Magic.dev has released a new risk evaluation policy this week. Referred to as their “AGI Readiness Policy” and developed in collaboration with the nonprofit METR, this announcement follows in the footsteps of Responsible Scaling Policies (RSPs) released by companies like Anthropic, OpenAI, and Google Deepmind. So how does it…

Got any book recommendations?