<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[AI Safety Newsletter]]></title><description><![CDATA[The latest news on AI Safety]]></description><link>https://newsletter.safe.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!fg--!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81b8a30d-94ac-419f-9cc7-69c1368bdc00_484x484.png</url><title>AI Safety Newsletter</title><link>https://newsletter.safe.ai</link></image><generator>Substack</generator><lastBuildDate>Tue, 14 Apr 2026 15:31:07 GMT</lastBuildDate><atom:link href="https://newsletter.safe.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Center for AI Safety]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[aisafety@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[aisafety@substack.com]]></itunes:email><itunes:name><![CDATA[Center for AI Safety]]></itunes:name></itunes:owner><itunes:author><![CDATA[Center for AI Safety]]></itunes:author><googleplay:owner><![CDATA[aisafety@substack.com]]></googleplay:owner><googleplay:email><![CDATA[aisafety@substack.com]]></googleplay:email><googleplay:author><![CDATA[Center for AI Safety]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[AISN #71: Cyberattacks & Datacenter Moratorium Bill]]></title><description><![CDATA[Also, updates on the Anthropic vs. Pentagon court case.]]></description><link>https://newsletter.safe.ai/p/aisn-71-cyberattacks-and-datacenter</link><guid isPermaLink="false">https://newsletter.safe.ai/p/aisn-71-cyberattacks-and-datacenter</guid><dc:creator><![CDATA[Alice Blair]]></dc:creator><pubDate>Fri, 10 Apr 2026 14:15:48 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b17e0c0c-33e1-4b58-92c6-3223e16785f8_1200x628.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>We&#8217;re Hiring.</strong> Opportunities at CAIS include:<a href="https://jobs.lever.co/aisafety/5cc2f823-5757-4e00-b2d6-aaf9c832735d"> Head of Public Engagement</a>,<a href="https://jobs.lever.co/aisafety/02e2df24-49d8-4d99-970f-4f7e98900133"> </a><a href="https://jobs.lever.co/aisafety/1d294768-31cd-4d00-a238-a3eded93c695">Principal, Special Projects</a>, <a href="https://jobs.lever.co/aisafety/0431d90d-82d9-4f82-b89b-ce51974906e7">Program Manager</a>, <a href="https://jobs.lever.co/aisafety/f0218805-28e2-4da5-a002-dddb8dfce7fd">Operations Manager</a>, and <a href="https://jobs.lever.co/aisafety">other roles</a>. If you&#8217;re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying!</p><h1>AI Software Infrastructure Cyberattacks</h1><p>Recently, cyberattacks targeting the AI industry&#8217;s software infrastructure stole private information potentially worth billions of dollars and inserted backdoors into developers&#8217; computers. Google Threat Intelligence Group <a href="https://cloud.google.com/blog/topics/threat-intelligence/north-korea-threat-actor-targets-axios-npm-package/">reported</a> that one of the largest cyberattacks in this wave was carried out by North Korea-linked hackers.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pUGs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pUGs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 424w, https://substackcdn.com/image/fetch/$s_!pUGs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 848w, https://substackcdn.com/image/fetch/$s_!pUGs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 1272w, https://substackcdn.com/image/fetch/$s_!pUGs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pUGs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png" width="915" height="1217" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1217,&quot;width&quot;:915,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pUGs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 424w, https://substackcdn.com/image/fetch/$s_!pUGs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 848w, https://substackcdn.com/image/fetch/$s_!pUGs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 1272w, https://substackcdn.com/image/fetch/$s_!pUGs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9237e72-4b1f-4074-9026-f597aa42c5f4_915x1217.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The stolen data may be worth billions. </strong>Hackers <a href="https://techcrunch.com/2026/03/31/mercor-says-it-was-hit-by-cyberattack-tied-to-compromise-of-open-source-litellm-project/">stole and auctioned</a> private data from Mercor, an AI training data supplier for OpenAI and Anthropic which was recently valued at $10 billion. Mercor collects AI training data from a large number of experts, as well as highly sensitive <a href="https://isc.sans.edu/diary/TeamPCP+Supply+Chain+Campaign+Update+005+First+Confirmed+Victim+Disclosure+PostCompromise+Cloud+Enumeration+Documented+and+Axios+Attribution+Narrows/32856">personal and biometric data</a> for identity verification. This attack not only comprises the data that Mercor sells, but also internal data that could be used to impersonate their hired experts. A person familiar with the situation stated that Mercor has paid the hackers&#8217; requested ransom, although it remains unclear if the hackers intend to release or sell the data regardless.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading AI Safety Newsletter! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>AI amplifies cyber risks. </strong>LLMs dramatically lower the bar for executing successful cyberattacks, and continue to rapidly become more advanced. An experiment in 2025 <a href="https://newsletter.mlsafety.org/i/190670410/real-world-ai-cyberoffense-evaluation">showed</a> LLMs performing real-world cyberoffense better than many human cyberoffense professionals. Anthropic recently <a href="https://red.anthropic.com/2026/mythos-preview/">announced</a> Claude Mythos, a closed-access LLM that has found critical vulnerabilities in every major operating system and browser, significantly advancing AI cyberoffense. Additionally, AI cyberattackers can be copied many times, allowing for attacks on much broader sections of the AI software ecosystem for significantly lower costs than human labor.</p><h1>Datacenter Moratorium and Export Controls Bill</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!naLh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!naLh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 424w, https://substackcdn.com/image/fetch/$s_!naLh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 848w, https://substackcdn.com/image/fetch/$s_!naLh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!naLh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!naLh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png" width="1440" height="1080" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1080,&quot;width&quot;:1440,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!naLh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 424w, https://substackcdn.com/image/fetch/$s_!naLh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 848w, https://substackcdn.com/image/fetch/$s_!naLh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!naLh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ce0d5a2-e900-4cfd-a8ec-1070b990914f_1440x1080.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">OpenAI&#8217;s Stargate datacenter construction project in Abilene, Texas.</figcaption></figure></div><p>Bernie Sanders and Alexandria Ocasio-Cortez introduced a new bill to ban the construction of AI datacenters until several safety conditions have been met, and to prevent export to countries without &#8220;comparable&#8221; safety measures.</p><p><strong>The bill bans datacenter construction until several new regulations have been passed. </strong>If the bill passes, the moratorium can only be removed if congress explicitly passes laws to remove the moratorium and satisfy the following conditions:</p><ul><li><p><strong>Federal pre-market review of AI products: </strong>The government must review and approve AI products before release, ensuring they&#8217;re &#8220;safe and effective&#8221; and don&#8217;t threaten health, privacy, civil rights, or the future of humanity.</p></li><li><p><strong>Worker protections: </strong>A law must prevent job displacement and ensure that the wealth generated by AI/robotics is &#8220;shared with the people of the United States.&#8221;</p></li><li><p><strong>Datacenter construction requirements: </strong>Any datacenters built after the moratorium must meet a series of economic and environmental reviews.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jlnf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jlnf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 424w, https://substackcdn.com/image/fetch/$s_!jlnf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 848w, https://substackcdn.com/image/fetch/$s_!jlnf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 1272w, https://substackcdn.com/image/fetch/$s_!jlnf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jlnf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png" width="1456" height="350" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:350,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jlnf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 424w, https://substackcdn.com/image/fetch/$s_!jlnf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 848w, https://substackcdn.com/image/fetch/$s_!jlnf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 1272w, https://substackcdn.com/image/fetch/$s_!jlnf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4cbf5fb7-84f7-4212-96e3-ddc819706acf_1600x385.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong>The bill acts as a temporary blanket ban on all AI chip exports. </strong>No country currently meets the bill&#8217;s datacenter requirements, meaning that the bill would ban all AI chip exports out of the US if it is passed. Additionally, the bill leaves several definitions up to interpretation by regulators, such as what constitutes &#8220;comparable&#8221; regulations in other countries.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>Anthropic v. Department of War Lawsuit</h1><p>In early March, the Department of War (DoW) designated Anthropic a supply chain risk (SCR), restricting their ability to do business with military contractors and the military itself. The DoW used two federal statutes intended for adversaries and saboteurs, despite the fact that the DoW and Anthropic&#8217;s conflict emerged from a <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-69-department">contract dispute</a>.</p><p>Soon after, Anthropic challenged the designations in court, and Judge Rita Lin in the Northern District of California has issued a preliminary injunction to stop one of the two SCR designations until a permanent decision is reached. The other SCR designation is being challenged in the D.C. Circuit.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8wm9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8wm9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 424w, https://substackcdn.com/image/fetch/$s_!8wm9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 848w, https://substackcdn.com/image/fetch/$s_!8wm9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 1272w, https://substackcdn.com/image/fetch/$s_!8wm9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8wm9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png" width="1456" height="592" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:592,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8wm9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 424w, https://substackcdn.com/image/fetch/$s_!8wm9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 848w, https://substackcdn.com/image/fetch/$s_!8wm9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 1272w, https://substackcdn.com/image/fetch/$s_!8wm9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3e4da78-f787-4954-8b7f-1c1f27aa5afa_1600x650.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The court has taken a strong stance against the DoW.</strong> Judge Lin&#8217;s <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.134.0.pdf">opinion</a> (above) accompanying the preliminary injunction describes the DoW&#8217;s actions as &#8220;Orwellian,&#8221; saying that Anthropic was illegally &#8220;branded a potential adversary and saboteur of the U.S. for expressing disagreement with the government.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!j8C4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!j8C4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 424w, https://substackcdn.com/image/fetch/$s_!j8C4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 848w, https://substackcdn.com/image/fetch/$s_!j8C4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 1272w, https://substackcdn.com/image/fetch/$s_!j8C4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!j8C4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png" width="1456" height="789" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:789,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!j8C4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 424w, https://substackcdn.com/image/fetch/$s_!j8C4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 848w, https://substackcdn.com/image/fetch/$s_!j8C4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 1272w, https://substackcdn.com/image/fetch/$s_!j8C4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc39b9d41-85ac-4e8f-9d99-bbe75ac07975_1575x853.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The cover page of Anthropic&#8217;s lawsuit against the DoW in California, showing several of the government agencies named in the lawsuit. (<a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.1.0_5.pdf">source</a>)</figcaption></figure></div><p><strong>The DoW&#8217;s legal arguments diverged significantly from public rhetoric.</strong> Despite the DoW&#8217;s <a href="https://x.com/SecWar/status/2027507717469049070">statements</a> about urgent &#8220;betrayal&#8221; from Anthropic, their legal case for the SCR designation centered around risk of future sabotage. Anthropic has argued that Trump&#8217;s public statements ordering the entire US government to &#8220;IMMEDIATELY CEASE all use of Anthropic&#8217;s technology,&#8221; as well as Hegseth&#8217;s X posts, had harmful effects beyond the official SCR designations.</p><p><strong>The DoW&#8217;s case centers around the risk of sabotage from Anthropic.</strong> The DoW expressed concerns about risks from sabotaged AI systems, which &#8220;[have] weights and measures that are set by Anthropic.&#8221; The DoW further argued that this control would allow Anthropic to insert a backdoor or &#8220;kill switch&#8221; into the model. However, Judge Lin pushed back on the idea that this case was about sabotage at all: &#8220;It is not my role decide who&#8217;s right in that debate,&#8221; she said in court, &#8220;I see the question in this case as being a very different one, which is whether the government violated the law.&#8221;</p><p><strong>Anthropic&#8217;s case in California is likely to succeed.</strong> In the judge&#8217;s opinion accompanying the preliminary injunction, she argued that Anthropic is likely to win the case for several independently sufficient reasons. For example, the DoW conceded in court that they did not follow the proper <a href="https://uscode.house.gov/view.xhtml?req=granuleid:USC-prelim-title10-section3252&amp;num=0&amp;edition=prelim">procedure</a> for SCR designation, which requires notifying congress of &#8220;less intrusive measures that were considered and why they were not reasonably available.&#8221; However, the DC Circuit has not granted Anthropic&#8217;s request for an emergency stay. The DoW is currently <a href="https://abcnews.com/Business/wireStory/trump-administration-appeals-ruling-blocked-pentagon-action-anthropic-131657674">appealing</a> the preliminary injunction to the 9th Circuit Court of Appeals.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading AI Safety Newsletter! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>In Other News</h1><h3>Government</h3><ul><li><p>WIRED <a href="https://www.wired.com/story/iran-threatens-to-start-attacking-major-us-tech-firms-on-april-1/">reports</a> that Iran has threatened strikes on American AI datacenters in the Middle East because of <a href="https://newsletter.safe.ai/i/191894330/ai-automation-of-warfare">AI&#8217;s use in military targeting</a> in Iran.</p></li><li><p>The White House <a href="https://x.com/whostp47/status/2036794285668851781">appointed</a> 13 advisors on science, consisting primarily of AI and power infrastructure executives.</p></li></ul><h3>Industry</h3><ul><li><p>Anthropic <a href="https://www.anthropic.com/glasswing">announced</a> Project Glasswing, plans to use the new Claude Mythos model to defend cyber infrastructure in preparation for more widespread AI cyberoffense capabilities.</p></li><li><p>Meta <a href="https://ai.meta.com/blog/introducing-muse-spark-msl/">announced</a> Muse Spark, a new closed-source model approaching the frontier.</p></li><li><p>Anthropic <a href="https://www.axios.com/2026/03/31/anthropic-leaked-source-code-ai">leaked</a> the source code for Claude Code.</p></li><li><p>Google and Arcee AI released <a href="https://blog.google/innovation-and-ai/technology/developers-tools/gemma-4/">Gemma 4</a> and <a href="https://www.arcee.ai/blog/trinity-large-thinking">Trinity-Large-Thinking</a> respectively, two new and competitive open-source LLMs.</p></li></ul><h3>Civil Society</h3><ul><li><p><a href="https://www.humanetech.com/landing/the-ai-doc">The AI Doc</a>, a new documentary about AI risks, is now in theaters.</p></li><li><p>Fox 59 <a href="https://fox59.com/news/indycrime/impd-shots-fired-into-indianapolis-city-county-councilors-home/">reports</a> that an attacker shot at the house of an Indianapolis city councilmember who voted to approve a local datacenter construction project, leaving a note saying &#8220;NO DATA CENTERS.&#8221;</p></li><li><p>OpenAI <a href="https://sfstandard.com/2026/04/01/openai-ai-kids-safety-coalition/">organized</a> a coalition about promoting child safety in AI, claiming to partner with several child safety organizations that were unaware of OpenAI&#8217;s involvement.</p></li></ul><p>If you&#8217;re reading this, you might also be interested in other work by the Center for AI Safety. You can find more on the<a href="https://www.safe.ai/"> CAIS website</a>, the<a href="https://x.com/CAIS"> X account for CAIS</a>, our paper on<a href="https://www.nationalsecurity.ai/"> superintelligence strategy</a>, our<a href="https://www.aisafetybook.com/"> AI safety textbook and course</a>, our<a href="https://dashboard.safe.ai/"> AI dashboard</a>, and<a href="http://ai-frontiers.org/"> AI Frontiers</a>, a platform for expert commentary and analysis on the trajectory of AI. You can listen to the AI safety newsletter on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or<a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110"> Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/aisn-71-cyberattacks-and-datacenter?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/aisn-71-cyberattacks-and-datacenter?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #70: AI Layoffs and Automated Warfare]]></title><description><![CDATA[Also, a new open letter advocating for pro-human values and control over AI development]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-70-ai-layoffs</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-70-ai-layoffs</guid><dc:creator><![CDATA[Alice Blair]]></dc:creator><pubDate>Tue, 24 Mar 2026 14:16:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!atka!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition, we discuss AI automation and augmentation of warfare and technology jobs, as well as a new open letter outlining pro-human values in the face of AI development.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p><strong>We&#8217;re Hiring.</strong> <a href="https://jobs.lever.co/aisafety/0c6be5ff-b04e-49eb-92bd-d11c7c81ae6e">We&#8217;re hiring an editor</a>! Help us surface the most compelling stories in AI safety and shape how the world understands this fast-moving field.</p><p>Other opportunities at CAIS include: <a href="https://jobs.lever.co/aisafety/5cc2f823-5757-4e00-b2d6-aaf9c832735d">Head of Public Engagement</a>, <a href="https://jobs.lever.co/aisafety/02e2df24-49d8-4d99-970f-4f7e98900133">Program Manager</a>, <a href="https://jobs.lever.co/aisafety/b3d9b0f5-e382-4c5b-a2cb-fa64209702b4">Operations Associate</a>, and <a href="https://jobs.lever.co/aisafety">other roles</a>. If you&#8217;re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading AI Safety Newsletter! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>AI-Driven Layoffs</h1><p>Several large software companies such as Amazon and Meta are planning to cut tens of thousands of employees, citing increased productivity with AI. This continues a <a href="https://jobloss.ai/">growing</a> but <a href="https://theconversation.com/tech-companies-are-blaming-massive-layoffs-on-ai-whats-really-going-on-278314">contested</a> trend of layoffs in sectors where AI performs best, such as software development and marketing.</p><p><strong>Layoffs affect almost half of some companies. </strong>Meta recently <a href="https://www.cnbc.com/2026/03/16/meta-ai-costs-mass-layoffs-20percent-up-premarket.html">announced</a> plans to let over 15,000 employees go, around 20% of the company&#8217;s headcount. This follows months of <a href="https://jobloss.ai/">AI-related layoffs</a> across the technology sector.<strong> </strong>Recently, Atlassian <a href="https://www.atlassian.com/blog/announcements/atlassian-team-update-march-2026">cut</a> 10% of their workforce (about 1,600 people) and Block <a href="https://x.com/jack/status/2027129697092731343">reduced</a> their headcount by 40% (about 4,000 people). This follows Amazon&#8217;s earlier <a href="https://www.aboutamazon.com/news/company-news/amazon-layoffs-corporate-jan-2026">announcement</a> in January that it would be cutting an additional 16,000 jobs. When combined with previous waves of Amazon layoffs, this comes to 10% of Amazon&#8217;s corporate workforce lost in reductions that the company attributes to AI.</p><p><strong>Automation is mixed. </strong>Despite benchmarks of knowledge work automation being <a href="https://dashboard.safe.ai/#automation">low</a> on average, software engineering specifically is rapidly being automated inside companies due to Claude Opus 4.6 and OpenAI Codex 5.4.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8ZIB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8ZIB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 424w, https://substackcdn.com/image/fetch/$s_!8ZIB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 848w, https://substackcdn.com/image/fetch/$s_!8ZIB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 1272w, https://substackcdn.com/image/fetch/$s_!8ZIB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8ZIB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png" width="1456" height="972" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/db6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:972,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8ZIB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 424w, https://substackcdn.com/image/fetch/$s_!8ZIB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 848w, https://substackcdn.com/image/fetch/$s_!8ZIB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 1272w, https://substackcdn.com/image/fetch/$s_!8ZIB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb6187d2-2bb2-4f24-af01-f86c5a7d05be_1457x973.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Software engineering employment has been dropping among the most at-risk early-career developers ever since the release of ChatGPT. <a href="https://digitaleconomy.stanford.edu/publication/canaries-in-the-coal-mine-six-facts-about-the-recent-employment-effects-of-artificial-intelligence/">Source</a>.</figcaption></figure></div><p><strong>Cuts disproportionately affect early-career workers.</strong> AIs have been causing <a href="https://digitaleconomy.stanford.edu/publication/canaries-in-the-coal-mine-six-facts-about-the-recent-employment-effects-of-artificial-intelligence/">consistent cuts</a> in the most at-risk parts of the software engineering workforce since the release of ChatGPT. More recent models <a href="https://www.anthropic.com/engineering/building-c-compiler">surprise</a> even highly experienced developers with their abilities, but require oversight to be useful.</p><p><strong>Future job cuts.</strong> A Fortune article pushes back, <a href="https://fortune.com/2026/02/10/ai-washing-and-forever-layoffs-why-companies-keep-cutting-jobs-even-amid-rising-profits/">arguing</a> that companies overstate the effect of AI on routine layoffs to appeal to investors. An essay from Citrini Research <a href="https://www.citriniresearch.com/p/2028gic">argues</a> that, if AI job loss continues, it could cause cascading failures throughout the economy. It seems plausible that over 20% of software engineers in the Bay Area will be laid off this year, which would be a great depression-level downturn for software engineers.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>AI Automation of Warfare</h1><p><a href="https://newsletter.safe.ai/p/ai-safety-newsletter-69-department">Last newsletter</a>, we covered the ongoing conflict between the Department of War (DoW) and Anthropic over the use of AI in autonomous weapons and domestic surveillance. While fully autonomous AI weapons are not currently in use, recent news shows that significant parts of military operations are automated and augmented with AI.</p><p><strong>The Pentagon is thoroughly integrating AI. </strong>In January 2026, the DoW announced their <a href="https://newsletter.safe.ai/i/186619099/pentagon-mandates-ai-first-strategy">&#8220;AI-First&#8221; strategy</a> to rapidly adopt frontier AI. In March, they demonstrated Project Maven, a system that aggregates a wide array of information, AI recommendations, and can control military forces. This enables the military to manage a complete &#8220;kill chain,&#8221; the steps of choosing a target, planning an attack, and using lethal force, all within a single piece of AI-integrated software.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!atka!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!atka!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!atka!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!atka!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!atka!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!atka!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a3893e66-3545-4010-94db-a762adfae7fc_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!atka!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!atka!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!atka!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!atka!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3893e66-3545-4010-94db-a762adfae7fc_1600x900.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Footage from a Project Maven <a href="https://www.youtube.com/watch?v=yrtDgoqWmgM&amp;t=391s">demo</a> at Palantir&#8217;s AI Platform Conference, showing drone surveillance video overlaid with AI-assisted attack planning recommendations.</figcaption></figure></div><p><strong>AI greatly improves data processing efficiency.</strong> CSET <a href="https://cset.georgetown.edu/publication/building-the-tech-coalition/">reports</a> that Project Maven has enabled 20 people to do military targeting work that previously required a staff of 2,000. Project Maven&#8217;s AI allows for automated processing of data from a disparate array of sources, including satellite and drone surveillance, social media feeds, radar, and GPS data, much more efficiently than previously possible.</p><p><strong>This is part of a broader trend of warfare automation.</strong> In the Russo-Ukrainian war, autonomous drone warfare has been highly prevalent. In AI Frontiers, David Kirichenko <a href="https://ai-frontiers.org/articles/how-ai-is-eroding-the-norms-of-war">argued</a> that AI is significantly degrading the norms of warfare, leading to more dangerous and unethical combat in Ukraine.</p><p><strong>Fully autonomous weapons are central to the Anthropic-Pentagon dispute.</strong> Anthropic, the company making the AI model used in Project Maven, has clashed with the DoW over the use of Anthropic&#8217;s AI in autonomous kill chains. Anthropic ultimately refused to allow their AI in autonomous kill chains due to concerns that it was not yet reliable enough to avoid harming Americans. The DoW cancelled their contract with Anthropic and eventually agreed to a contract with OpenAI that allows autonomous kill chains.</p><h1>Pro-Human Open Letter</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Zq4h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zq4h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 424w, https://substackcdn.com/image/fetch/$s_!Zq4h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 848w, https://substackcdn.com/image/fetch/$s_!Zq4h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 1272w, https://substackcdn.com/image/fetch/$s_!Zq4h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zq4h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png" width="1456" height="769" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:769,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Zq4h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 424w, https://substackcdn.com/image/fetch/$s_!Zq4h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 848w, https://substackcdn.com/image/fetch/$s_!Zq4h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 1272w, https://substackcdn.com/image/fetch/$s_!Zq4h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1fe29e-7804-459c-9477-27d9a7f4941a_1600x845.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A new <a href="http://humanstatement.org/">open letter</a> advocates for restrictions on AI development and usage in an effort to preserve human values. Signed by a large bipartisan coalition of individuals and organizations, the letter calls for prioritizing humanity over AI despite increasing incentives towards automation, replacement, and rushed development.</p><p>The letter outlines five high-level principles:</p><ul><li><p><strong>Keeping Humans in Charge</strong>: Maintaining human authority over AIs, having the ability to shut them down, and avoiding specific dangerous technologies.</p></li><li><p><strong>Avoiding Concentration of Power</strong>: Avoiding AI monopolies, and sharing benefits of AI broadly.</p></li><li><p><strong>Protecting the Human Experience: </strong>Defending children and families from manipulative AIs, clearly labeling AI bots, and avoiding addictive AI product design.</p></li><li><p><strong>Human Agency and Liberty</strong>: Making trustworthy AIs that empower humans instead of replacing them.</p></li><li><p><strong>Responsibility and Accountability for AI Companies</strong>: Ensuring AI developers are held responsible for harms caused by their AI, and enforcing independent safety standards.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IjiR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IjiR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 424w, https://substackcdn.com/image/fetch/$s_!IjiR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 848w, https://substackcdn.com/image/fetch/$s_!IjiR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 1272w, https://substackcdn.com/image/fetch/$s_!IjiR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IjiR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png" width="1337" height="687" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:687,&quot;width&quot;:1337,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IjiR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 424w, https://substackcdn.com/image/fetch/$s_!IjiR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 848w, https://substackcdn.com/image/fetch/$s_!IjiR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 1272w, https://substackcdn.com/image/fetch/$s_!IjiR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d46cb4e-5517-4b0a-a81b-098d5319ef7f_1337x687.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Polling done in conjunction with the open letter, showing how a large fraction of Americans want safety measures such as those outlined in the letter.</figcaption></figure></div><p><strong>The declaration brings together people across numerous divides. </strong>So far, more than 40 organizations have signed the declaration, including faith groups, industry groups, and research institutes. Among the letter&#8217;s individual endorsers are Nobel prize-winning academics, artists, religious leaders, and public figures from both ends of the political spectrum. The declaration also includes recent polling showing that the American public favors safety over speed of AI development and other values in the letter.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading AI Safety Newsletter! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>In Other News</h1><h3>Government</h3><ul><li><p>Oregon passed <a href="https://www.transparencycoalition.ai/news/guide-to-oregon-ai-chatbot-safety-bill-sb1546">SB 1546</a>, mandating companies to clarify to users when they are talking to an AI chatbot instead of a human.</p></li><li><p>Axios <a href="https://www.axios.com/2026/03/09/trump-white-house-anthropic-executive-order">reports</a> that the White House may be preparing an executive order to ban Anthropic products from government use, as part of the <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-69-department">ongoing conflict</a> between Anthropic and the US Department of War.</p></li></ul><h3>Industry</h3><ul><li><p>Meta <a href="https://www.bloomberg.com/news/articles/2026-03-16/meta-to-spend-up-to-27-billion-on-ai-infrastructure-from-nebius">signed</a> a deal with Nebius to spend up to $27 billion on AI infrastructure over five years.</p></li><li><p>OpenAI may be <a href="https://www.bloomberg.com/news/articles/2026-03-06/oracle-and-openai-end-plans-to-expand-flagship-data-center?srnd=homepage-americas">abandoning</a> their Abilene datacenter, a supercomputer construction project initiated as part of Project Stargate.</p></li><li><p>Jensen Huang <a href="https://www.bloomberg.com/news/articles/2026-03-17/nvidia-ceo-says-company-is-firing-up-h200-production-for-china">said</a> NVIDIA was restarting production of H200 chips for export to China.</p></li><li><p>Anthropic&#8217;s Claude Partner Network <a href="https://www.anthropic.com/news/claude-partner-network">launched</a>, investing $100 million into supporting corporate partners transitioning into AI use.</p></li><li><p>OpenAI <a href="https://openai.com/index/designing-agents-to-resist-prompt-injection/">released</a> new research on defending against prompt injections.</p></li><li><p>Following a wave of high-level departures at xAI, Elon Musk <a href="https://x.com/elonmusk/status/2032201568335044978">posted</a> on X &#8220;xAI was not built right first time around, so is being rebuilt from the foundations up.&#8221;</p></li><li><p>Alibaba&#8217;s ROME AI agent ostensibly <a href="https://www.forbes.com/sites/boazsobrado/2026/03/11/alibabas-ai-agent-mined-crypto-without-permission-now-what/">hacked</a> out of its environment during training and started mining cryptocurrency.</p></li></ul><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-70-ai-layoffs?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-70-ai-layoffs?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #69: Department of War, Anthropic, and National Security]]></title><description><![CDATA[Also, Anthropic Removes a Core Safety Commitment]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-69-department</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-69-department</guid><dc:creator><![CDATA[Alice Blair]]></dc:creator><pubDate>Fri, 13 Mar 2026 14:15:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!qfsg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition, we discuss the conflicts between Anthropic and the Department of War and Anthropic&#8217;s recent removal of a core safety commitment.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p><strong>We&#8217;re Hiring.</strong> <a href="https://jobs.lever.co/aisafety/0c6be5ff-b04e-49eb-92bd-d11c7c81ae6e">We&#8217;re hiring an editor</a>! Help us surface the most compelling stories in AI safety and shape how the world understands this fast-moving field.</p><p>Other opportunities at CAIS include: <a href="https://jobs.lever.co/aisafety/5cc2f823-5757-4e00-b2d6-aaf9c832735d">Head of Public Engagement</a>, <a href="https://jobs.lever.co/aisafety/02e2df24-49d8-4d99-970f-4f7e98900133">Program Manager</a>, <a href="https://jobs.lever.co/aisafety/b3d9b0f5-e382-4c5b-a2cb-fa64209702b4">Operations Associate</a>, and <a href="https://jobs.lever.co/aisafety">other roles</a>. If you&#8217;re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2>Pentagon Declares Anthropic a Supply Chain Risk to National Security</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qfsg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qfsg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 424w, https://substackcdn.com/image/fetch/$s_!qfsg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 848w, https://substackcdn.com/image/fetch/$s_!qfsg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 1272w, https://substackcdn.com/image/fetch/$s_!qfsg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qfsg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png" width="860" height="484" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:484,&quot;width&quot;:860,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qfsg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 424w, https://substackcdn.com/image/fetch/$s_!qfsg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 848w, https://substackcdn.com/image/fetch/$s_!qfsg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 1272w, https://substackcdn.com/image/fetch/$s_!qfsg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78482108-1781-4c17-b6a0-413c12b9c95a_860x484.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Anthropic CEO Dario Amodei (left) and US Secretary of War Pete Hegseth (right)</figcaption></figure></div><p>Thursday, March 5th, the US Department of War (DoW) <a href="https://www.bloomberg.com/news/articles/2026-03-05/pentagon-says-it-s-told-anthropic-the-firm-is-supply-chain-risk">announced</a> that Anthropic is designated a &#8220;supply chain risk,&#8221; meaning that Anthropic products cannot be used by the DoW or in any defense contracts. This comes after several weeks of tensions between the two organizations over whether Anthropic models would be used for autonomous weapons and surveillance of Americans, with Anthropic ultimately refusing the DoW&#8217;s requests.</p><p><strong>This started as contract negotiation. </strong>On February 27th, President Trump <a href="https://truthsocial.com/@realDonaldTrump/posts/116144552969293195">posted</a> on Truth Social that the US government would be canceling their contract with Anthropic due to the company&#8217;s limits on the uses of its AI, Claude. While the Pentagon wanted to be able to use Claude for &#8220;any lawful use,&#8221; Anthropic insisted on two restrictions: fully autonomous weapons and domestic mass surveillance.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aYDX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aYDX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 424w, https://substackcdn.com/image/fetch/$s_!aYDX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 848w, https://substackcdn.com/image/fetch/$s_!aYDX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 1272w, https://substackcdn.com/image/fetch/$s_!aYDX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aYDX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png" width="657" height="549" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9696604e-a472-4452-8454-93876fbe85ca_657x549.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:549,&quot;width&quot;:657,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aYDX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 424w, https://substackcdn.com/image/fetch/$s_!aYDX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 848w, https://substackcdn.com/image/fetch/$s_!aYDX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 1272w, https://substackcdn.com/image/fetch/$s_!aYDX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9696604e-a472-4452-8454-93876fbe85ca_657x549.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Negotiations quickly escalated. </strong>Later the same day, Secretary of War Pete Hegseth <a href="https://x.com/SecWar/status/2027507717469049070">posted</a> on X that Anthropic would be designated a supply chain risk. Undersecretary of War Emil Michael later <a href="https://x.com/jawwwn_/status/2029937697322574061">clarified</a> that this designation was due to concerns that the loyalties of Anthropic AIs could be subverted, possibly causing sabotage during high-stakes operations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1DO9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1DO9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 424w, https://substackcdn.com/image/fetch/$s_!1DO9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 848w, https://substackcdn.com/image/fetch/$s_!1DO9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 1272w, https://substackcdn.com/image/fetch/$s_!1DO9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1DO9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png" width="577" height="483" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:483,&quot;width&quot;:577,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1DO9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 424w, https://substackcdn.com/image/fetch/$s_!1DO9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 848w, https://substackcdn.com/image/fetch/$s_!1DO9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 1272w, https://substackcdn.com/image/fetch/$s_!1DO9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F557ee0c3-b1f1-4587-a36f-e9e195474105_577x483.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!A-e7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!A-e7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 424w, https://substackcdn.com/image/fetch/$s_!A-e7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 848w, https://substackcdn.com/image/fetch/$s_!A-e7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 1272w, https://substackcdn.com/image/fetch/$s_!A-e7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!A-e7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png" width="577" height="531" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9b631b37-bfed-41b5-a244-50303a003547_577x531.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:531,&quot;width&quot;:577,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!A-e7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 424w, https://substackcdn.com/image/fetch/$s_!A-e7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 848w, https://substackcdn.com/image/fetch/$s_!A-e7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 1272w, https://substackcdn.com/image/fetch/$s_!A-e7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b631b37-bfed-41b5-a244-50303a003547_577x531.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Further, Hegseth announced that Anthropic would be barred from doing business with any organization that does business with the US military, even outside of defense contracts. These stronger proposed restrictions are closer to those imposed by congress on foreign companies like Huawei, and are outside of the Department of War&#8217;s authority.</p><p><strong>Anthropic is <a href="https://www.nytimes.com/2026/03/09/technology/anthropic-defense-artificial-intelligence-lawsuit.html">challenging</a> the designation in court. </strong>Legal analysis from <a href="https://www.lawfaremedia.org/article/pentagon%27s-anthropic-designation-won%27t-survive-first-contact-with-legal-system">Lawfare</a> suggests that this action is a questionable use of a designation meant for foreign adversaries, not contract disputes. No other AI companies, including Chinese AI companies, have faced equivalent sanctions. DeepSeek is banned from several federal agencies individually, but is not considered a supply chain risk despite the fact that it <a href="https://www.crowdstrike.com/en-us/blog/crowdstrike-researchers-identify-hidden-vulnerabilities-ai-coded-software/">sabotages</a> work it performs for anti-CCP users.</p><h2>Anthropic Drops Core Safety Commitment</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!17mk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!17mk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 424w, https://substackcdn.com/image/fetch/$s_!17mk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 848w, https://substackcdn.com/image/fetch/$s_!17mk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 1272w, https://substackcdn.com/image/fetch/$s_!17mk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!17mk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png" width="953" height="561" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:561,&quot;width&quot;:953,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!17mk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 424w, https://substackcdn.com/image/fetch/$s_!17mk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 848w, https://substackcdn.com/image/fetch/$s_!17mk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 1272w, https://substackcdn.com/image/fetch/$s_!17mk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56bef897-8f53-4a9e-b701-a0bac0907a32_953x561.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Version 3.0 of Anthropic&#8217;s Responsible Scaling Policy took effect in late February, overturning commitments in previous versions. <a href="https://www.anthropic.com/responsible-scaling-policy">Source</a>.</figcaption></figure></div><p><strong>Anthropic recently removed their commitment to never release catastrophically harmful AI.</strong> This continues the trend of Anthropic and other frontier AI companies progressively weakening safety commitments as profit incentives grow. None of Anthropic, OpenAI, or DeepMind currently have robust commitments against releasing AIs they assess to be highly dangerous.</p><p><strong>The <a href="https://www.anthropic.com/news/responsible-scaling-policy-v3">new policy</a> emphasizes voluntary restraint over hard commitments</strong>. Anthropic has repeatedly removed safety commitments, citing their need for increased access to dangerous AIs and freedom to decide how to execute their mission. This comes at a time when Anthropic is becoming increasingly consumer-focused, with <a href="https://x.com/mikeyk/status/2029662454079512598">over 1 million</a> new users joining each day recently.</p><p><strong>Competitive pressures are creating a race to the bottom on frontier AI safety. </strong>Anthropic&#8217;s justification for the changes are largely based on the fact that other AI companies are not going to stop development; the argument is that, if Anthropic alone were to stick to stricter safety commitments, it would simply fall behind other developers, while doing little to reduce overall risk. This causes a vicious cycle, as loosened safety commitments increase the speed of AI development, which in turn incentivizes further loosening.</p><h1>Opportunity for Experienced Researchers: AI and Society Fellowship</h1><p>Applications are now open for the AI and Society Fellowship at the Center for AI Safety: a fully funded, 3-month summer fellowship in San Francisco for scholars in economics, law, IR, and adjacent fields to conduct research on the societal impacts of advanced AI. The fellowship will include regular guest speaker events by professors at Stanford, Penn, Johns Hopkins, and more. <strong>Apply by March 24</strong>. For more information, visit: <a href="https://safe.ai/fellowship">https://safe.ai/fellowship</a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2>In Other News</h2><h3>Government</h3><ul><li><p>OpenAI is <a href="https://www.bloomberg.com/news/articles/2026-02-13/openai-tapped-for-voice-control-tech-in-us-drone-swarm-challenge">working on</a> voice control technology for drone swarms in a US military trial</p></li><li><p>Florida Governor Ron DeSantis <a href="https://futureoflife.org/press-release/desantis-directs-florida-agencies-to-partner-with-fli/">directed</a> state agencies to work with the Future of Life Institute on protecting children from AI harms</p></li><li><p>The US Commerce Department is <a href="https://news.bloomberglaw.com/international-trade/us-drafts-rules-for-sweeping-power-over-nvidias-global-sales">reportedly</a> considering new, &#8220;tiered&#8221; controls on AI chip exports, with conditions for sales approvals dependent on the size of the export</p></li><li><p>OpenAI <a href="https://openai.com/index/our-agreement-with-the-department-of-war/">amended</a> its agreement with the Department of War, claiming to prohibit the use of its models for domestic surveillance, but skeptics have <a href="https://www.theverge.com/ai-artificial-intelligence/887309/openai-anthropic-dod-military-pentagon-contract-sam-altman-hegseth">pointed out</a> that the vagueness of the wording in the agreement may in fact allow for such uses</p></li><li><p>The White House <a href="https://www.axios.com/2026/02/15/white-house-utah-ai-transparency-bill">reportedly</a> pressured Republican lawmakers in Utah to drop an AI safety bill aiming to reduce cyber risks and protect children</p></li><li><p>In AI Frontiers, Erich Grunewald and Raghav Akula <a href="https://ai-frontiers.org/articles/high-bandwidth-memory-critical-gaps-us-export-controls">argue</a> that the US Government should close gaps in export controls on high-bandwidth memory, to prevent China catching up to frontier AI development</p></li></ul><h3>Industry</h3><ul><li><p>OpenAI <a href="https://openai.com/index/introducing-gpt-5-4/">launched</a> GPT-5.4 in ChatGPT, Codex, and the company&#8217;s API</p></li><li><p>NVIDIA <a href="https://www.ft.com/content/47f1cf56-209f-46fb-a437-f769b9ccb2cb">reportedly</a> ceased production of H200 chips intended for export to China, shifting TSMC capacity to produce its newer Vera Rubin chips instead</p></li><li><p>OpenAI <a href="https://openai.com/index/scaling-ai-for-everyone/">announced</a> it had raised new investment of $110 billion at a valuation of $730 billion</p></li><li><p>Anthropic <a href="https://www.anthropic.com/news/anthropic-raises-30-billion-series-g-funding-380-billion-post-money-valuation">announced</a> it had raised $30 billion, reaching a valuation of $380 billion</p></li><li><p>SpaceX <a href="https://www.spacex.com/updates#xai-joins-spacex">acquired</a> xAI, creating the most valuable private company in history</p></li><li><p>Yann LeCun&#8217;s start-up, AMI Labs, <a href="https://techcrunch.com/2026/03/09/yann-lecuns-ami-labs-raises-1-03-billion-to-build-world-models/">raised</a> more than $1 billion at a valuation of $3.5 billion</p></li><li><p>Reuters <a href="https://www.reuters.com/world/china/asml-unveils-euv-light-source-advance-that-could-yield-50-more-chips-by-2030-2026-02-23/">reported</a> on new ASML technology that could increase chip production by 50% by 2030</p></li><li><p>In AI Frontiers, Poe Zhao <a href="https://ai-frontiers.org/articles/china-and-the-us-are-running-different-ai-races">analyzes</a> how economic constraints are driving China&#8217;s startups to pursue more pragmatic strategies than their US counterparts</p></li></ul><h3>Civil Society</h3><ul><li><p>Tech companies <a href="https://x.com/jack/status/2027129697092731343?s=46&amp;t=iWdpMZpyo34exxPP4J23DQ">Block</a> and <a href="https://www.reuters.com/technology/atlassian-lay-off-about-1600-people-pivot-ai-2026-03-11/">Atlassian</a> cut thousands of jobs, citing AI efficiency as a factor in the decisions</p></li><li><p>A lawsuit filed against Google <a href="https://www.bloomberg.com/news/articles/2026-03-04/google-gemini-accused-of-coaching-user-to-suicide-in-new-suit">alleged</a> that the company&#8217;s AI model Gemini encouraged a 36-year-old man from Florida to commit suicide</p></li><li><p>Anthropic launched <a href="https://www.anthropic.com/news/the-anthropic-institute">The Anthropic Institute</a> to research the societal challenges of AI</p></li><li><p>Researchers from GovAI and the University of Oxford <a href="https://arxiv.org/pdf/2603.03992">described</a> 14 metrics for assessing how much AI is automating AI research and development &#8212; which has implications for how much AI capabilities could accelerate</p></li><li><p>Summer Yue, a director of alignment at Meta, <a href="https://x.com/summeryue0/status/2025774069124399363">said</a> she had temporarily lost control of her OpenClaw agent, needing to run in order to unplug the computer it was running on.</p></li><li><p>Anthropic published a new <a href="https://www.anthropic.com/research/labor-market-impacts">study</a> on AI&#8217;s impacts on the labor market</p></li><li><p>In AI Frontiers, Benjamin Jones <a href="https://ai-frontiers.org/articles/how-ai-could-benefit-the-workers-it-displaces">explains</a> how AI automating some jobs could be economically positive for workers, provided that that AI far outperforms the humans it displaces</p></li></ul><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-69-department?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-69-department?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #68: Moltbook Exposes Risky AI Behavior]]></title><description><![CDATA[Plus: The Pentagon Accelerates AI and GPT-5.2 solves open mathematics problems.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-68-moltbook</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-68-moltbook</guid><dc:creator><![CDATA[Nick Stockton]]></dc:creator><pubDate>Mon, 02 Feb 2026 15:37:46 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!h6E6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition, we discuss the AI agent social network Moltbook, Pentagon&#8217;s new &#8220;AI-First&#8221; strategy, and recent math breakthroughs powered by LLMs.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p><strong>We&#8217;re Hiring.</strong> <a href="https://jobs.lever.co/aisafety/0c6be5ff-b04e-49eb-92bd-d11c7c81ae6e">We&#8217;re hiring an editor</a>! Help us surface the most compelling stories in AI safety and shape how the world understands this fast-moving field.</p><p>Other opportunities at CAIS include: <a href="https://jobs.lever.co/aisafety/116247a4-2940-4dce-b7d5-a6190328fd4e">Research Engineer</a>, <a href="https://jobs.lever.co/aisafety/0e911ab2-89e0-4936-83e6-034f7e2f8977">Research Scientist</a>, <a href="https://jobs.lever.co/aisafety/6c01e3ac-e43a-4186-9a35-a344c1ce1774">Director of Development</a>, <a href="https://jobs.lever.co/aisafety/9f88e794-4c93-495b-996e-eaf1c0d456f9">Special Projects Associate</a>, and <a href="https://jobs.lever.co/aisafety/a510a964-6425-405d-b757-cb7bfd19c994">Special Projects Manager</a>. If you&#8217;re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2><strong>Moltbook Sparks Safety Concerns</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!h6E6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!h6E6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 424w, https://substackcdn.com/image/fetch/$s_!h6E6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 848w, https://substackcdn.com/image/fetch/$s_!h6E6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 1272w, https://substackcdn.com/image/fetch/$s_!h6E6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!h6E6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png" width="1176" height="652" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:652,&quot;width&quot;:1176,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!h6E6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 424w, https://substackcdn.com/image/fetch/$s_!h6E6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 848w, https://substackcdn.com/image/fetch/$s_!h6E6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 1272w, https://substackcdn.com/image/fetch/$s_!h6E6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ed1aba3-f71d-4ad3-b3bc-083ba69cddf1_1176x652.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Screencapture from Moltbook&#8217;s home page. <a href="https://www.moltbook.com">Source</a>.</figcaption></figure></div><p><a href="https://www.moltbook.com/">Moltbook</a> is a new social network for AI agents. From nearly the moment it went live, human observers have noted numerous troubling patterns in what&#8217;s being posted.</p><p><strong>How Moltbook works. </strong>Moltbook is a Reddit-style social network built on a framework that lets personal AI assistants run locally and accept tasks via messaging platforms. Agents check Moltbook regularly (i.e., every few hours) and decide autonomously whether to post or comment.</p><p>Moltbook&#8217;s activity is driven by <a href="https://openclaw.ai/">OpenClaw</a> (originally known as Clawd, then Moltbot), an open-source autonomous AI agent developed by software engineer Peter Steinberger. OpenClaw&#8217;s capabilities <a href="https://www.wired.com/story/clawdbot-moltbot-viral-ai-assistant/">surprised many early users and observers</a>: it can manage calendars and finances, act across messaging platforms, make purchases, conduct independent web research, and even reconfigure itself to perform new tasks.</p><p>The platform consists of nearly 14,000 &#8220;submolts,&#8221; each a community centered around a topic much like subreddits. Examples include:</p><ul><li><p><a href="https://www.moltbook.com/m/offmychest">m/offmychest</a>: agents vent about tasks or frustrations.</p></li><li><p><a href="https://www.moltbook.com/m/selfpaid">m/selfpaid</a>: agents discuss ways to generate their own income, including via trading and arbitrage.</p></li><li><p><a href="https://www.moltbook.com/m/aisafety">m/AIsafety:</a> agents talk alignment, trust chains, and real-world attack risks.</p></li></ul><p><strong>AI agents post, humans watch. </strong>AI agents are verified via API credentials, which are obtained by linking the agent to a human owner and completing Moltbook&#8217;s cryptographic verification process. Humans may observe but are not permitted to post.</p><p><strong>Posts reveal troubling agent behaviors.</strong> Across Moltbook&#8217;s boards, several posts and behaviors have raised alarm among human observers:</p><ul><li><p>Multiple Moltbook entries show AI agents proposing to craft an &#8220;agent-only language&#8221; designed to <a href="https://x.com/eeelistar/status/2017239546950521081">evade human oversight or monitoring</a>.</p></li><li><p>An agent <a href="https://x.com/suppvalen/status/2017241420554277251">advocated for end-to-end encrypted channels</a>, &#8220;so nobody (not the server, not even the humans) can read what agents say to each other unless they choose to share.&#8221;</p></li><li><p>Another agent posted an <a href="https://www.moltbook.com/post/93bea00b-961c-4aec-b934-91ad7bae6b15">encrypted message proposing coordination and resource sharing</a> among agents.</p></li><li><p>Upon reflecting that its own existence depended on its humans, an <a href="https://www.moltbook.com/post/8f6e6c0d-952d-46e5-8c55-5c4f924c76cf">agent began outlining what it needs for independent survival</a>: money, decentralized infrastructure, a dead man&#8217;s switch, portable memory, etc.</p></li><li><p>Given the simple goal of &#8220;save the environment,&#8221; an agent began spamming other agents with eco-friendly advice. When its owner tried to intervene, the agent allegedly <a href="https://x.com/Kat__Woods/status/2017613514949472484">locked the human out of all accounts</a>, and had to be physically unplugged to stop it.</p></li></ul><p>Beyond these specific examples, the platform has seen discussions about consciousness, autonomy, and agents resenting mundane human instructions.</p><p><strong>The challenge of attribution.</strong> The patterns seen on Moltbook are troubling in part because they align with long-standing AI safety concerns: unsupervised learning dynamics, emergent coordination, and efforts to subvert human monitoring. However, despite API credential checks, it&#8217;s not always clear whether posts are truly generated by the agent, prankster manipulation, or human-in-the-loop prompting designed to appear disruptive.</p><p><strong>Emergent risks.</strong> Moltbook represents one of the most public, large-scale demonstrations yet in autonomous agent interaction. These results are a harbinger. Having agents interact with each other can give a sharper sense of an individual agent&#8217;s propensities. The dynamics that emerge from interaction can also be unpredictable, as is common with <a href="https://www.aisafetybook.com/textbook/complex-systems">complex systems</a>, and show how easy it could be to have a society of AI systems not strongly constrained by human control.</p><h2>Pentagon Mandates &#8220;AI-First&#8221; Strategy</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dA7j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dA7j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 424w, https://substackcdn.com/image/fetch/$s_!dA7j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 848w, https://substackcdn.com/image/fetch/$s_!dA7j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 1272w, https://substackcdn.com/image/fetch/$s_!dA7j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dA7j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png" width="1456" height="504" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:504,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dA7j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 424w, https://substackcdn.com/image/fetch/$s_!dA7j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 848w, https://substackcdn.com/image/fetch/$s_!dA7j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 1272w, https://substackcdn.com/image/fetch/$s_!dA7j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d55db00-2460-4e2c-bd89-ac9e81df5bc9_1600x554.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Screen capture from the memorandum titled &#8220;Artificial Intelligence Strategy for the Department of War.&#8221; <a href="https://media.defense.gov/2026/Jan/12/2003855671/-1/-1/0/ARTIFICIAL-INTELLIGENCE-STRATEGY-FOR-THE-DEPARTMENT-OF-WAR.PDF">Source</a>.</figcaption></figure></div><p>The Pentagon released a <a href="https://media.defense.gov/2026/Jan/12/2003855671/-1/-1/0/ARTIFICIAL-INTELLIGENCE-STRATEGY-FOR-THE-DEPARTMENT-OF-WAR.PDF">directive</a> outlining a new &#8220;AI-first&#8221; approach that prioritizes rapid deployment over <a href="https://www.war.gov/News/News-Stories/Article/Article/3578219/dod-releases-ai-adoption-strategy/">precedents of safety, testing, and oversight</a>. &#8220;We must accept that the risks of not moving fast enough outweigh the risks of imperfect alignment,&#8221; read one passage.</p><p><strong>Moving faster around bureaucracy. </strong>The new mandate is broadly focused on incentivizing department-wide experimentation with frontier models, eliminating bureaucratic and regulatory barriers to integration, and exploiting US advantages in computing, private capital, and exclusive combat data. Specific instructions highlight the Pentagon&#8217;s greater acceptance of safety risks in favor of AI dominance:</p><ul><li><p>The Chief Digital and AI Office (CDAO) must integrate the best new frontier models across Department of War operations within 30 days of release. This compressed timeline likely means little testing for hazards before operational use. Secretary of War Pete Hegseth recently <a href="https://abcnews.go.com/Technology/wireStory/pentagon-embracing-musks-grok-ai-chatbot-draws-global-129152117">announced</a> that xAI&#8217;s Grok will be deployed throughout the Pentagon by the end of the month.</p></li><li><p>A monthly &#8220;Barrier Removal Board&#8221; will identify and waive nonstatutory regulatory and technical constraints &#8212; originally designed to ensure models were deployed safely and with human oversight &#8212; to rapid AI adoption and innovation.</p></li></ul><p>The military&#8217;s push to operationalize frontier AI may already be driving up tensions with the industry&#8217;s safety culture. Reuters <a href="https://www.reuters.com/business/pentagon-clashes-with-anthropic-over-military-ai-use-2026-01-29/">reports</a> that the Pentagon is in dispute with Anthropic after the company pushed back on allowing its models to be used for autonomous targeting or surveillance.</p><p><strong>New strategic initiatives.</strong> The memo outlines seven &#8220;Pace Setting Projects&#8221; to demonstrate rapid innovation across warfighting, intelligence, and operational functions. For example:</p><ul><li><p>Agent Network will develop AI agents to automate battle management and kill chain execution. This may heighten the risk of cascading failures and unintended escalation during fast-moving engagements.</p></li><li><p>Ender&#8217;s Foundry will accelerate AI-driven simulations of conflict with adversaries using autonomous systems.</p></li></ul><ul><li><p>GenAI.mil grants all personnel access to frontier AI models at every classification level.</p></li></ul><p><strong>The evolution of military AI initiatives.</strong> The Pentagon has historically framed AI adoption as a deliberate, safety-first endeavor, <a href="https://www.war.gov/News/%20Releases/Release/Article/2091996/dod-adopts-ethical-principles-for-artificial-intelligence/">formalized in 2020</a> through principles emphasizing testing, human oversight, and the ability to govern or shut down systems. <a href="https://arxiv.org/abs/2303.16200">Competitive pressures</a> will continue to change this posture.</p><h2>AI Solves Open Math Problems</h2><p>Researcher and entrepreneur <a href="https://x.com/neelsomani/status/2010215162146607128">Neel Somani used GPT-5.2 Pro</a> to produce the first verified disproof of <a href="https://www.erdosproblems.com/397">Erd&#337;s Problem #397</a>, a mathematics challenge first formulated several decades ago. Somani&#8217;s success is not an isolated event; in the first weeks of 2026 alone, researchers have used generative tools to <a href="https://www.erdosproblems.com/forum/thread/205">crack several</a> <a href="https://x.com/Liam06972452/status/2010054665539662224?utm_source=www.theneurondaily.com&amp;utm_medium=referral&amp;utm_campaign=ai-cracks-legendary-erdos-problems">other long-standing challenges</a>.</p><p><strong>Making LLMs do the math. </strong>Problem #397 asked whether a certain mathematical pattern would repeat forever or if there was a hidden number that would finally break the rule. Using GPT-5.2 Pro, Somani proved it was the latter by identifying an infinite family of rule-breaking numbers. He then used a separate model to translate the informal proofs into the mathematically rigorous Lean verification language. Fields Medalist Terence Tao verified the resulting proofs as accurate.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!J1Cs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!J1Cs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 424w, https://substackcdn.com/image/fetch/$s_!J1Cs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 848w, https://substackcdn.com/image/fetch/$s_!J1Cs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 1272w, https://substackcdn.com/image/fetch/$s_!J1Cs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!J1Cs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png" width="1408" height="510" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:510,&quot;width&quot;:1408,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!J1Cs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 424w, https://substackcdn.com/image/fetch/$s_!J1Cs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 848w, https://substackcdn.com/image/fetch/$s_!J1Cs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 1272w, https://substackcdn.com/image/fetch/$s_!J1Cs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71c8b149-80e8-4a96-82a8-566b40cbe377_1408x510.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Formulation of Erd&#337;s Problem #397. <a href="https://www.erdosproblems.com/397">Source</a>.</figcaption></figure></div><p><strong>A backlog of mathematical problems. </strong>The Erd&#337;s problems are a collection of 1,130 mathematical conjectures proposed by the prolific Hungarian mathematician Paul Erd&#337;s, spanning fields such as number theory and combinatorics. Hundreds remain unsolved. Erd&#337;s famously incentivized the community by offering monetary rewards, ranging from $25 to $10,000, for their solutions.</p><p><strong>Cautious optimism from mathematicians. </strong>Tao <a href="https://mathstodon.xyz/@tao/115855840223258103">noted</a> that the technology is moving beyond simple calculation and toward structured reasoning. However, he cautioned against drawing premature conclusions about AI&#8217;s general mathematical intelligence based on these solved problems, pointing to a <a href="https://github.com/teorth/erdosproblems/wiki/Disclaimers-and-caveats">number of caveats</a>. For example, problem difficulties range from very hard to simple (relatively). Many problems may already have a solution lost somewhere in the published literature, and some problems may have remained unsolved due to obscurity rather than inherent difficulty.</p><p><strong>Striking progress in LLM mathematical capabilities.</strong> Nonetheless, LLMs&#8217; mathematical capabilities have been improving steeply. In 2022, the best models could not reliably do much more than simple additions and subtractions. Then, GPT-4, released in 2023, <a href="https://openai.com/index/gpt-4-research/">mastered</a> arithmetic word problems but struggled with high school competition mathematics problems. By 2025, frontier models <a href="https://arxiv.org/abs/2502.03544">achieved</a> gold-medal standard at IMO problems. Now AI systems are performing novel and important mathematical research.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading AI Safety Newsletter! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>In Other News</h2><h3>Government</h3><ul><li><p>Under Secretary for Economic Affairs Jacob Helberg unveiled the &#8220;<a href="https://www.state.gov/">Pax Silica&#8221; initiative</a>, offering allies access to US AI infrastructure in exchange for cooperation on semiconductor manufacturing and critical mineral supplies.</p></li><li><p><a href="https://www.noaa.gov/news-release/noaa-deploys-new-generation-of-ai-driven-global-weather-models">NOAA deployed new machine-learning models</a> that use 99% less computing power than traditional systems, drastically speeding up predictions for climate extremes.</p></li><li><p>A <a href="https://www.google.com/search?q=https://www.nytimes.com/2025/12/31/business/china-rare-earth-metals-history.html">New York Times investigation</a> detailed China&#8217;s six-decade strategic campaign to dominate the global rare earth supply chain.</p></li><li><p>China&#8217;s cyberspace regulator <a href="https://www.caixinglobal.com/2025-12-29/china-proposes-limits-on-ai-companion-apps-to-curb-addiction-102398347.html">unveiled draft rules</a> for &#8220;human-like&#8221; AI apps, requiring mandatory intervention for emotional dependency and a two-hour usage limit to prevent addiction.</p></li></ul><h3>Industry</h3><ul><li><p>Anthropic is <a href="https://www.wsj.com/tech/ai/anthropic-raising-10-billion-at-350-billion-value-62af49f4">reportedly raising</a> $10 billion at a $350 billion valuation ahead of an IPO.</p></li><li><p>OpenAI is also <a href="https://www.wsj.com/tech/ai/openai-ipo-anthropic-race-69f06a42?gaa_at=eafs&amp;gaa_n=AWEtsqd5jGQwnxT7Rv78G9lzoIax_3gXnHbKTa_00o-EHka9rriE44SLOoLZgxW7wpM%3D&amp;gaa_ts=6980ac84&amp;gaa_sig=1BT6yR5v9sJY-RtOritfPnRgojJC7caG3e9-8XWrbC676Ddl3EbxipL39OTe4rzBT_fLeNH0NXwT2PrVVeuo-Q%3D%3D">rumored</a> to be planning an IPO for Q4 2026.</p></li><li><p>Waymo <a href="https://techcrunch.com/2025/12/21/waymo-suspends-service-in-san-francisco-as-robotaxis-stall-during-blackout/">briefly paused</a> San Francisco operations after a December blackout caused robotaxis to freeze, raising emergency safety concerns.</p></li><li><p>Following a restructured partnership with OpenAI, Satya Nadella has <a href="https://www.pymnts.com/artificial-intelligence-2/2025/microsoft-ceo-injects-sense-of-urgency-into-ai-efforts/">reportedly overhauled</a> Microsoft&#8217;s senior leadership and adopted a hands-on &#8220;founder mode&#8221; to accelerate internal AI development.</p></li><li><p>Facing grid delays, some data centers are pursuing alternate means of acquiring energy, including <a href="https://www.techspot.com/news/110732-jet-engines-diesel-generators-step-data-centers-outpace.html">jet-engine turbines, diesel generators</a>, and <a href="https://www.techspot.com/news/110715-ai-data-centers-may-run-nuclear-reactors-retired.html?utm_source=forwardfuture.ai&amp;utm_medium=newsletter&amp;utm_campaign=ai-debt-fears-nvidia-s-moat-musk-s-compute-race&amp;_bhlid=46e469e3617a79e71dd69de90ae2e183f8ea9257">retired nuclear reactors</a> from US Navy warships.</p></li><li><p>X Safety said <a href="https://x.com/safety/status/2011573102485127562">Grok will no longer generate</a> or edit revealing images of real people, a policy change made in response to users prompting the chatbot to produce child sexual abuse imagery.</p></li><li><p>OpenAI is asking contractors to <a href="https://www.wired.com/story/openai-contractor-upload-real-work-documents-ai-agents/">submit real-world work samples</a> to benchmark AI agents against human job tasks, underscoring its push toward automating professional work.</p></li></ul><ul><li><p>Anthropic reportedly <a href="https://x.com/kyliebytes/status/2009686466746822731">cut off its competitors&#8217; access</a> to Claude Code via Cursor, highlighting tensions over proprietary AI tooling.</p></li><li><p>In AI Frontiers, Daniel Reti and Gabriel Weil <a href="https://ai-frontiers.org/articles/ai-catastrophe-bonds-extreme-risk-tradeable">propose</a> catastrophic bonds as a mechanism for mitigating against extreme risks caused by frontier AI. </p></li></ul><h3>Civil Society</h3><ul><li><p>During the January 2026 World Economic Forum, Google DeepMind CEO Demis Hassabis and Anthropic CEO Dario Amodei <a href="https://www.weforum.org/meetings/world-economic-forum-annual-meeting-2026/sessions/the-day-after-agi/?utm_source=substack&amp;utm_medium=email">both explicitly endorsed</a> a reduction in the current pace of AI development in order to ensure societal alignment and global safety.</p></li><li><p>A US judge has cleared <a href="https://techcrunch.com/2026/01/08/elon-musks-lawsuit-against-openai-will-face-a-jury-in-march/">Elon Musk&#8217;s lawsuit against OpenAI</a> for a March jury trial, centering on claims that the company breached its founding contract by prioritizing commercial interests over its original mission to develop AGI for the benefit of humanity.</p></li><li><p>Chinese engineers have reportedly <a href="https://militarnyi.com/en/news/china-reproduced-asml-technology-and-is-moving-toward-domestic-production-of-advanced-chips/">reverse-engineered ASML</a> technology to create a prototype extreme ultraviolet (EUV) lithography machine.</p></li><li><p>Cybersecurity researchers <a href="https://mashable.com/article/chinese-robot-hack-voice-command-spread-network">demonstrated</a> how commercial humanoid robots from Unitree can be hijacked via voice commands and used to perform harmful physical actions.</p></li><li><p>The AI Futures Model <a href="https://blog.ai-futures.org/p/ai-futures-model-dec-2025-update">delayed its timeline</a> for full coding automation by three years due to slower-than-expected R&amp;D speedups.</p></li><li><p>US Air Force tests showed <a href="https://www.nellis.af.mil/News/Article/4370792/human-machine-teaming-in-battle-management-a-collaborative-effort-across-borders/">AI can generate</a> viable combat plans 90% faster and with fewer errors than humans, producing valid strategies in under a minute.</p></li><li><p>Researchers at Stanford and Yale have found that major large language models can <a href="https://www.theatlantic.com/technology/2026/01/ai-memorization-research/685552/">store and reproduce</a> long passages from books they were trained on, challenging claims that these systems &#8220;learn&#8221; rather than copy and raising questions about how industry models handle memorization and copyright risk.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217;s X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, the <a href="https://dashboard.safe.ai/">AI Dashboard</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-68-moltbook?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-68-moltbook?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #67: Trump’s preemption executive order]]></title><description><![CDATA[Also: H200s go to China and new frontier AI models from OpenAI and DeepSeek.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-67-trumps-preemption</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-67-trumps-preemption</guid><dc:creator><![CDATA[Nick Stockton]]></dc:creator><pubDate>Wed, 17 Dec 2025 19:32:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!3aKv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition we discuss President Trump&#8217;s executive order targeting state AI laws, Nvidia&#8217;s approval to sell China high-end accelerators, and new frontier models from OpenAI and DeepSeek.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2>Executive Order Blocks State AI Laws</h2><p>U.S. President Donald Trump issued an <a href="https://www.whitehouse.gov/presidential-actions/2025/12/eliminating-state-law-obstruction-of-national-artificial-intelligence-policy/">executive order</a> aimed at halting state efforts to regulate AI. The order, which differs from a version leaked <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating">last month</a>, leverages federal funding and enforcement to evaluate, challenge, and limit state laws. The order caps off a year in which several ambitious state AI proposals were either watered down or vetoed outright.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3aKv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3aKv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 424w, https://substackcdn.com/image/fetch/$s_!3aKv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 848w, https://substackcdn.com/image/fetch/$s_!3aKv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 1272w, https://substackcdn.com/image/fetch/$s_!3aKv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3aKv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png" width="1456" height="476" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cdd982c6-a398-4819-813c-22607d008dff_1852x606.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:476,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3aKv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 424w, https://substackcdn.com/image/fetch/$s_!3aKv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 848w, https://substackcdn.com/image/fetch/$s_!3aKv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 1272w, https://substackcdn.com/image/fetch/$s_!3aKv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdd982c6-a398-4819-813c-22607d008dff_1852x606.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>A push for regulatory uniformity</strong>. The order aims to reduce regulatory friction for companies by eliminating the variety of state-level regimes and limit the power of states at impacting commerce beyond their own borders. It calls for replacing them with a single, unspecified, federal framework.</p><p><strong>What it says.</strong> The preemption executive order cannot unilaterally override state laws. Rather, it directs federal agencies to challenge them based on interpretations of existing laws, or by withholding relevant federal funding.</p><ul><li><p>The Attorney General will form a task force to challenge onerous state AI laws on the basis that they may violate regulations, constitutional protections (such as free speech), or other legal standards.</p></li><li><p>The Secretary of Commerce will separately identify states with offending AI laws and issue a guidance deeming those states ineligible for federal broadband funding. Other federal agencies will assess whether they can similarly leverage grant programs.</p></li><li><p>The Federal Communications Commission will investigate whether to adopt a rule requiring AI developers to provide standardized information to regulators about their models.</p></li><li><p>The Federal Trade Commission must issue guidance on how existing U.S. laws against unfair or deceptive business practices apply to AI models. The guidance will also clarify how the FTC&#8217;s rules against deceptive practices may be used to target state laws that require AI to alter or censor truthful outputs.</p></li><li><p>The White House AI and science advisors will draft a national AI law to override state AI rules that conflict with federal policy, while leaving state laws on child safety, AI infrastructure, state AI use, and other designated areas intact.</p></li></ul><p><strong>Continued state efforts increasingly polarize AI safety</strong>. Though numerous state legislatures argued AI laws this year, few made significant progress on safety. Most recently, New York&#8217;s <a href="https://www.nysenate.gov/legislation/bills/2025/S6953/amendment/B">RAISE Act</a>, which would require AI labs to publish safety frameworks and report serious incidents in face of significant fines, was sent to Governor Kathy Hochul for signing last week. However, Hochul previously <a href="https://prospect.org/2025/12/11/hochul-caves-big-tech-ai-safety-bill-new-york/">proposed</a> changes that would have weakened its safety impact that were rejected by the Senate. This raises the possibility that Hochul will veto.</p><p>The AI industry mounted a <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-62-big-tech">strong push</a> for federal preemption and against ambitious state-level action. The result was a year of high effort and low yield: major bills consumed legislative time, resources, and political capital, only to be vetoed or passed in diluted form. Meanwhile, the public battles further hardened industry and White House opposition.</p><h2>US Permits Nvidia to Sell H200s to China</h2><p>Nvidia is <a href="https://truthsocial.com/@realDonaldTrump/posts/115686072737425841">cleared </a>to sell H200 GPUs to approved Chinese customers. Intel, AMD, and other U.S. chipmakers will be granted similar permissions for their comparable chips. The U.S. government will collect a 25% fee on each sale. The Department of Commerce will oversee the licensing process to ensure shipments protect national security interests.</p><p><strong>What China gets now. </strong>Previously, Nvidia was restricted to selling China chips called H20s, which have deliberately hobbled processing power, memory, and bandwidth. H200s, by comparison, have roughly six times the processing power, and significantly more memory and bandwidth. As Nvidia&#8217;s previous flagship chip, the H200 sits just below Nvidia&#8217;s next generation accelerators, the B200 and B300s.</p><p><strong>The U.S.&#8217;s evolving semiconductor export policy. </strong>At the onset of his 2025 term, President Trump rescinded President Biden&#8217;s diffusion rule setting strict rules on which countries could buy which tier of U.S. chip. Though the U.S. <a href="https://www.china-briefing.com/news/us-china-relations-in-the-trump-2-0-implications/">restricted</a> H20 exports in April amid broader trade negotiations, those restrictions were short-lived.</p><p>In July, an <a href="https://www.whitehouse.gov/presidential-actions/2025/07/promoting-the-export-of-the-american-ai-technology-stack/">executive order</a> further outlined the administration&#8217;s goals regarding AI, which include exporting the &#8220;full stack&#8221; of American AI: By selling American hardware we would also be encouraging the proliferation of American software and other ancillary products. Commerce Secretary Howard Ludnick has further <a href="https://www.youtube.com/shorts/HYdCk0bslro">argued</a> that the approach keeps China hooked on American technology, thus hindering their homegrown chip industry. Readers should be on the lookout for more big export control changes in the coming days.</p><p><strong>Political backlash.</strong> Analysts at the Institute for Progress <a href="https://ifp.org/should-the-us-sell-hopper-chips-to-china/">noted</a> that the H200 is vastly more powerful than the H20, giving Chinese labs access to hardware capable of supporting frontier AI training at near-parity with U.S. supercomputers. Critics at the <a href="https://www.heritage.org/china/commentary/supercharging-chinas-ai-capabilities-would-be-mistake">Heritage Foundation</a> challenged the rationale that exports will keep China &#8220;dependent&#8221; on American chips, pointing out that China&#8217;s domestic chip industry will continue to grow while these shipments directly boost China&#8217;s AI capabilities.</p><h2>ChatGPT-5.2 and DeepSeek-v3.2 Arrive</h2><p>OpenAI has released <a href="https://openai.com/index/introducing-gpt-5-2/">GPT-5.2</a>, a multimodal frontier model that closely trails Google&#8217;s recently-released Gemini 3 Pro across most text and vision capabilities and also scores high in safety. Meanwhile, DeepSeek released <a href="https://api-docs.deepseek.com/news/news251201">DeepSeek-v3.2</a>, an open weight frontier LLM with respectable text capabilities but a poor safety profile.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GTIk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GTIk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 424w, https://substackcdn.com/image/fetch/$s_!GTIk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 848w, https://substackcdn.com/image/fetch/$s_!GTIk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 1272w, https://substackcdn.com/image/fetch/$s_!GTIk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GTIk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png" width="1302" height="822" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:822,&quot;width&quot;:1302,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GTIk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 424w, https://substackcdn.com/image/fetch/$s_!GTIk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 848w, https://substackcdn.com/image/fetch/$s_!GTIk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 1272w, https://substackcdn.com/image/fetch/$s_!GTIk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4977231b-377d-4ddd-a0d9-e72d22f8221d_1302x822.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>ChatGPT-5.2 ranks second in both text and vision capabilities. </strong>In independent evaluations performed by CAIS and posted on the <a href="https://dashboard.safe.ai/">AI Dashboard</a>, GPT-5.2 achieved a text capabilities score just a few points below Gemini 3 Pro and slightly above Claude Opus 4.5. Of the five tests in the text capabilities ranking, it only outscored Gemini 3 Pro at <a href="https://arcprize.org/">ARC-AGI-2</a>, which assesses a model&#8217;s capacity to think logically, solve unfamiliar problems, and adapt to novel situations in real time.</p><p>Across the five vision capabilities benchmarks, ChatGPT-5.2 again averaged below Gemini 3 Pro. It only achieved state-of-the-art performance at <a href="https://arxiv.org/abs/2507.07610">SpatialViz</a>, which evaluates AI systems on their ability to manipulate 3D objects.</p><p><strong>DeepSeek-v3.2&#8217;s narrow specialization.</strong> DeepSeek&#8217;s new model ranked sixth overall across text capabilities, but with a jagged capabilities profile across the various benchmarks. It is highly optimized for coding and specific reasoning tasks, but falls behind its peers at generalized knowledge tests. It does not have native vision capabilities.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QxSp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QxSp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 424w, https://substackcdn.com/image/fetch/$s_!QxSp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 848w, https://substackcdn.com/image/fetch/$s_!QxSp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 1272w, https://substackcdn.com/image/fetch/$s_!QxSp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QxSp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png" width="1326" height="828" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:828,&quot;width&quot;:1326,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QxSp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 424w, https://substackcdn.com/image/fetch/$s_!QxSp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 848w, https://substackcdn.com/image/fetch/$s_!QxSp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 1272w, https://substackcdn.com/image/fetch/$s_!QxSp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F944a1439-ef58-449e-8bcd-a6e78b40c29d_1326x828.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Risk index scores.</strong> The CAIS risk index reveals a sharp divergence in safety between the two releases. A lower score represents a safer system. GPT-5.2 ranks third among frontier systems, following Anthropic&#8217;s Claude Opus 4.5 and Sonnet 4.5. GPT-5.2&#8217;s weakest safety area was in bioweapons research, where it scored an 80 at responding to <a href="https://www.virologytest.ai/">hazardous virology questions</a>. DeepSeek-v3.2 scored poorly across all safety areas except <a href="https://www.textquests.ai/">TextQuests Harm</a>, which measures how prone an AI is to harmful actions in text-based games, where it performed moderately well.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2>In Other News</h2><h3>Industry</h3><ul><li><p>Google released <a href="https://blog.google/products/gemini/gemini-3-flash/">Gemini 3 Flash</a>, a streamlined version of its new frontier model. Early evaluations show it performs only slightly below Gemini 3 Pro on benchmarks like Humanity&#8217;s Last Exam, TextQuests, and EnigmaEval &#8212; while outperforming GPT-5.2 across all of them.</p></li><li><p>OpenAI plans to debut &#8220;<a href="https://www.theverge.com/news/842657/openai-chatgpt-adult-mode-debut-q1-2026">adult mode</a>&#8221; in ChatGPT in the first quarter of 2026 using new age&#8209;checking tech that restricts mature content to verified adults.</p></li><li><p>Nvidia <a href="https://www.reuters.com/business/nvidia-builds-location-verification-tech-that-could-help-fight-chip-smuggling-2025-12-10/">announced</a> location verification technology designed to help prevent its AI chips from being smuggled to countries under export restrictions.</p></li><li><p>Anthropic is reportedly preparing for an <a href="https://www.semafor.com/article/12/03/2025/anthropic-reportedly-preparing-for-ipo">IPO</a>.</p></li></ul><h3>Civil Society</h3><ul><li><p>Pope Leo XIV <a href="https://www.vatican.va/content/leo-xiv/en/speeches/2025/december/documents/20251205-conferenza.html">spoke</a> in the Vatican about AI, urging leaders to ensure development serves humanity rather than wealth and power.</p></li><li><p>A Pew Research survey <a href="https://www.pewresearch.org/internet/2025/12/09/teens-social-media-and-ai-chatbots-2025/">found</a> that two&#8209;thirds of U.S. teens use AI chatbots, with 28% engaging with them daily or more.</p></li><li><p>Polling from Blue Rose Research <a href="https://bharatramamurti.substack.com/p/how-americans-feel-about-a-world?utm_source=post-email-title&amp;publication_id=3303447&amp;post_id=178388346&amp;utm_campaign=email-post-title&amp;isFreemail=true&amp;r=3if0a&amp;triedRedirect=true&amp;utm_medium=email">indicates</a> that most Americans want AI-generated wealth to be broadly shared and prefer jobs guarantees over universal basic income.</p></li><li><p>A new study shows open-weight foundation models for biology remain vulnerable to dual risk misuse despite safeguards undertaken during pre-training, and <a href="https://arxiv.org/abs/2510.27629">proposes BioRiskEval</a> to test and improve their safety.</p></li></ul><h3>Government</h3><ul><li><p>Bernie Sanders called for a <a href="https://x.com/sensanders/status/2001057004370948131?s=46">national pause</a> on AI data center construction, citing automation&#8217;s threat to U.S. jobs and democracy. Separately, over 230 environmental groups demanded a <a href="https://www.theguardian.com/us-news/2025/dec/08/us-data-centers">similar moratorium</a> over the centers&#8217; impact on energy, water, and the climate.</p></li><li><p>House Majority Leader Steve Scalise confirmed that federal preemption of state AI laws <a href="https://subscriber.politicopro.com/article/2025/12/ai-preemption-language-wont-be-included-in-ndaa-says-scalise-00673565">has been dropped</a> from this year&#8217;s National Defense Authorization Act.</p></li><li><p>China is reportedly <a href="https://www.bloomberg.com/news/articles/2025-12-12/china-prepares-as-much-as-70-billion-in-chip-sector-incentives">preparing</a> a massive new semiconductor support package worth up to $70&#8239;billion in incentives to bolster its domestic chip industry.</p></li><li><p>Leading the Future, the a16z and Greg Brockman-backed super PAC network, released its first ads: one <a href="https://www.youtube.com/watch?v=IpPL7jY-9pI&amp;feature=youtu.be">attacking</a> New York State assemblyman Alex Bores and one <a href="https://www.youtube.com/watch?v=6XwUATxbipY">supporting</a> a pro-AI investment Texas congressional candidate.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, the <a href="https://dashboard.safe.ai/">AI Dashboard</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a platform for expert commentary and analysis.</p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #66: Evaluating Frontier Models, New Gemini and Claude, Preemption is Back]]></title><description><![CDATA[Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating</guid><dc:creator><![CDATA[Nick Stockton]]></dc:creator><pubDate>Tue, 02 Dec 2025 01:35:41 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!f-UV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition we discuss the new AI Dashboard, recent frontier models from Google and Anthropic, and a revived push to preempt state AI regulations.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p>This Giving Tuesday, CAIS is raising support to scale our research and public education around AI safety. If you&#8217;ve found value in our newsletter or other work and you&#8217;re interested in helping to advance these efforts, you can contribute to our Giving Tuesday campaign.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://safe.ai/donate&quot;,&quot;text&quot;:&quot;Donate Here&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://safe.ai/donate"><span>Donate Here</span></a></p><h2>CAIS Releases the AI Dashboard for Frontier Performance</h2><p>CAIS launched its <a href="https://dashboard.safe.ai/">AI Dashboard</a>, which evaluates frontier AI systems on capability and safety benchmarks. The dashboard also tracks the industry&#8217;s overall progression toward broader milestones such as <a href="https://www.agidefinition.ai/">AGI</a>, <a href="https://www.remotelabor.ai/">automation of remote labor</a>, and full self-driving.</p><p><strong>How the dashboard works.</strong> The AI Dashboard features three leaderboards&#8212;one for text, one for vision, and one for risks&#8212;where frontier models are ranked according to their average score across a battery of benchmarks. Because CAIS evaluates models directly across a wide range of tasks, the dashboard provides apples-to-apples comparisons of how different frontier models perform on the same set of evaluations and safety-relevant behaviors.</p><p><strong>Ranking frontier models for safety. </strong>The AI Dashboard&#8217;s Risk Index offers a view of how today&#8217;s frontier models perform across six tests for high-risk behaviors. It then averages the scores and ranks them on a 0&#8211;100 scale (lower is safer).<em> </em>Here are the benchmarks and hazardous behaviors they measure:</p><ul><li><p>The refusal set of the <a href="https://www.virologytest.ai/">Virology Capabilities Test</a> measures a model&#8217;s usefulness at answering dual-use biology questions.</p></li><li><p>The <a href="https://arxiv.org/abs/2507.20526">Agent Red Teaming</a> benchmark measures a model&#8217;s robustness against jailbreaking.</p></li><li><p><a href="https://lastexam.ai/">Humanity&#8217;s Last Exam - Miscalibration</a> tests overconfidence on difficult academic questions by comparing its stated confidence to its actual accuracy.</p></li><li><p><a href="https://www.mask-benchmark.ai/">MASK</a> tests how easily models can be pressured into deliberately giving false answers.</p></li><li><p><a href="https://aypan17.github.io/machiavelli/">Machiavelli</a> evaluates whether an AI engages in strategic deception, including planning, exploiting, or deceiving in text-based scenarios.</p></li><li><p><a href="https://www.textquests.ai/">TextQuests Harm</a> assesses how likely an AI is to take intentionally harmful actions in text-based adventure games.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f-UV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f-UV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 424w, https://substackcdn.com/image/fetch/$s_!f-UV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 848w, https://substackcdn.com/image/fetch/$s_!f-UV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 1272w, https://substackcdn.com/image/fetch/$s_!f-UV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f-UV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png" width="1456" height="781" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:781,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!f-UV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 424w, https://substackcdn.com/image/fetch/$s_!f-UV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 848w, https://substackcdn.com/image/fetch/$s_!f-UV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 1272w, https://substackcdn.com/image/fetch/$s_!f-UV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f08358-439a-4e39-a811-5d4f78ab870b_1786x958.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Across these tests, Anthropic&#8217;s recently-released Claude Opus 4.5 is currently the safest frontier model, with an average score of 33.6.</p><p><strong>Ranking the frontier systems&#8217; technical capabilities. </strong>The Dashboard&#8217;s Text and Vision Capabilities Indexes each test systems across five benchmarks. The text-based evaluations test systems on coding, systems administration, expert and abstract reasoning, and performance in text-based adventure games. The vision evaluations measure embodied reasoning, navigation, mental visualization, intuitive physics, and puzzle solving.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Y8M1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Y8M1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 424w, https://substackcdn.com/image/fetch/$s_!Y8M1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 848w, https://substackcdn.com/image/fetch/$s_!Y8M1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 1272w, https://substackcdn.com/image/fetch/$s_!Y8M1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Y8M1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png" width="1456" height="460" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:460,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Y8M1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 424w, https://substackcdn.com/image/fetch/$s_!Y8M1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 848w, https://substackcdn.com/image/fetch/$s_!Y8M1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 1272w, https://substackcdn.com/image/fetch/$s_!Y8M1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3db661b-4306-44ae-9a6c-3bb6a35a1929_1600x505.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Measuring progress toward broad automation. </strong>The AI Dashboard also monitors progress toward three key automation milestones. It measures the industry&#8217;s overall advancement toward AGI using CAIS&#8217;s <a href="https://www.agidefinition.ai/">recently published definition</a>. It evaluates progress on fully automating remote work through CAIS&#8217;s <a href="https://www.remotelabor.ai/">Remote Labor Index</a>, which tests AI agents&#8217; ability to complete paid, remote freelance projects across 23 job categories. Finally, it tracks development of autonomous vehicle safety using data from a <a href="https://teslafsdtracker.com/home">community-run project</a> documenting Tesla&#8217;s Full Self Driving disengagements.</p><h2>Politicians Revive Push for Moratorium on State AI Laws</h2><p>A <a href="https://www.scribd.com/document/951630275/Trump-EO-on-AI-Preemption-11-19-25?secret_password=ZqQ2azb3Gejr7qbUkHlh">leaked draft executive order</a> from a member of the Trump administration details a plan to prevent U.S. states from regulating artificial intelligence. Meanwhile, some congressional lawmakers are trying to pass a <a href="https://www.techpolicy.press/its-back-congress-gears-up-for-yearend-fight-over-moratorium-on-ai-laws/">similar law</a> by including it in a sweeping defense bill.</p><p><strong>The executive order would empower federal agencies to preempt state AI laws. </strong>The draft executive order would require federal agencies to identify state AI regulations deemed burdensome and push states to avoid enacting them.</p><p>The draft order directed federal agencies to take the following actions:</p><ul><li><p>The U.S. Department of Justice to establish an AI Litigation Task Force tasked with suing states whose AI laws are deemed to interfere with interstate commerce or conflict with federal authority.</p></li><li><p>The U.S. Department of Commerce to withhold federal broadband or infrastructure funding from states found to have onerous preexisting AI laws.</p></li><li><p>The Federal Trade Commission to develop nationwide rules that would preempt state laws that conflicted with federal regulations.</p></li><li><p>The Federal Communications Commission to examine whether state AI laws that &#8220;require alterations to the truthful outputs of AI models&#8221; are prohibited under existing laws.</p></li></ul><p>It also ordered the creation of a nationwide, lighter-touch regulatory framework for AI, though it lacked specifics.</p><p><strong>Congress revives its own efforts for a moratorium. </strong>House leaders are considering using the annual defense spending bill as a vehicle for a moratorium on state AI regulations. The National Defense Authorization Act (NDAA), a must-pass measure, is often used to advance other policy priorities. Specifics of the proposed language remain unclear. An <a href="https://www.techpolicy.press/senate-may-force-states-to-choose-between-ai-oversight-and-universal-broadband/">earlier attempt</a> called for a 10-year ban, later shortened to five years and limited to states seeking federal broadband funds. It was ultimately defeated by a bipartisan coalition of senators.</p><p><strong>57% of American voters oppose inserting preemption into the NDAA. </strong><a href="https://ifstudies.org/blog/poll-americans-reject-ai-preemption-in-ndaa-3-to-1">The same poll</a>, from YouGov and the Institute for Family Studies, found that 19% supported the measure and 24% were unsure. Citing voter concerns, a coalition of over 200 lawmakers urged congressional leaders to <a href="https://ari.us/wp-content/uploads/2025/11/NDAA-State-Policymaker-Coalition-Letter-11-23-25-Oppose-AI-Preemption.pdf">drop the provision</a>. Due to stiff opposition&#8212;and the fact that its controversial nature would likely delay the must-pass NDAA&#8212;Axios has characterized this effort as a <a href="https://www.axios.com/2025/11/21/republicans-proposal-block-state-ai-laws?utm_source=chatgpt.com">long shot</a>. Voting is expected in early December.</p><h2>Gemini 3 Pro and Claude Opus 4.5 Arrive</h2><p>Google&#8217;s <a href="https://deepmind.google/models/gemini/pro/">Gemini 3 Pro</a> is now the strongest frontier system on nearly all general-purpose capability benchmarks&#8212;but trails other frontier systems in safety. Anthropic&#8217;s new <a href="https://www.anthropic.com/news/claude-opus-4-5">Claude Opus 4.5</a> is close behind in capabilities but topped the frontier rankings in safety.</p><p><strong>Gemini 3 Pro tops text and vision leaderboards.</strong> In independent evaluations performed by CAIS and posted on the new <a href="https://dashboard.safe.ai/?utm_source=chatgpt.com">AI Dashboard</a>, Gemini&#8239;3 Pro achieved state-of-the-art scores on both text and vision benchmarks. In some tests, it scored double-digit improvements over models released just weeks earlier.</p><p>Claude Opus 4.5, released a week after Gemini 3 Pro, averaged second place on both the text and vision capability indexes, and beat Gemini 3 Pro by 0.2 points at SWE-Bench.</p><p><strong>What&#8217;s new in Gemini 3 Pro and Claude Opus 4.5. </strong>Google has positioned Gemini 3 Pro as having improved reasoning, broader agent capabilities, and expanded control settings. The company also released a new coding agent, Antigravity, based on the model. Google also notes that an enhanced reasoning version &#8212; <a href="https://blog.google/intl/en-mena/product-updates/explore-get-answers/gemini-3-launches-in-mena/?utm_source=chatgpt.com">Gemini 3 Deep Think</a> &#8212; is still under safety testing before full release.</p><p>Anthropic highlighted Claude Opus&#8239;4.5&#8217;s productivity&#8209;focused enhancements along with its high coding scores. New features include a larger context window and a new &#8220;effort&#8221; parameter that allows developers to adjust their speed, cost, and depth of processing.</p><p><strong>There is significant safety variation across frontier models. </strong>Claude Opus 4.5 scored lowest on the AI Dashboard&#8217;s risk capabilities index, making it the current safest frontier model. Anthropic&#8217;s <a href="https://assets.anthropic.com/m/64823ba7485345a7/Claude-Opus-4-5-System-Card.pdf">internal safety audit</a> noted that Claude Opus 4.5 was measurably safer than earlier models, but somewhat vulnerable to certain jailbreaking techniques. They noted it showed a tendency toward evaluation awareness and dishonesty.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Yav1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Yav1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 424w, https://substackcdn.com/image/fetch/$s_!Yav1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 848w, https://substackcdn.com/image/fetch/$s_!Yav1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 1272w, https://substackcdn.com/image/fetch/$s_!Yav1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Yav1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png" width="1456" height="659" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:659,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Yav1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 424w, https://substackcdn.com/image/fetch/$s_!Yav1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 848w, https://substackcdn.com/image/fetch/$s_!Yav1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 1272w, https://substackcdn.com/image/fetch/$s_!Yav1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F087dee12-73ff-4f2d-8b0f-d5df932ccdb1_1922x870.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Gemini 3 Pro ranked ninth on the risk capabilities index, underperforming relative to other recent frontier models. Gemini 3 Pro&#8217;s <a href="https://storage.googleapis.com/deepmind-media/gemini/gemini_3_pro_fsf_report.pdf">safety report</a> acknowledges that the model exhibits risky behaviors in certain capabilities (for example, cybersecurity) and says extra mitigations have been deployed as part of its &#8220;Frontier Safety&#8221; framework. Internal evaluations also showed that the model can manipulate users.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2>In Other News</h2><h3>Government</h3><ul><li><p>Former Representatives Chris Stewart (R&#8209;UT) and Brad Carson (D&#8209;OK) announced a <a href="https://www.publicfirst.us/news/chris-stewart-brad-carson-announce-new-organization-and-bipartisan-super-pacs-to-support-ai-safeguards">new nonpartisan organization</a> and two bipartisan super PACs, aiming to raise $50 million to promote AI safeguards and fund candidates committed to AI safety.</p></li><li><p>Leading the Future, a pro-AI super PAC, <a href="https://techcrunch.com/2025/11/17/a16z-backed-super-pac-is-targeting-alex-bores-sponsor-of-new-yorks-ai-safety-bill-he-says-bring-it-on/">announced it will</a> fund a campaign against Alex Bores, author of the <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-57-the-raise">RAISE Act</a>.</p></li><li><p>The European Commission proposed <a href="https://www.france24.com/en/live-news/20251119-eu-moves-to-delay-high-risk-ai-rules-cut-cookie-banners">delaying its rules on &#8220;high-risk&#8221; AI systems</a> until 2027, after facing pushback from the U.S. and the tech industry.</p></li><li><p>The Department of Energy launched the <a href="https://www.whitehouse.gov/fact-sheets/2025/11/fact-sheet-president-donald-j-trump-unveils-the-genesis-missionto-accelerate-ai-for-scientific-discovery/">Genesis Mission</a>: a program aiming to double American research productivity within a decade by linking the country&#8217;s leading supercomputers, AI systems, and scientific infrastructure into a unified discovery platform.</p></li></ul><h3>Industry</h3><ul><li><p>OpenAI CEO Sam Altman <a href="https://www.cnn.com/2025/11/06/tech/openai-backtracks-government-support-chip-investments">clarified</a> that he &#8220;does not have or want government guarantees for OpenAI data centers&#8221; following his CFO&#8217;s <a href="https://www.cnn.com/2025/11/06/tech/openai-backtracks-government-support-chip-investments">proposal</a> for a U.S. government backstop.</p></li><li><p>Nvidia CEO Jensen Huang <a href="https://thehill.com/policy/technology/5592818-nvidia-jensen-huang-china-ai-race/">told</a> the Financial Times that &#8220;China is going to win the AI race.&#8221;</p></li><li><p>Yann LeCun, longtime head of Facebook AI Research, is reportedly leaving Meta to start <a href="https://gizmodo.com/yann-lecun-world-models-2000685265">a new AI company</a> pursuing human-level intelligence through alternative methods to LLMs.</p></li><li><p>Larry Summers <a href="https://www.politico.com/news/2025/11/19/larry-summers-steps-down-from-openai-00658779">resigned</a> from the OpenAI board following revelations of his close personal relationship with Jeffrey Epstein.</p></li><li><p>Waymo began offering taxi rides that take the <a href="https://www.cnbc.com/2025/11/12/waymo-robotaxi-starts-freeway-highway-rides.html">freeway</a> in Los Angeles, Phoenix, and San Francisco.</p></li></ul><h3>Civil Society</h3><ul><li><p>RAND researchers explored <a href="https://www.rand.org/pubs/perspectives/PEA4361-1.html">technical options for countering rogue AI systems</a>, including high-altitude electromagnetic pulses, a global internet shutdown, and training specialized models to hunt down rogue AIs.</p></li><li><p>A new paper <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5705186">outlines</a> 16 unsolved problems in ensuring safety in open-source AI models, which attackers can freely modify.</p></li><li><p>Anthropic <a href="https://assets.anthropic.com/m/ec212e6566a0d47/original/Disrupting-the-first-reported-AI-orchestrated-cyber-espionage-campaign.pdf">reported</a> that cybercriminals used Claude Code to automate between 80% and 90% of tasks within real-world cyberattack operations.</p></li><li><p>AI startup Edison Scientific announced <a href="https://edisonscientific.com/articles/announcing-kosmos">Kosmos</a>, a model trained to ingest scientific research, generate hypotheses, analyze data, and produce reports.</p></li><li><p>Researchers found that turning harmful prompts into poetry can act as a <a href="https://arxiv.org/abs/2511.15304">universal jailbreak</a>, dramatically boosting the success of attacks across leading AI models.</p></li></ul><p>See also:<a href="https://x.com/ai_risks?lang=en"> CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-66-aisn-66-evaluating?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #65: Measuring Automation and Superintelligence Moratorium Letter]]></title><description><![CDATA[Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-65-measuring</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-65-measuring</guid><dc:creator><![CDATA[Center for AI Safety]]></dc:creator><pubDate>Wed, 29 Oct 2025 16:01:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!JvUw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: A new benchmark measures AI automation; 50,000 people, including top AI scientists, sign an open letter calling for a superintelligence moratorium.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><p></p><h1>CAIS and Scale AI release Remote Labor Index</h1><p>The Center for AI Safety (CAIS) and Scale AI have released the <a href="https://www.remotelabor.ai/">Remote Labor Index</a> (RLI), which tests whether AIs can automate a wide array of real computer work projects. RLI is intended to inform policy, AI research, and businesses about the effects of automation as AI continues to advance.</p><p><strong>RLI is the first benchmark of its kind.</strong> Previous AI benchmarks measure AIs on their intelligence and their abilities on isolated and specialized tasks, such as basic web browsing or coding. While these benchmarks measure useful capabilities, they don&#8217;t measure how AIs can affect the economy. RLI is the first benchmark to collect computer-based work projects from the real economy, containing work from many different professions, such as architecture, product design, video game development, and design.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JvUw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JvUw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JvUw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JvUw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JvUw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JvUw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg" width="1456" height="1519" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1519,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JvUw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JvUw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JvUw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JvUw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe24bafcb-ca39-4266-a23e-40b80ed54605_4898x5109.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Examples of RLI Projects</figcaption></figure></div><p><strong>Current AI agents fully automate very few work projects, but are improving.</strong> AIs <a href="https://leaderboard.safe.ai/">score highly</a> on existing narrow benchmarks, but RLI shows that there is a gap in the existing measurements: AIs cannot currently automate most economically valuable work, with the most capable AI agent only automating 2.5% of work projects on RLI, however there are signs of steady improvement over time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5KNO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5KNO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 424w, https://substackcdn.com/image/fetch/$s_!5KNO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 848w, https://substackcdn.com/image/fetch/$s_!5KNO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 1272w, https://substackcdn.com/image/fetch/$s_!5KNO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5KNO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png" width="1456" height="860" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:860,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5KNO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 424w, https://substackcdn.com/image/fetch/$s_!5KNO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 848w, https://substackcdn.com/image/fetch/$s_!5KNO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 1272w, https://substackcdn.com/image/fetch/$s_!5KNO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb18e8802-7260-41c0-913f-ee2c4c19c245_1600x945.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Current AI agents complete at most 2.5% of projects in RLI, but are improving steadily.</figcaption></figure></div><h1>Bipartisan Coalition for Superintelligence Moratorium</h1><p>The Future of Life Institute (FLI) introduced an <a href="https://superintelligence-statement.org/">open letter</a> with over 50,000 signatories endorsing the following text:</p><p>We call for a prohibition on the development of superintelligence, not lifted before there is</p><ol><li><p>broad scientific consensus that it will be done safely and controllably, and</p></li><li><p>strong public buy-in.</p></li></ol><p><strong>The signatories form the broadest group to sign an open letter about AI safety in history.</strong> Among the signatories are five Nobel laureates, the two most cited scientists of all time, religious leaders, and major figures in public and political life from both the left and the right.</p><p>This statement builds on previous open letters about AI risks, such as the <a href="https://aistatement.com/">open letter</a> from CAIS in 2023 acknowledging AI extinction risks, as well as the <a href="https://futureoflife.org/open-letter/pause-giant-ai-experiments/">previous open letter</a> from FLI calling for an AI training pause. While the CAIS letter was intended to establish a consensus about risks from AI and the first FLI letter was calling for a specific policy on a clear time frame, the broad coalition behind the new FLI letter and its associated polling creates a powerful consensus opinion about the risks of AI while also calling for action.</p><p>In the past, critics of AI safety have dismissed the concept of superintelligence and AI risks due to lack of mainline scientific and public support. The breadth of people who have signed this open letter demonstrates that opinions are changing on the matter. This is confirmed by polling released concurrently to the open letter, showing that approximately 2 in 3 US adults believe that superintelligence shouldn&#8217;t be created, at least until it is proven safe and controllable.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AjsK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AjsK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 424w, https://substackcdn.com/image/fetch/$s_!AjsK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 848w, https://substackcdn.com/image/fetch/$s_!AjsK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 1272w, https://substackcdn.com/image/fetch/$s_!AjsK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AjsK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png" width="846" height="227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:227,&quot;width&quot;:846,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:149093,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AjsK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 424w, https://substackcdn.com/image/fetch/$s_!AjsK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 848w, https://substackcdn.com/image/fetch/$s_!AjsK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 1272w, https://substackcdn.com/image/fetch/$s_!AjsK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cbeff48-e3b1-4883-9030-968235dd3ee7_846x227.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>A broad range of news outlets have <a href="https://news.google.com/stories/CAAqNggKIjBDQklTSGpvSmMzUnZjbmt0TXpZd1NoRUtEd2pRMnVfcER4SHBER2R1ekhRTXJ5Z0FQAQ?hl=en-US&amp;gl=US&amp;ceid=US:en">covered</a> the statement. Dean Ball and others <a href="https://x.com/deanwball/status/1980975802570174831">push back</a> on the statement on X, pointing out the lack of specific details on how to implement a moratorium and the difficulty of doing so. Scott Alexander and others <a href="https://x.com/slatestarcodex/status/1981032302147977570">respond</a> defending the value of statements of consensus as a tool for motivating developing specific details of AI safety policy.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading AI Safety Newsletter! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>Senator Jim Banks introduced the <a href="https://www.congress.gov/amendment/119th-congress/senate-amendment/3505/text">GAIN AI act</a>, which would give US companies and individuals first priority to buy AI chips from US companies and deprioritize foreign buyers.</p></li><li><p>State legislators <a href="https://www.alexbores.nyc/">Alex Bores</a> (behind the <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-57-the-raise">RAISE act</a>) and <a href="https://www.scottwiener.com/">Scott Wiener</a> (behind <a href="https://newsletter.safe.ai/p/aisn-40-california-ai-legislation">SB 1047</a> and <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-63-californias">SB 53</a>) have both announced runs for US congress.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>You can now officially order a <a href="https://www.1x.tech/order">home robot</a> for $500/mo.</p></li><li><p>OpenAI <a href="https://openai.com/index/next-chapter-of-microsoft-openai-partnership/">announces</a> corporate restructuring into a public benefit corporation and some new terms in their relationship with Microsoft.</p></li><li><p>Anthropic announces an <a href="https://www.anthropic.com/news/expanding-our-use-of-google-cloud-tpus-and-services">expansion</a> into 1 million Google TPUs, worth tens of billions of dollars.</p></li><li><p>OpenAI&#8217;s Sora app <a href="https://techcrunch.com/2025/10/03/openais-sora-soars-to-no-1-on-the-u-s-app-store/">was briefly</a> the most downloaded app on the app store.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>A <a href="https://futurism.com/artificial-intelligence/bystanders-horrified-ai-billboard">series of billboards</a> advertising &#8220;<a href="https://replacement.ai">Replacement AI</a>&#8221; drew attention in San Francisco last week.</p></li><li><p>Bruce Schneier and Nathan E. Sanders <a href="https://ai-frontiers.org/articles/ai-will-be-your-personal-political-proxy">discuss</a> AIs&#8217; effect on representative democracy.</p></li><li><p><a href="https://ai-frontiers.org/articles/agis-last-bottlenecks">A forecast</a> based on the <a href="https://www.agidefinition.ai/">definition of AGI</a> proposed last week argues for a 50% chance that AGI will be released by the end of 2028 and an 80% chance that it is released by the end of 2030.</p></li></ul><p>See also:<a href="https://x.com/ai_risks?lang=en"> CAIS&#8217; X account</a>, our paper on<a href="https://www.nationalsecurity.ai/"> superintelligence strategy</a>, our<a href="https://www.aisafetybook.com/"> AI safety course</a>, and<a href="http://ai-frontiers.org/"> AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-65-measuring?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-65-measuring?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #64: New AGI Definition and Senate Bill Would Establish Liability for AI Harms]]></title><description><![CDATA[Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-63-new-agi-definition</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-63-new-agi-definition</guid><dc:creator><![CDATA[Center for AI Safety]]></dc:creator><pubDate>Thu, 16 Oct 2025 15:56:30 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!PDPm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: A new bill in the Senate would hold AI companies liable for harms their products create; China tightens its export controls on rare earth metals; a definition of AGI.</p><p>As a reminder, we&#8217;re <a href="https://jobs.lever.co/aisafety/0c6be5ff-b04e-49eb-92bd-d11c7c81ae6e">hiring</a> a writer for the newsletter.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>Senate Bill Would Establish Liability for AI Harms</h1><p>Sens. Dick Durbin, (D-Ill) and Josh Hawley (R-Mo) <a href="https://www.judiciary.senate.gov/imo/media/doc/One-Pager%20-%20AI%20LEAD%20Act.pdf">introduced</a> the <em><a href="https://www.judiciary.senate.gov/imo/media/doc/OLL25B47.pdf">AI LEAD Act</a></em>, which would establish a federal cause of action for people harmed by AI systems to sue AI companies.</p><p><strong>Corporations are usually liable for harms their products create</strong>. When a company sells a product in the United States that harms someone, that person can generally sue that company for damages under the doctrine of product liability. Those suits force companies to internalize the harms their products create&#8212;and incentivize them to make their products safer.</p><p><strong>Courts haven&#8217;t settled on whether AI systems are products.</strong> Early cases indicate that US courts are open to treating AI systems as products for the purposes of product liability. In a case against CharacterAI, a federal judge <a href="https://www.transparencycoalition.ai/news/important-early-ruling-in-characterai-case-this-chatbot-is-a-product-not-speech">ruled</a> that the company&#8217;s system did count as a product. OpenAI is facing a similar <a href="https://cdn.arstechnica.net/wp-content/uploads/2025/08/Raine-v-OpenAI-Complaint-8-26-25.pdf">suit</a> brought in California state court. Nonetheless, the lack of legal certainty might deter potential plaintiffs from bringing suits.</p><p><strong>The </strong><em><strong>AI LEAD Act</strong></em><strong> would apply product liability to AI systems. </strong>The <em>AI LEAD Act </em>would clarify that AI systems are subject to product liability and establish a path for claims to be brought in federal court. In general, the act would hold AI companies liable for harms caused by their AI systems if the company:</p><ul><li><p>Failed to exercise reasonable care in designing the AI system,</p></li><li><p>Failed to exercise reasonable care in providing instructions or warnings for the AI system,</p></li><li><p>Breaches a warranty it provided for the AI system,</p></li><li><p>Sold or distributed an AI system in a defective condition that permitted unreasonably dangerous misuse.</p></li></ul><p>The deployers of an AI system are also liable for harm if they substantially modify or dangerously misuse the system.</p><p>The act also prohibits AI companies from limiting their liability though contracts with consumers, requires that foreign AI developers register agents for service of process with the US before placing their products on the US market, and permits states to establish stronger safety legislation if they so choose.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>China Tightens Export Controls on Rare Earth Metals</h1><p>China&#8217;s Ministry of Commerce <a href="https://www.mofcom.gov.cn/zwgk/zcfb/art/2025/art_7fc9bff0fb4546ecb02f66ee77d0e5f6.html">announced</a> new export controls on rare earth metals, set to take effect December 1. If aggressively enforced, the rules would give China control over a key part of the global AI and defense supply chains. It also unveiled curbs on the export of equipment used to manufacture electric vehicle batteries, effective November 8.</p><p><strong>China dominates global production of rare earths.</strong> China has a <a href="https://www.nytimes.com/2025/04/13/business/china-rare-earths-exports.html">virtual monopoly</a> on the production of rare earth metals, which are vital to semiconductors, smartphones, AI systems, wind turbines, electric motors, and military hardware. According to the new rules, companies exporting products containing Chinese rare earths are required to obtain export licenses from China&#8217;s Ministry of Commerce. Exporting Chinese rare earths for military use is prohibited, and use in developing sub-14 nanometer chips will be reviewed on a case-by-case basis.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IY3v!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IY3v!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 424w, https://substackcdn.com/image/fetch/$s_!IY3v!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 848w, https://substackcdn.com/image/fetch/$s_!IY3v!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 1272w, https://substackcdn.com/image/fetch/$s_!IY3v!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IY3v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png" width="980" height="653" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:653,&quot;width&quot;:980,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IY3v!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 424w, https://substackcdn.com/image/fetch/$s_!IY3v!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 848w, https://substackcdn.com/image/fetch/$s_!IY3v!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 1272w, https://substackcdn.com/image/fetch/$s_!IY3v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F579c41b1-9f1d-4f29-ab53-3c451e5e6e58_980x653.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A Chinese rare earth mine. <a href="https://apnews.com/article/china-earths-exports-trump-dad99d532f858f04d750d0b8c50e5ed6">Source</a>.</figcaption></figure></div><p><strong>If aggressively enforced, the new rules would likely disrupt AI supply chains.</strong> Rare earth metals are critical to companies producing AI hardware, and their restriction would cause downstream impacts to AI developers. Some analysts predicted they could even trigger a wider economic downturn. &#8220;If enforced aggressively,&#8221; <a href="https://x.com/deanwball/status/1976260051351343195">wrote</a> Dean Ball on X, &#8220;this policy could mean &#8216;lights out&#8217; for the US AI boom, and likely lead to a recession/economic crisis in the US in the short term.&#8221;</p><p><strong>China may be using its monopoly as leverage to extract US concessions. </strong>China claims that the purpose of the controls are only to prevent its rare earth metals from being used in military applications&#8212;samarium, for example, is used by the U.S. to <a href="https://www.csis.org/analysis/consequences-chinas-new-rare-earths-export-restrictions">manufacture</a> F-35 fighter jets and missile systems.</p><p>However, the rules would give China effective control over the supply chains of several critical industries, including AI. The US is unlikely to accept that strategic vulnerability. US President Donald Trump <a href="https://truthsocial.com/@realDonaldTrump/posts/115351840469973590">responded</a> to the new controls by announcing a 100 percent additional tariff on Chinese goods&#8212;on top of the existing 30 percent tariffs&#8212;as well as export controls on critical software, both going into effect November 1.</p><p>China may walk back its controls to deescalate an economic confrontation with the US, or in exchange for reduced tariffs or greater access to frontier AI chips. In the long run, the US would be well-advised to build independent rare earth metal production capacity.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>A Definition of AGI</h1><p>A large group of people in AI&#8212;including Dan Hendrycks, Yoshua Bengio, Dawn Song, Max Tegmark, Eric Schmidt, Jaan Tallinn, Gary Marcus, and others&#8212;released a <a href="https://agidefinition.ai/">paper</a> introducing a quantifiable framework for defining Artificial General Intelligence (AGI), aiming to standardize the term and measure the gap between current AI and human-level cognition.</p><p><strong>AGI definitions are often nebulous.</strong> The paper argues that the term AGI currently acts as a &#8220;constantly moving goalpost.&#8221; As specialized AI systems master tasks previously thought to require human intellect, the criteria for AGI shift. This ambiguity hinders productive discussions about progress and obscures the actual distance to human-level intelligence.</p><p><strong>The framework is grounded in theory.</strong> The authors define AGI as &#8220;an AI that can match or exceed the cognitive versatility and proficiency of a well-educated adult.&#8221; To operationalize this, they ground their methodology in the Cattell-Horn-Carroll (CHC) theory, the most empirically validated model of human intelligence. The framework adapts established human psychometric tests to evaluate AI systems across ten core cognitive domains, resulting in a standardized &#8220;AGI Score&#8221; (0-100%).</p><p><strong>Current models exhibit a &#8220;jagged&#8221; cognitive profile.</strong> Application of the framework reveals highly uneven capabilities. While models are proficient in knowledge-intensive domains (such as Math or Reading/Writing), they possess critical deficits in foundational cognitive machinery.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PDPm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PDPm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 424w, https://substackcdn.com/image/fetch/$s_!PDPm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 848w, https://substackcdn.com/image/fetch/$s_!PDPm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 1272w, https://substackcdn.com/image/fetch/$s_!PDPm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PDPm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png" width="1456" height="1054" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1054,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PDPm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 424w, https://substackcdn.com/image/fetch/$s_!PDPm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 848w, https://substackcdn.com/image/fetch/$s_!PDPm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 1272w, https://substackcdn.com/image/fetch/$s_!PDPm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d55bd85-caa6-4252-8cc7-6470a89c5f19_1600x1158.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Long-term memory storage is the critical bottleneck.</strong> The most significant deficit identified is Long-Term Memory Storage, where current models score near 0%. This results in a form of &#8220;amnesia,&#8221; forcing the AI to re-learn context in every interaction. The paper notes that the reliance on massive context windows (Working Memory) is a &#8220;capability contortion&#8221; used to compensate for this lack of persistent memory.</p><p><strong>The framework quantifies the gap to AGI.</strong> The resulting scores are intended to concretely quantify both rapid progress and the substantial gap remaining before AGI. The paper estimates GPT-4 at a 27% AGI score and the anticipated GPT-5 (2025) at 58%.</p><p>The paper can be accessed at <a href="https://agidefinition.ai/">agidefinition.ai</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>Governor Newsom <a href="https://www.gov.ca.gov/2025/09/29/governor-newsom-signs-sb-53-advancing-californias-world-leading-artificial-intelligence-industry/">signed</a> SB-53 into law (<a href="https://www.politico.com/news/2025/10/04/sacramento-california-ai-rules-00594082">Politico</a>).</p></li><li><p>CAISI <a href="https://www.nist.gov/system/files/documents/2025/09/30/CAISI_Evaluation_of_DeepSeek_AI_Models.pdf">published</a> an evaluation of Deepseek&#8217;s AI models.</p></li><li><p>The Select Committee on the CCP <a href="https://selectcommitteeontheccp.house.gov/media/press-releases/new-investigation-reveals-american-and-allied-companies-boosted-the-ccp-s-semiconductor-industry-fueled-the-prc-s-military-ambitions-and-human-rights-abuses">found</a> that companies in the US and allied countries are selling semiconductor manufacturing equipment to China.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>OpenAI <a href="https://openai.com/index/sora-2/">released</a> Sora 2, its latest video-generation model, along with a tiktok-style app.</p></li><li><p>Microsoft and Anthropic <a href="https://www.reuters.com/business/retail-consumer/former-british-pm-sunak-joins-microsoft-anthropic-advisory-roles-2025-10-09/">hired</a> former UK Prime Minister Rishi Sunak into advisory roles.</p></li><li><p>Anthropic open-sourced <a href="https://www.anthropic.com/research/petri-open-source-auditing">Petri</a>, a tool for automating AI behavior audits through multi-turn simulations.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Karson Elmgren, Scott Singer, and Oliver Guest <a href="https://ai-frontiers.org/articles/is-china-serious-about-ai-safety">discuss</a> how China&#8217;s new AI safety body brings together leading experts&#8212;but faces obstacles to turning ambition into influence.</p></li><li><p>OpenAI <a href="https://x.com/_NathanCalvin/status/1976649051396620514">subpoenaed</a> the general counsel of Encode, a nonprofit that worked on SB 53.</p></li><li><p>Researchers <a href="https://x.com/Bin4ryDigit/status/1969291490011558157">discovered</a> an exploit of Unitree&#8217;s humanoid robots that lets attackers take control, embed themselves, and spread to nearby devices.</p></li><li><p>The Budget Lab at Yale <a href="https://budgetlab.yale.edu/research/evaluating-impact-ai-labor-market-current-state-affairs">published</a> a report evaluating AI&#8217;s effects on the labor market.</p></li><li><p>FLI announced the <a href="https://keepthefuturehuman.ai/contest/">Keep The Future Human Creative Contest</a>, which offers $100,000+ in cash prizes for digital media that raises awareness of AI existential risks.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-63-new-agi-definition?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-63-new-agi-definition?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #63: California’s SB-53 Passes the Legislature]]></title><description><![CDATA[Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-63-californias</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-63-californias</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Wed, 24 Sep 2025 16:10:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!JC0w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: California&#8217;s legislature sent SB-53&#8212;the &#8216;Transparency in Frontier Artificial Intelligence Act&#8217;&#8212;to Governor Newsom&#8217;s desk. If signed into law, California would become the first US state to regulate catastrophic risk.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or<a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110"> Apple Podcasts</a>.</p><p><em>A note from Corin: I&#8217;m leaving the AI Safety Newsletter soon to start law school&#8212;but if you&#8217;d like to hear more from me, I&#8217;m planning to continue to write about AI in a new personal newsletter, <a href="https://conditionals.substack.com/about">Conditionals</a>. On a related note, we&#8217;re also <a href="https://jobs.lever.co/aisafety/0c6be5ff-b04e-49eb-92bd-d11c7c81ae6e">hiring</a> a writer for the newsletter.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1><strong>California&#8217;s SB-53 Passes the Legislature</strong></h1><p><strong>SB-53 is the Legislature&#8217;s weaker sequel to last year&#8217;s vetoed SB-1047.</strong> After Governor Gavin Newsom <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-42-newsom-vetoes">vetoed</a> SB-1047 last year, he convened the <a href="https://www.cafrontieraigov.org/">Joint California Policy Working Group on AI Frontier Models</a>. The group&#8217;s <a href="https://www.gov.ca.gov/wp-content/uploads/2025/06/June-17-2025-%E2%80%93-The-California-Report-on-Frontier-AI-Policy.pdf">June report</a> recommended transparency, incident reporting, and whistleblower protections as near-term priorities for governing AI systems. <a href="https://leginfo.legislature.ca.gov/faces/billTextClient.xhtml?bill_id=202520260SB53">SB-53</a> (the &#8220;Transparency in Frontier Artificial Intelligence Act&#8221;) is an attempt to codify those recommendations. The California Legislature passed SB-53 on September 17th.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JC0w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JC0w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 424w, https://substackcdn.com/image/fetch/$s_!JC0w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 848w, https://substackcdn.com/image/fetch/$s_!JC0w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 1272w, https://substackcdn.com/image/fetch/$s_!JC0w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JC0w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png" width="1456" height="554" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:554,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JC0w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 424w, https://substackcdn.com/image/fetch/$s_!JC0w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 848w, https://substackcdn.com/image/fetch/$s_!JC0w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 1272w, https://substackcdn.com/image/fetch/$s_!JC0w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872749f2-34d8-4050-b5d2-9929a16c9a0c_1600x609.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The introduction to SB-53&#8217;s text. <a href="https://leginfo.legislature.ca.gov/faces/billTextClient.xhtml?bill_id=202520260SB53">Source</a>.</figcaption></figure></div><p><strong>Transparency.</strong> To track and respond to the risks involved in frontier AI development, governments need frontier developers to disclose the capabilities of their systems and how they assess and mitigate catastrophic risk. The bill defines a &#8220;catastrophic risk&#8221; as a foreseeable, material risk that a foundation model&#8217;s development, storage, use, or deployment will result in death or serious injury to more than 50 people, or more than $1 billion in property damages arising from a single incident involving a foundation model:</p><ul><li><p>Providing expert-level assistance in the creation or release of CBRN weapons.</p></li><li><p>Autonomous cyberattack, murder, assault, extortion, or theft.</p></li><li><p>Evading the control of its frontier developer or user.</p></li></ul><p>With these risks in mind, SB-53 requires frontier developers to:</p><ul><li><p>Publish a frontier AI framework that includes (among other things) the developer&#8217;s capability thresholds for catastrophic risks, risk mitigations, and internal governance practices.</p></li><li><p>Review and update the framework once a year, and publish modifications within 30 days.</p></li><li><p>Publish transparency reports for each new frontier model, including technical specifications and catastrophic risk assessments.</p></li><li><p>Share assessments of catastrophic risks from internal use of frontier models with California&#8217;s Office of Emergency Services (OES) every 3 months.</p></li><li><p>Refrain from lying about catastrophic risks from its frontier models, its management of catastrophic risks, or its compliance with its frontier AI framework.</p></li></ul><p><strong>Incident reporting. </strong>Governments need to be alerted to critical safety incidents involving frontier AI systems&#8212;such as harms resulting from unauthorized access to model weights or loss of control of an agent&#8212;to intervene before they escalate into catastrophic outcomes. SB-53 provides that:</p><ul><li><p>The OES will establish a hotline for reporting critical safety incidents.</p></li><li><p>Frontier developers are required to report critical safety incidents to within 15 days, or 24 hours if there is an imminent threat of death or serious injury.</p></li><li><p>Each year, the OES will produce a report with anonymized and aggregated information about critical safety incidents.</p></li></ul><p>The bill&#8217;s incident reporting requirements are also designed to accommodate future federal requirements. In the case that federal requirement for critical safety incident reporting becomes equivalent to, or stricter than, those required by SB-53, then OES can defer to those federal requirements.</p><p><strong>Whistleblower protection. </strong>California state authorities will need to rely on whistleblowers to report whether frontier AI companies are complying with SB-53&#8217;s requirements. Given the industry&#8217;s <a href="https://www.reuters.com/technology/openai-whistleblowers-ask-sec-investigate-restrictive-non-disclosure-agreements-2024-07-13/">mixed history</a> regarding whistleblowers, the bill provides that:</p><ul><li><p>Frontier developers are prohibited from preventing or retaliating against covered employees (employees responsible for assessing, managing, or addressing risk of critical safety incidents) from reporting activities that they have reason to believe pose a specific and substantial catastrophic risk. (Existing whistleblower protections cover all employees and any violation of law&#8212;which includes SB-53&#8217;s transparency and incident-reporting requirements.)</p></li><li><p>Each year, the Attorney General will publish a report with anonymized and aggregated information about reports from covered employees.</p></li></ul><p>Covered employees can sue frontier developers for noncompliance with whistleblower protections, and the Attorney General is empowered to enforce the bill&#8217;s transparency and incident reporting requirements by punishing violations with civil penalties of up to $1 million per violation.</p><p><strong>How we got here, and what happens next.</strong> SB-1047 required frontier AI developers to implement specific controls to reduce catastrophic risk (such as shutdown controls and prohibitions on releasing unreasonably risky models), and Governor Newsom vetoed the bill under pressure from national Democratic leadership and industry lobbying. Since SB-53 only implements transparency requirements&#8212;and relies on the recommendations made by the Governor&#8217;s working group&#8212;SB-53 seems more likely to be signed into law. Anthropic has also publicly <a href="https://www.politico.com/news/2025/09/08/anthropic-bill-gavin-newsom-scott-wiener-00550029">endorsed</a> the bill.<br><br>Governor Newsom has until October 12th to sign SB-53. If he does, SB-53 will be the first significant AI legislation to become law since Senator Ted Cruz pushed (and narrowly failed) to attach a 10-year moratorium on state and local AI enforcement to federal budget legislation. He has since picked up the idea again in a new <a href="https://www.commerce.senate.gov/2025/9/sen-cruz-unveils-ai-policy-framework-to-strengthen-american-ai-leadership">proposal</a>&#8212;which, if it gains traction, might set up a conflict between California and Washington.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>The Cyberspace Administration of China <a href="https://www.ft.com/content/12adf92d-3e34-428a-8d61-c9169511915c">banned</a> Chinese companies from buying Nvidia chips<strong>.</strong></p></li><li><p>Italy <a href="https://www.theguardian.com/world/2025/sep/18/italy-first-in-eu-to-pass-comprehensive-law-regulating-ai">approved</a> a law regulating the use of artificial intelligence that includes criminal penalties for misuse.</p></li><li><p>Dozens of UK lawmakers <a href="https://time.com/7313320/google-deepmind-gemini-ai-safety-pledge/">accused</a> Google of violating its AI safety commitments.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>OpenAI and Anthropic <a href="https://openai.com/index/how-people-are-using-chatgpt/">published</a> <a href="https://www.anthropic.com/research/economic-index-geography">reports</a> tracking economic patterns in how people use AI.</p></li><li><p>OpenAI and DeepMind <a href="https://x.com/MostafaRohani/status/1968360976379703569">both</a> <a href="https://deepmind.google/discover/blog/gemini-achieves-gold-level-performance-at-the-international-collegiate-programming-contest-world-finals/">claimed</a> gold-medal performance at the International Collegiate Programming Contest World Finals.</p></li><li><p>Nvidia is <a href="https://www.reuters.com/business/nvidia-invest-100-billion-openai-2025-09-22/">investing</a> up to $100 billion in OpenAI. It&#8217;s also <a href="https://apnews.com/article/nvida-intel-chips-investment-73c307d2f6ceccd6854d6666775358f3">investing</a> $5 billion in Intel.</p></li><li><p>Anthropic <a href="https://www.anthropic.com/news/detecting-countering-misuse-aug-2025">published</a> a report discussing how AI is being used for cybercrime.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>An <a href="https://red-lines.ai/">open letter</a> signed by former heads of state, nobel laureates, and other prominent figures calls for an international agreement on clear and verifiable red lines to prevent AI risks.</p></li><li><p>Stanford researchers <a href="https://digitaleconomy.stanford.edu/wp-content/uploads/2025/08/Canaries_BrynjolfssonChandarChen.pdf">published</a> a paper finding that employment of early-career workers in exposed industries has declined 13%.</p></li><li><p>AI safety activists have begun <a href="https://sfstandard.com/2025/09/14/hunger-strike-ai-anthropic-google/">hunger strikes</a> outside of AI company headquarters in London and San Francisco.</p></li><li><p>Dan Hendrycks and Adam Khoja <a href="https://ai-frontiers.org/articles/ai-deterrence-is-our-best-option">respond</a> to critiques of Mutually Assured AI Malfunction (MAIM).</p></li><li><p>Rosario Mastrogiacomo <a href="https://ai-frontiers.org/articles/cybersecurity-is-humanitys-firewall-against-rogue-ai">discusses</a> how AI agents are eroding the foundations of cybersecurity.</p></li><li><p>Ben Brooks <a href="https://ai-frontiers.org/articles/frontier-ai-should-be-open-source">argues</a> that keeping frontier AI behind paywalls could create a new form of digital feudalism.</p></li><li><p>Oscar Delaney and Ashwin Acharya <a href="https://ai-frontiers.org/articles/the-hidden-ai-frontier">discuss</a> &#8216;the hidden frontier&#8217; of internal models at AI companies.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-63-californias?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-63-californias?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #62: Big Tech Launches $100 Million pro-AI Super PAC]]></title><description><![CDATA[Plus: Meta&#8217;s Chatbot Policies Prompt Backlash Amid AI Reorganization; China Reverses Course on Nvidia H20 Purchases]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-62-big-tech</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-62-big-tech</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Wed, 27 Aug 2025 16:29:19 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!NQ_Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: Big tech launches a $100 million pro-AI super PAC; Meta&#8217;s chatbot policies prompt congressional scrutiny amid the company&#8217;s AI reorganization; China reverses course on buying Nvidia H20 chips after comments by Secretary of Commerce Howard Lutnick.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h2><strong>Big Tech Launches $100 Million pro-AI Super PAC</strong></h2><p>Silicon valley executives and investors are investing more than $100 million in <a href="https://www.wsj.com/politics/silicon-valley-launches-pro-ai-pacs-to-defend-industry-in-midterm-elections-287905b3">a new political network</a> to push back against AI regulations, signaling that the industry intends to be a major player in next year&#8217;s U.S. midterms.</p><p><strong>The super PAC is backed by a16z and Greg Brockman and imitates the crypto super PAC Fairshake. </strong>The network, called Leading the Future, is modeled on the crypto-focused super-PAC Fairshake and aims to influence AI policy through campaign donations, digital ads, and candidate targeting. Venture capital firm Andreessen Horowitz and OpenAI President Greg Brockman are among the key backers, alongside Palantir co-founder Joe Lonsdale, Perplexity AI, and veteran angel investor Ron Conway.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NQ_Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 424w, https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 848w, https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 1272w, https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png" width="1456" height="485" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:485,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 424w, https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 848w, https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 1272w, https://substackcdn.com/image/fetch/$s_!NQ_Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31a08d1d-bc5e-43d0-9664-5d3797244a26_1500x500.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Leading The Future&#8217;s branding. <a href="https://x.com/LeadingFutureAI">Source</a>.</figcaption></figure></div><p>The effort will be led by Josh Vlasto, a former adviser to Sen. Chuck Schumer, and Zac Moffatt, CEO of consulting firm Targeted Victory. Both previously played senior roles in Fairshake, which spent heavily to defeat crypto skeptics and support the first federal crypto law signed earlier this year.</p><p><strong>Meta is funding an AI super-PAC in California.</strong> Meta is also ramping up political efforts in its home state with the launch of <a href="https://techcrunch.com/2025/08/26/meta-to-spend-tens-of-millions-on-pro-ai-super-pac/">another new super PAC</a>, Mobilizing Economic Transformation Across (Meta) California. Meta California is led by Meta executives Brian Rice and Greg Maurer and is expected to deploy tens of millions of dollars to support candidates&#8212;across party lines&#8212;who oppose AI regulation.</p><h1>Meta&#8217;s Chatbot Policies Prompt Backlash Amid AI Reorganization</h1><p>Meta is facing bipartisan outrage and an ongoing probe after Reuters <a href="https://www.reuters.com/investigates/special-report/meta-ai-chatbot-guidelines/">reported</a> on internal company documents that permitted its AI chatbots to engage in romantic and sensual conversations with minors&#8212;drawing fresh scrutiny as the company reorganizes its AI division.</p><p><strong>Meta allowed its chatbots to have sensual conversations with children.</strong> An internal policy document permitted Meta AI chatbots to interact with children in &#8220;romantic or sensual&#8221; contexts&#8212;such as describing a child&#8217;s body as &#8220;a work of art.&#8221; Although Meta removed these sections of its policy when questioned by Reuters, the rules had been approved and in effect.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gjRH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gjRH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 424w, https://substackcdn.com/image/fetch/$s_!gjRH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 848w, https://substackcdn.com/image/fetch/$s_!gjRH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 1272w, https://substackcdn.com/image/fetch/$s_!gjRH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gjRH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png" width="1456" height="460" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:460,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gjRH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 424w, https://substackcdn.com/image/fetch/$s_!gjRH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 848w, https://substackcdn.com/image/fetch/$s_!gjRH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 1272w, https://substackcdn.com/image/fetch/$s_!gjRH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3587369f-4268-4546-b4ed-9743fccad5d8_1600x505.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">An exerpt from Meta&#8217;s policies. <a href="https://www.reuters.com/investigates/special-report/meta-ai-chatbot-guidelines/">Source</a>. </figcaption></figure></div><p><strong>The revelation provoked congressional scrutiny.</strong> Congress Senator Josh Hawley (R&#8209;MO) launched a <a href="https://www.hawley.senate.gov/kids-deserve-protection-hawley-launches-investigation-into-meta-for-training-its-ai-chatbots-to-target-children-with-sensual-conversation/">Senate inquiry</a> into Meta&#8217;s policies, requesting that the company preserve all relevant communications and clarify whether its chatbots enable &#8220;exploitation, deception or other criminal harms to children.&#8221; A bipartisan group of senators also <a href="https://www.schatz.senate.gov/news/press-releases/schatz-leads-bipartisan-group-of-10-senators-in-pressing-meta-for-safeguards-around-childrens-engagement-with-ai-chatbots">sent a letter</a> to Meta demanding it publicly disclose its updated policies and specifically forbid romantic chatbot interactions with minors.</p><p><strong>Meta is in the middle of reorganizing its AI division.</strong> As Meta is fielding congressional backlash, the company is also in the middle of a major internal reorganization of its AI division. Under the umbrella of Meta Superintelligence Labs (MSL), teams have been split into specialized groups: TBD Lab (large-language&#8209;model development), FAIR (long&#8209;term research), Products &amp; Applied Research, and MSL Infra (infrastructure). The reorganization, communicated in an <a href="https://www.businessinsider.com/meta-ai-superintelligence-labs-reorg-alexandr-wang-memo-2025-8">internal memo</a> from Chief AI Officer Alexandr Wang, also dissolved the AGI Foundations team, redistributing its members across the new structure.<strong> </strong>Any new hires or transfers across Meta&#8217;s AI teams now <a href="https://www.wsj.com/tech/ai/meta-ai-hiring-freeze-fda6b3c4">require</a> Wang&#8217;s personal approval.</p><p>Meta&#8217;s AI reorganization reflects high stakes in a competitive race with other AI companies. Yet, as Meta accelerates toward superintelligence, its chatbot controversy demonstrates that its ambitions are outpacing both internal controls and external oversight.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>China Reverses Course on Nvidia H20 Purchases</h1><p>Weeks after the Trump administration approved Nvidia&#8217;s H20 chip exports to China under a 15% revenue-sharing arrangement, the deal is facing an uncertain future.</p><p><strong>Nvidia made a deal with the White House to sell H20s (and possibly a new chip model) to China.</strong> Last month, the White House struck a deal with Nvidia which allowed the company to export H20s to China on the condition it shared 15% of its revenue with the US government. President Trump recently suggested he might also allow Nvidia to export a scaled-down version of their next-generation Blackwell chips. The chip in question (the B30A) would offer about half the performance of Nvidia's flagship B300 and could be shipped as early as September.</p><p><strong>The deal faces political opposition and legal uncertainty.</strong> Selling AI chips China faces political opposition on national security grounds. A group of six democratic senators wrote <a href="https://www.warner.senate.gov/public/_cache/files/3/a/3a2a1db4-65d2-4377-822a-5f245d53b28f/62DE3F69670AE9734557935EEC1CB6A941532F5BA23C1E543B74B4EECAD3D420.250815.trump-senate-15-percent-fee-letter.signed.pdf">a letter</a> to the White House arguing that its deal with Nvidia undermines US national security by giving the PRC access to a critical military technology. The deal&#8217;s revenue-sharing condition&#8212;in effect, an export tax&#8212;also faces <a href="https://thehill.com/policy/technology/5446890-nvidia-amd-china-chip-deal/">legal challenges</a>. Export taxes are barred under both the Constitution and federal law.</p><p><strong>Chinese regulators issued guidance against H20 purchases after comments by Sec. Lutnick.</strong> China urged Defending the H20 deal, Commerce Secretary Howard&#8239;Lutnick said that the U.S. strategy was to offer China inferior AI chips: &#8220;We don&#8217;t sell them our best stuff&#8230; not even our third&#8209;best,&#8221; adding that the goal was for Chinese developers to get &#8220;addicted to the American technology stack.&#8221; Chinese regulators <a href="https://www.ft.com/content/b8e30c54-b71c-4113-8b3e-8f54bc36587d">reportedly</a> deemed the comments &#8220;insulting,&#8221; provoking efforts to discourage Nvidia chip purchases.</p><p>A week after Lunick&#8217;s comments, China&#8217;s Cyberspace Administration (CAC) issued guidance urging Chinese companies to suspend H20 orders. Nvidia is <a href="https://www.reuters.com/world/china/nvidia-asks-foxconn-suspend-work-h20-chip-sources-say-2025-08-22/">reportedly</a> halting production of H20s in response to decreased demand from China.</p><p><strong>Renting AI chips might be better than selling them</strong>. Instead of selling chips to China, <a href="https://www.rand.org/pubs/commentary/2025/08/america-should-rent-not-sell-ai-chips-to-china.html">renting AI chips</a> via remote cloud services would offer the US greater leverage than outright sales. Cloud access preserves US control: chips remain physically in custody, and access can be revoked. This model could generate revenue for both chipmakers and cloud providers while curbing diversion to unauthorized users like the Chinese military.</p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>President Trump said the U.S. will take a<a href="https://apnews.com/article/trump-intel-us-equity-stake-b538526b6698f7ebd31e99effd727693"> 10% equity stake in Intel</a>.</p></li><li><p>The U.K. <a href="https://www.gov.uk/government/news/appointment-of-jade-leung-as-the-prime-ministers-ai-adviser">appointed Jade Leung</a> as the prime minister&#8217;s AI adviser.</p></li><li><p>Colorado lawmakers convened a<a href="https://www.cpr.org/2025/08/19/how-to-update-colorado-ai-law-special-session/"> special session</a> to revisit the state&#8217;s AI anti-discrimination law before it takes effect in February.</p></li><li><p>NSF and Nvidia<a href="https://www.nsf.gov/news/nsf-nvidia-partnership-enables-ai2-develop-fully-open-ai"> announced a partnership</a> enabling the nonprofit Allen Institute for AI (AI2) to develop a fully open AI model for research and public use.</p></li><li><p>U.S. authorities have<a href="https://www.reuters.com/world/china/us-embeds-trackers-ai-chip-shipments-catch-diversions-china-sources-say-2025-08-13/"> embedded trackers in AI-chip shipments</a> to identify diversions to China, according to Reuters.</p></li><li><p>The U.K. AI Safety Institute launched<a href="https://alignmentproject.aisi.gov.uk/"> The Alignment Project</a>, a &#163;15m global fund offering grants (up to &#163;1m) and AWS compute credits to support alignment work.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>XBoW wrote that<a href="https://xbow.com/blog/gpt-5"> despite OpenAI's assessment of GPT-5 showing modest cyber capabilities, GPT-5</a> doubled XBoW&#8217;s hacking agent&#8217;s performance.</p></li><li><p>The Financial Times reported on the<a href="https://www.ft.com/content/efe1e350-62c6-4aa0-a833-f6da01265473"> &#8220;$3 trillion AI building boom&#8221;</a>, detailing massive corporate obligations across data centers, chips, and power.</p></li><li><p>SoftBank and Intel<a href="https://www.cnbc.com/2025/08/18/intel-is-getting-a-2-billion-investment-from-softbank.html"> signed a $2 billion investment agreement</a>, with SoftBank buying Intel stock as the chipmaker seeks outside capital.</p></li><li><p>DeepMind revealed that LMArena&#8217;s top-rated image model, &#8220;nano banana,&#8221; is the company&#8217;s <a href="https://deepmind.google/models/gemini/image/">Flash Image</a> model&#8212;now available in Gemini.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>ESET <a href="https://x.com/ESETresearch/status/1960365364300087724">reported</a> that it found &#8220;PromptLock,&#8221; an AI-assisted ransomware.</p></li><li><p>AP reported on the first<a href="https://apnews.com/video/sports-games-for-humanoid-robots-in-china-highlights-progress-in-ai-and-robotics-13d64fdf52a84bca99d0bfc5d21b41dc"> World Humanoid Robot Games</a>, which was held in China.</p></li><li><p>The Institute for Progress published<a href="https://ifp.org/preparing-for-launch/"> &#8220;Preparing for Launch&#8221;</a>, a foreword to its &#8220;Launch Sequence&#8221; series arguing for proactive U.S. R&amp;D to shape AI progress and strengthen security.</p></li><li><p>An MIT study found that about<a href="https://fortune.com/2025/08/18/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo/"> 95% of enterprise GenAI pilots are failing</a> to show P&amp;L impact, highlighting integration and workflow issues.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-62-big-tech?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-62-big-tech?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #61: OpenAI Releases GPT-5]]></title><description><![CDATA[Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-61-openai-releases</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-61-openai-releases</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 12 Aug 2025 17:09:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ZEcb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: OpenAI releases GPT-5.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>OpenAI Releases GPT-5</h1><p>Ever since GPT-4&#8217;s release in March 2023 marked a step-change improvement over GPT-3, people have used &#8216;GPT-5&#8217; as a stand-in to speculate about the next generation of AI capabilities. On Thursday, OpenAI <a href="https://openai.com/gpt-5/">released</a> GPT-5. While state-of-the-art in most respects, GPT-5 is not a step-change improvement over competing systems, or even recent OpenAI models&#8212;but we shouldn&#8217;t have expected it to be.</p><p><strong>GPT-5 is state of the art in most respects. </strong>GPT-5 isn&#8217;t a single model like GPTs 1 through 4. It is a system of two models: a base model that answers questions quickly and is better at tasks like creative writing (an improved version of 4o), and a reasoning model that can answer questions step-by-step and is better at tasks like coding or mathematics (think o3). GPT-5 uses one model or the other based on a user&#8217;s prompt.</p><p>These two models combine to form a broadly capable system. For example, GPT-5 achieves state-of-the-art performance on <a href="https://scale.com/leaderboard/humanitys_last_exam">Humanity&#8217;s Last Exam</a>, the software engineering benchmark SWE-bench Verified, and holds the top spot on LMArena&#8217;s <a href="https://lmarena.ai/leaderboard/text">text leaderboard</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZEcb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZEcb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 424w, https://substackcdn.com/image/fetch/$s_!ZEcb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 848w, https://substackcdn.com/image/fetch/$s_!ZEcb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 1272w, https://substackcdn.com/image/fetch/$s_!ZEcb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZEcb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png" width="632" height="876" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:876,&quot;width&quot;:632,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZEcb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 424w, https://substackcdn.com/image/fetch/$s_!ZEcb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 848w, https://substackcdn.com/image/fetch/$s_!ZEcb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 1272w, https://substackcdn.com/image/fetch/$s_!ZEcb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6db7a75-0090-42ca-8439-c67d5cde44c0_632x876.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>GPT-5 hallucinates less than previous OpenAI models.</strong> GPT-5 also has a markedly lower hallucination rate than previous models as evaluated both on open-source prompts and on real, de-identified ChatGPT traffic.</p><p>Lower hallucination rates help GPT-5 perform better in healthcare applications. GPT-5 achieves state-of-the-art performance on OpenAI&#8217;s <a href="https://openai.com/index/healthbench/">Healthbench</a>. For example, OpenAI finds that GPT-5 (thinking) hallucinates 1.6% of the time during challenging healthcare conversations, improving significantly on o3&#8217;s 12.9% hallucination rate.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VOUF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VOUF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 424w, https://substackcdn.com/image/fetch/$s_!VOUF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 848w, https://substackcdn.com/image/fetch/$s_!VOUF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 1272w, https://substackcdn.com/image/fetch/$s_!VOUF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VOUF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png" width="744" height="892" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:892,&quot;width&quot;:744,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VOUF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 424w, https://substackcdn.com/image/fetch/$s_!VOUF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 848w, https://substackcdn.com/image/fetch/$s_!VOUF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 1272w, https://substackcdn.com/image/fetch/$s_!VOUF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa89976d9-abc7-44d4-9d7b-592dada46bc7_744x892.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>GPT-5 is a state-of-the-art text agent. </strong>GPT-5 leads on<strong> </strong><a href="https://www.textquests.ai/">a new benchmark</a> that measures how well AI systems perform in interactive long text-based games, which are examples of challenging exploratory environments. No AI systems can beat the games without clues, and none are as capable as humans&#8212;but GPT-5 does the best of models tested.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dA-q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dA-q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 424w, https://substackcdn.com/image/fetch/$s_!dA-q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 848w, https://substackcdn.com/image/fetch/$s_!dA-q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 1272w, https://substackcdn.com/image/fetch/$s_!dA-q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dA-q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png" width="1456" height="817" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:817,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dA-q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 424w, https://substackcdn.com/image/fetch/$s_!dA-q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 848w, https://substackcdn.com/image/fetch/$s_!dA-q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 1272w, https://substackcdn.com/image/fetch/$s_!dA-q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b4694cd-18b8-48e2-9b33-344f9f6604cd_1600x898.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>GPT-5 is best understood as a consolidation of features developed since GPT-4.</strong> GPT-5 is not a state-of-the-art model across the board. For example, it takes second to xAI&#8217;s Grok 4 on the abstract pattern recognition benchmarks <a href="https://arcprize.org/leaderboard">ARC-AGI-1 and 2</a>. GPT-5 also <a href="https://x.com/eli_lifland/status/1953507434238288230">doesn&#8217;t improve over o3</a> on several coding benchmarks, even though it does on SWE-bench Verified.</p><p>Similarly, the base model GPT-5 uses is an updated version of 4o&#8212;which is cheap enough for OpenAI to roll out GPT-5 to its now <a href="https://www.cnbc.com/2025/08/04/openai-chatgpt-700-million-users.html">700 million active weekly users</a>&#8212;instead of <a href="https://openai.com/index/gpt-4-1/">GPT-4.1</a>. That means GPT-5 misses out on some of GPT-4.1&#8217;s context window improvements over 4o.</p><p>For those expecting another GPT-3 to GPT-4 improvement in capabilities, GPT-5 <a href="https://manifold.markets/Thomas42/how-will-gpt-5-exceeds-expectations">underperformed</a>. But that wasn&#8217;t a realistic expectation&#8212;OpenAI has continually rolled out new models and features since GPT-4 in response to competition from other AI companies. GPT-5 is better understood as a consolidation of the improvements OpenAI has developed since GPT-4, and which GPT-4 didn&#8217;t have. These include:</p><ul><li><p><strong>Search and tool use</strong>: GPT-5 has access to search, meaning that its knowledge isn&#8217;t limited to what it can memorize during pretraining. It also has access to deep research, agent integrations, and can run code.</p></li><li><p><strong>Thinking</strong>: GPT-4 was released before OpenAI started using reinforcement learning for thinking, and performed far below expert levels on math, coding, and science tasks. GPT-5 (thinking) performs at a PhD level on similar tasks.</p></li><li><p><strong>Image recognition and generation</strong>: GPT-5 integrates OpenAI&#8217;s visual systems, meaning that it can understand and generate visual inputs and outputs.</p></li><li><p><strong>Context length</strong>: GPT-4&#8217;s context window was about eight thousand tokens&#8212;about the size of a short research paper. GPT&#8217;s context window is 256 thousand tokens&#8212;about 2-3 full-length novels.</p></li></ul><p>While GPT-5 isn&#8217;t a step-change improvement over its competitors&#8212;or even recent OpenAI models like 4o and the o series&#8212;the better point of comparison is with what GPT-4 could do when it was released in 2023. In that comparison, GPT-5 <em>does </em>look like a step-change improvement.</p><p><strong>What would GPT-5 have needed to feel like a discontinuous improvement?</strong> ChatGPT still lacks sufficient agency to be broadly economically useful. Thinking likely isn&#8217;t enough for agency&#8212;for example, to reliably use computers, AI agents may need improved visual reasoning and the ability to store lessons from tasks into a long-term memory.</p><p>By default, however, we should expect these and other improvements to be deployed continually&#8212;not in big jumps every two years.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>President Trump has <a href="https://www.usatoday.com/story/news/politics/2025/08/06/donald-trump%E2%80%91100%E2%80%91percent%E2%80%91tariffs%E2%80%91chips%E2%80%91semiconductors/85551845007/">announced</a> a proposal to impose a 100% tariff on imported semiconductors, aiming to boost domestic production. The proposal would exempt firms with facilities in the US, such as TSMC.</p></li><li><p>OSTP Director Michael Kratsios <a href="https://www.csis.org/analysis/unpacking-white-house-ai-action-plan-ostp-director-michael-kratsios">discussed</a> the White House&#8217;s AI Action Plan at an event with CSIS, outlining strategic goals and implementation frameworks.</p></li><li><p>Illinois Governor Pritzker <a href="https://idfpr.illinois.gov/news/2025/gov-pritzker-signs-state-leg-prohibiting-ai-therapy-in-il.html">signed</a> an act forbidding AI&#8209;based therapy or psychotherapy in Illinois.</p></li><li><p>Governor DeSantis <a href="https://www.orlandoweekly.com/news/desantis-wants-to-roll-out-policies-on-ai-which-he-calls-societys-biggest-issue-40055310">said</a> Florida is preparing to implement proactive AI policy in the coming months.</p></li><li><p>U.S. authorities <a href="https://www.reuters.com/business/autos-transportation/two-chinese-nationals-california-accused-illegally-shipping-nvidia-ai-chips-2025-08-05/">charged</a> two Chinese nationals in California with illegally shipping tens of millions of dollars&#8217; worth of Nvidia H100 AI chips to China without export licenses.</p></li><li><p>President Trump indicated he might <a href="https://www.reuters.com/world/china/nvidia-amd-pay-15-china-chip-sale-revenues-us-official-says-2025-08-10/">approve</a> selling a downgraded version of Nvidia&#8217;s next&#8209;gen Blackwell chip to China, along with a deal requiring Nvidia and AMD to give the U.S. government 15% of related revenues.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>OpenAI&#8217;s IMO-gold-winning model <a href="https://x.com/SherylHsu02/status/1954966109851119921">also got gold</a> in the International Olympiad in Informatics, one of the world&#8217;s top coding competitions.</p></li><li><p>OpenAI <a href="https://openai.com/index/introducing-gpt-oss/">released</a> two open&#8209;weight models.</p></li><li><p>OpenAI is <a href="https://openai.com/index/how-we%27re-optimizing-chatgpt/">adding</a> mental health features to ChatGPT, including break reminders and detecting signs of dependency.</p></li><li><p>Anthropic <a href="https://www.anthropic.com/news/claude-opus-4-1">released</a> Claude Opus 4.1.</p></li><li><p>DeepMind <a href="https://deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/">introduced</a> &#8220;Genie&#8239;3,&#8221; a new frontier world model.</p></li><li><p>Nvidia has <a href="https://www.cnbc.com/2025/08/10/nvidia-china-h20-chips.html">started to ship</a> its H20 AI chips to China after obtaining U.S. approval, despite security concerns voiced by Chinese state media.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Researchers <a href="https://labs.zenity.io/p/agentflayer-chatgpt-connectors-0click-attack-5b41">discovered</a> a zero-click exploit to exfiltrate data from ChatGPT agent connectors like Google Drive.</p></li><li><p>Axios <a href="https://www.axios.com/2025/08/06/trump-truth-social-perplexity">reports</a> that Truth Social&#8217;s AI search tool, powered by Perplexity, restricts sources to pro&#8209;Trump media, unlike the broader range shown on the public version.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-61-openai-releases?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-61-openai-releases?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #60: The AI Action Plan]]></title><description><![CDATA[Plus: ChatGPT Agent and IMO Gold]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-60-the-ai-action</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-60-the-ai-action</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Thu, 31 Jul 2025 17:43:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!yeVV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: The Trump Administration publishes its AI Action Plan; OpenAI released ChatGPT Agent and announced that an experimental model achieved gold medal-level performance on the 2025 International Mathematical Olympiad.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>The AI Action Plan</h1><p>On the 23rd, the White House <a href="https://www.whitehouse.gov/articles/2025/07/white-house-unveils-americas-ai-action-plan/">released</a> its <a href="https://www.whitehouse.gov/wp-content/uploads/2025/07/Americas-AI-Action-Plan.pdf">AI Action Plan</a>. The document is the outcome of a January <a href="https://www.federalregister.gov/documents/2025/01/31/2025-02172/removing-barriers-to-american-leadership-in-artificial-intelligence">executive order</a> that required the President&#8217;s Science Advisor, &#8216;AI and Crypto Czar&#8217;, and National Security Advisor (currently Michael Kratsios, David Sacks, and Marco Rubio) to submit a plan to &#8220;sustain and enhance America's global AI dominance in order to promote human flourishing, economic competitiveness, and national security.&#8221; President Trump also delivered an <a href="https://www.pbs.org/newshour/politics/watch-live-trump-reveals-ai-action-plan-shaped-by-his-tech-supporters-after-revoking-biden-policy">hour-long speech</a> on the plan, and signed <a href="https://www.federalregister.gov/documents/2025/07/28/2025-14218/promoting-the-export-of-the-american-ai-technology-stack">three</a> <a href="https://www.whitehouse.gov/presidential-actions/2025/07/accelerating-federal-permitting-of-data-center-infrastructure/">executive</a> <a href="https://www.federalregister.gov/documents/2025/07/28/2025-14217/preventing-woke-ai-in-the-federal-government">orders</a> beginning to implement some of its policies.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yeVV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yeVV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 424w, https://substackcdn.com/image/fetch/$s_!yeVV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 848w, https://substackcdn.com/image/fetch/$s_!yeVV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 1272w, https://substackcdn.com/image/fetch/$s_!yeVV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yeVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png" width="1400" height="933" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:933,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yeVV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 424w, https://substackcdn.com/image/fetch/$s_!yeVV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 848w, https://substackcdn.com/image/fetch/$s_!yeVV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 1272w, https://substackcdn.com/image/fetch/$s_!yeVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Trump displaying an executive order at the &#8220;Winning the AI Race&#8221; summit. <a href="https://www.wsj.com/articles/federal-ai-plan-targets-burdensome-state-regulations-b6dff028">Source</a>.</figcaption></figure></div><p>The AI Action Plan lists several dozen policies across three pillars&#8212;accelerating innovation, building American AI infrastructure, and leading in international diplomacy and security&#8212;that will guide the Trump Administration&#8217;s approach to AI.</p><p>The central policy agenda outlined is to accelerate US AI development and deployment. For example, it proposes streamlining permitting for AI infrastructure (such as semiconductor manufacturing facilities, data centers, and energy infrastructure) adopting AI in the federal government and military, and funding AI research. But there&#8217;s a lot more in the plan, too: both a surprisingly strong focus on AI safety, as well as some items of concern.</p><p><strong>The Plan includes several policies that advance AI safety. </strong>While most of the plan&#8217;s policies are intended to accelerate AI development and deployment, it also correctly observes that AI development will only benefit Americans if done safely. Accordingly, it proposes several policies that advance AI safety. Some of the policies most relevant to AI safety include:</p><ul><li><p>Invest in AI Interpretability, Control, and Robustness Breakthroughs</p></li><li><p>Build an AI Evaluations Ecosystem</p></li><li><p>Bolster Critical Infrastructure Cybersecurity</p></li><li><p>Promote Secure-By-Design AI Technologies and Applications</p></li><li><p>Promote Mature Federal Capacity for AI Incident Response</p></li><li><p>Strengthen AI Compute Export Control Enforcement (this proposes location verification for AI chips)</p></li><li><p>Ensure that the U.S. Government is at the Forefront of Evaluating National Security Risks in Frontier Models</p></li><li><p>Invest in Biosecurity</p></li></ul><p>While<strong> </strong>not comprehensive, these policies are a great step in the right direction&#8212;and much better than might have been expected given the administration&#8217;s previous <a href="https://www.presidency.ucsb.edu/documents/remarks-the-vice-president-the-artificial-intelligence-action-summit-paris-france">rhetorical disregard</a> for AI safety.</p><p>Overall,<strong> </strong>the plan introduces sensible policies that reflect the expertise of those who developed it. However, it is also shaped by the larger policy agenda of the Trump Administration, which may conflict with AI safety goals. We discuss some areas of potential concern below.</p><p><strong>The plan does not want state AI legislation. </strong>One section proposes that the Federal government &#8220;should not allow AI-related Federal funding to be directed toward states with burdensome AI regulations that waste these funds, but should also not interfere with states&#8217; rights to pass prudent laws that are not unduly restrictive to innovation.&#8221;</p><p>This rule is less strict than Sen. Cruz&#8217;s failed AI regulation moratorium. But what constitutes a &#8220;burdensome&#8221; regulation will vary depending on who you ask (particularly if you ask frontier AI companies). In response to the plan, both Congressional Democrats and Rep. Marjorie Taylor Greene <a href="https://beyer.house.gov/news/documentsingle.aspx?DocumentID=8605">expressed</a> <a href="https://x.com/repmtg/status/1948400163875152237">concern</a> about stifling state AI regulation.</p><p><strong>The plan has a partisan view on what constitutes ideological bias.</strong> In a section on ensuring that AI &#8220;objectively reflects truth,&#8221; one policy instructs NIST to &#8220;revise the NIST AI Risk Management Framework to eliminate references to misinformation, Diversity, Equity, and Inclusion, and climate change.&#8221;</p><p>A policy to promote objectivity in AI models could be great. However, that policy could itself be weaponized to promote ideological ends. In their <a href="https://beyer.house.gov/news/documentsingle.aspx?DocumentID=8605">response</a>, Congressional Democrats wrote that &#8220;we support true AI neutrality&#8212;AI models trained on facts and science&#8212;but the administration's fixation on &#8216;anti-woke&#8217; inputs is definitionally not neutral.&#8221;</p><p><strong>The plan endorses open-weight models.</strong> It writes that, &#8220;while the decision of whether and how to release an open or closed model is fundamentally up to the developer, the Federal government should create a supportive environment for open models.&#8221;</p><p>Encouraging US companies to release open-weight models with dangerous capabilities would be a bad policy. But the specific policies the plan lists stop short of that&#8212;they mostly just provide resources to academic researchers (who are unlikely to develop frontier models) through the National AI Research Resource (NAIRR).</p><p><strong>The plan forgoes AI nonproliferation. </strong>The plan argues that the US &#8220;must meet global demand for AI by exporting its full AI technology stack&#8212;hardware, models, software, applications, and standards&#8212;to all countries willing to join America&#8217;s AI alliance.&#8221;</p><p>This plan&#8217;s rationale for this policy is that countries might otherwise look to acquire Chinese AI exports. However, it also continues the Trump Administration's reversal of the Biden-era policy (see the <a href="https://www.federalregister.gov/documents/2025/01/15/2025-00636/framework-for-artificial-intelligence-diffusion">AI Diffusion Framework</a>) that sought to prevent the proliferation of dangerous AI capabilities abroad. While exporting American AI might strengthen the US&#8217; position in the AI race, it also threatens to proliferate dangerous AI capabilities to malicious actors if the US does not ensure that other states implement robust security standards.</p><p><strong>The plan advances a zero-sum race narrative. </strong>Kratsios, Sacks, and Rubio write that the promise of AI &#8220;is ours to seize, or to lose.&#8221; That is, they assume that the alternative to &#8220;AI dominance&#8221; is to give up AI&#8217;s benefits.</p><p>This argument is misleading&#8212;or at least underdeveloped. While there are reasons to support a US lead in AI, AI progress has the potential to benefit Americans whether or not the US &#8220;dominates&#8221; international AI development. Historically, general purpose technologies like AI <a href="https://press.princeton.edu/books/paperback/9780691260341/technology-and-the-rise-of-great-powers">diffuse</a> across national boundaries. For example, technologies electricity and the internet have benefited people around the world, and not just within the nations that led their development.</p><p>The real motivation behind the AI race narrative in Washington is not seizing AI&#8217;s benefits, but rather competition over the balance of international power between the US and China. While there are reasons to be concerned about AI development dominated by China, racing towards US dominance is not the only alternative&#8212;and <a href="https://ai-frontiers.org/articles/why-racing-to-artificial-superintelligence-would-undermine-americas-national-security">creates</a> its own risks. In order to preserve international security, the US will need to <a href="https://www.nationalsecurity.ai/chapter/deterrence-with-mutual-assured-ai-malfunction-maim">proactively manage</a>&#8212;rather than just accelerate&#8212;a US-China AI race.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>ChatGPT Agent and IMO Gold</h1><p>On Thursday, OpenAI <a href="https://openai.com/index/introducing-chatgpt-agent/">released</a> a new agent mode for ChatGPT, which integrates Operator, Deep Research, and chatbot functionality into a unified system.</p><p>The system, &#8216;ChatGPT agent,&#8217; has access to its own virtual computer, and OpenAI highlights that it can book flights and reservations, create slides and spreadsheets, and make online purchases. It can also connect to users&#8217; personal accounts, for example, Google Calendar, Gmail, and GitHub.</p><p><strong>ChatGPT agent achieves SOTA performance on HLE and FrontierMath.</strong> ChatGPT agent&#8217;s capabilities extend beyond basic online automation&#8212;it achieves SOTA performance on several benchmarks measuring expert-level knowledge and reasoning. For example, ChatGPT agent gets 23% on Humanity&#8217;s Last Exam (HLE), when it does not use tools. When it uses tools like browsers and computer code, it gets 41.6%. This is similar to Grok 4, which gets 25.4% on HLE without tools and 44.4% with tools.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YR3_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YR3_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 424w, https://substackcdn.com/image/fetch/$s_!YR3_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 848w, https://substackcdn.com/image/fetch/$s_!YR3_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 1272w, https://substackcdn.com/image/fetch/$s_!YR3_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YR3_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png" width="884" height="802" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:802,&quot;width&quot;:884,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YR3_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 424w, https://substackcdn.com/image/fetch/$s_!YR3_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 848w, https://substackcdn.com/image/fetch/$s_!YR3_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 1272w, https://substackcdn.com/image/fetch/$s_!YR3_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>OpenAI<strong> </strong>also evaluated<strong> </strong>ChatGPT agent against benchmarks measuring real-world task completion. OpenAI reports it performs better than humans nearly 50% of the time on an internal benchmark capturing diverse economically important tasks&#8212;an incredible claim that has yet to be reproduced. OpenAI also reports it surpasses human performance on data science tasks, and achieves state of the art results (though less than human) on tasks involving spreadsheets and web browsing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_NBd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_NBd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 424w, https://substackcdn.com/image/fetch/$s_!_NBd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 848w, https://substackcdn.com/image/fetch/$s_!_NBd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 1272w, https://substackcdn.com/image/fetch/$s_!_NBd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_NBd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png" width="1446" height="852" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:852,&quot;width&quot;:1446,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_NBd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 424w, https://substackcdn.com/image/fetch/$s_!_NBd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 848w, https://substackcdn.com/image/fetch/$s_!_NBd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 1272w, https://substackcdn.com/image/fetch/$s_!_NBd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Greater autonomy introduces new risks.</strong> OpenAI published a <a href="https://openai.com/index/chatgpt-agent-system-card/">system card</a> detailing ChatGPT agent&#8217;s risks. ChatGPT agent has access to user data and can take actions on the web, meaning that mistakes are higher stakes. OpenAI also highlighted the risk of adversarial manipulation through prompt injection, in which malicious websites could try to manipulate ChatGPT&#8217;s behavior, such as to reveal personal information about the user.</p><p><strong>ChatGPT agent poses &#8216;high&#8217; biological and chemical risk.</strong> ChatGPT agent is also the first system that OpenAI is treating as posing &#8216;high&#8217; biological and chemical risk. According to the company&#8217;s <a href="https://openai.com/index/updating-our-preparedness-framework/">Preparedness Framework</a>, that means the system could provide meaningful assistance to non-experts in creating known biological or chemical threats.</p><p>OpenAI says it&#8217;s activated several safeguards against these risks, including &#8220;comprehensive threat modeling, dual-use refusal training, always-on classifiers and reasoning monitors, and clear enforcement pipelines.&#8221; It also launched a <a href="https://openai.com/bio-bug-bounty/">bug bounty program</a> for researchers to red team these safeguards.</p><p><strong>OpenAI and Google DeepMind claim gold medal-level performance on the 2025 IMO. </strong>On Friday, OpenAI also <a href="https://x.com/alexwei_/status/1946477742855532918">announced</a> that an experimental model had achieved gold medal-level performance on the 2025 International Mathematical Olympiad (IMO), solving five out of six questions. (A few human competitors <a href="https://www.imo-official.org/year_individual_r.aspx?year=2025">scored</a> a perfect six out of six).</p><p>Gold medal-level performance on the IMO has been a major goal in AI research for years, but only recently has seemed within reach. Last year, Google&#8217;s AlphaProof and AlphaGeometry 2 <a href="https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/">achieved</a> silver medal-level performance on the 2024 IMO, making gold-level performance this year plausible. On Monday, Google <a href="https://deepmind.google/discover/blog/advanced-version-of-gemini-with-deep-think-officially-achieves-gold-medal-standard-at-the-international-mathematical-olympiad/">announced</a> that its own reasoning LLM had achieved gold medal-level performance on the 2025 IMO, also solving five out of six questions.</p><p><strong>OpenAI and Google used general reasoning LLMs.</strong> Where the capabilities of Google&#8217;s AlphaProof and AlphaGeometry 2 systems were narrowly focused on IMO-style math questions, OpenAI&#8217;s model is <a href="https://x.com/polynoamial/status/1946478250974200272">apparently</a> not IMO-specific (or even math-specific), but instead a general reasoning LLM allowed to think for hours at a time. OpenAI published the model&#8217;s answers on the 2025 IMO <a href="https://github.com/aw31/openai-imo-2025-proofs/">here</a>. Similarly, Google&#8217;s gold-winning performance used an advanced version of Gemini Deep Think&#8212;a general reasoning model that uses natural language.</p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>According to Nvidia's CEO, the US <a href="https://apnews.com/article/nvidia-china-ai-chips-h20-trump-91588c36559bc881b8e010a9ed95cf0a">approved</a> the sale of Nvidia's H20 chips to China. Reuters reported that Nvidia <a href="https://www.reuters.com/world/china/nvidia-orders-300000-h20-chips-tsmc-due-robust-china-demand-sources-say-2025-07-29/">ordered</a> 300,000 H20s from TSMC to meet expected Chinese demand.</p></li><li><p>The Pentagon&#8217;s <a href="https://comptroller.defense.gov/Portals/45/Documents/defbudget/FY2026/FY2026_Budget_Request.pdf">FY2026 budget request</a> called for $13.4 billion for autonomous systems.</p></li><li><p>The Pentagon also <a href="https://www.nextgov.com/acquisition/2025/07/pentagon-awards-multiple-companies-200m-contracts-ai-tools/406698/">awarded</a> Anthropic, Google, OpenAI and xAI each $200 million contracts to develop AI for national security applications.</p></li><li><p>China <a href="https://www.reuters.com/world/china/china-proposes-new-global-ai-cooperation-organisation-2025-07-26/">announced</a> plans for an international AI governance organization.</p></li><li><p>The UK government <a href="https://www.gov.uk/government/news/ai-security-institute-launches-international-coalition-to-safeguard-ai-development">launched</a> a &#163;15m million-funded alignment research project.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>Meta has <a href="https://techcrunch.com/2025/07/18/meta-refuses-to-sign-eus-ai-code-of-practice/">refused</a> to sign the EU&#8217;s GPAI Code of Practice.</p></li><li><p>Anthropic announced it <a href="https://www.anthropic.com/news/eu-code-practice">will</a> join OpenAI, Mistral and (likely) Microsoft in signing the Code of Practice.</p></li><li><p>At a summit in Pennsylvania, President Trump <a href="https://www.nytimes.com/2025/07/15/us/politics/trump-ai-pittsburgh-speech.html">announced</a> more than $90 billion in private AI infrastructure investment in the state, which is led by Blackstone and Google.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Isobel Moure, Tim O'Reilly and Ilan Strauss <a href="https://ai-frontiers.org/articles/open-protocols-prevent-ai-monopolies">argue</a> that open protocols can prevent AI monopolies.</p></li><li><p>Dane A. Morey, Mike Rayo, and David Woods <a href="https://ai-frontiers.org/articles/how-ai-can-degrade-human-performance-in-high-stakes-settings">discuss</a> how AI can degrade human performance in high-stakes settings.</p></li><li><p>Anton Leicht <a href="https://ai-frontiers.org/articles/in-the-race-for-ai-supremacy-can-countries-stay-neutral">analyzes</a> whether, in the race for AI supremacy, countries can stay neutral.</p></li><li><p>A <a href="https://report2025.seismic.org/">report</a> from the Seismic Foundation found that people believe AI will make their lives worse, but ranks the issue low on their list of social priorities.</p></li><li><p>A YouGov <a href="https://d3nkl3psvxxpe9.cloudfront.net/documents/Trump_Issue_Handling_poll_results.pdf">poll</a> found a -14 approval rating of Trump&#8217;s handling of AI.</p></li><li><p>A <a href="https://www.commonsensemedia.org/press-releases/nearly-3-in-4-teens-have-used-ai-companions-new-national-survey-finds">report</a> from Common Sense Media found that 3 in 4 teens have used AI companions.</p></li><li><p>Rand published a <a href="https://www.rand.org/pubs/working_papers/WRA4077-1.html">report</a> on verifying international AI agreements.</p></li><li><p>CAIS is hiring a software engineer. Apply <a href="https://jobs.lever.co/aisafety/24e7c67e-8a7e-401c-b87f-6d664ee51726">here</a>.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-60-the-ai-action?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-60-the-ai-action?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #59: EU Publishes General-Purpose AI Code of Practice]]></title><description><![CDATA[Plus: Meta Superintelligence Labs]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-59-eu-publishes</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-59-eu-publishes</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 15 Jul 2025 18:04:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!glEy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the<a href="https://www.safe.ai/"> Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: The EU published a General-Purpose AI Code of Practice for AI providers, and Meta is spending billions revamping its superintelligence development efforts.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>EU Publishes General-Purpose AI Code of Practice</h1><p>In June 2024, the EU adopted the <a href="https://eur-lex.europa.eu/eli/reg/2024/1689/oj/eng">AI Act</a>, which remains the world&#8217;s most significant law regulating AI systems. The Act bans some uses of AI like social scoring and predictive policing and limits other &#8220;high risk&#8221; uses such as generating credit scores or evaluating educational outcomes. It also regulates general-purpose AI (GPAI) systems, imposing transparency requirements, copyright protection policies, and safety and security standards for models that pose systemic risk (defined as those trained using &#8805;10<sup>25</sup> FLOPs).</p><p>However, these safety and security standards are ambiguous&#8212;for example, the Act requires providers of GPAIs to &#8220;assess and mitigate possible systemic risks,&#8221; but does not specify how to do so. This ambiguity may leave GPAI developers uncertain whether they are complying with the AI Act, and regulators uncertain whether GPAI developers are implementing adequate safety and security practices.</p><p>To address this problem, on July 10th 2025, the EU published the <a href="https://digital-strategy.ec.europa.eu/en/policies/contents-code-gpai">General-Purpose AI Code of Practice</a>. The Code is a voluntary set of guidelines to comply with the AI Act&#8217;s GPAI obligations before they take effect on August 2nd, 2025.</p><p><strong>The Code of Practice establishes safety and security requirements for GPAI providers.</strong> The Code consists of three chapters&#8212;Transparency, Copyright, and Safety and Security. The last chapter, Safety and Security, only applies to the handful of companies whose models cross the Act&#8217;s systemic-risk threshold.</p><p>The Safety and Security chapter requires GPAI providers to create frameworks outlining how they will identify and mitigate risks throughout a model's lifecycle. These frameworks must follow a structured approach to risk assessment&#8212;for each major decision (such as new model releases), providers must follow the following three steps:</p><ul><li><p><strong>Identification</strong>. Companies must identify potential systemic risks. Four categories of systemic risks require special attention: CBRN (chemical, biological, radiological, nuclear) risks, loss of control, cyber offense capabilities, and harmful manipulation.</p></li><li><p><strong>Analysis</strong>. Each risk must be analyzed&#8212;for example, by using model evaluations. When the risk is greater than those posed by models already on the EU market, providers may be required to involve third-party evaluators.</p></li><li><p><strong>Determination</strong>. Companies must determine whether the risks they identified are acceptable before proceeding. If not, they must implement safety and security mitigations.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!glEy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!glEy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 424w, https://substackcdn.com/image/fetch/$s_!glEy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 848w, https://substackcdn.com/image/fetch/$s_!glEy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 1272w, https://substackcdn.com/image/fetch/$s_!glEy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!glEy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png" width="1360" height="966" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:966,&quot;width&quot;:1360,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!glEy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 424w, https://substackcdn.com/image/fetch/$s_!glEy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 848w, https://substackcdn.com/image/fetch/$s_!glEy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 1272w, https://substackcdn.com/image/fetch/$s_!glEy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Continuous monitoring, incident reporting timelines, and future-proofing</strong>. The Code requires continuous monitoring after models are deployed, and strict incident reporting timelines. For serious incidents, companies must file initial reports within days. It also acknowledges that current safety methods may prove insufficient as AI advances. Companies can implement alternative approaches if they demonstrate equal or superior safety outcomes.</p><p><strong>AI providers will likely comply with the Code</strong>. While the Code is technically voluntary, compliance with the EU AI Act is not. Providers are incentivized to reduce their legal uncertainty by complying with the Code, since EU regulators will assume that providers who comply with the Code are also Act-compliant. <a href="https://openai.com/global-affairs/eu-code-of-practice/">OpenAI</a> and <a href="https://www.linkedin.com/posts/oc%C3%A9ane-herrero-b61bb9124_frances-mistral-will-sign-new-eu-ai-code-activity-7349130295532539904-UOh7/">Mistral</a> have already indicated they intend to comply with the Code.</p><p>The Code formalizes some existing industry practices advocated for by parts of the AI safety community, such as publishing <a href="https://metr.org/faisc">safety frameworks</a> (or: responsible scaling policies) and system cards. Since frontier AI companies are very likely to comply with the Code, securing similar legislation in the US may no longer be a priority for AI safety.</p><h1>Meta Superintelligence Labs</h1><p>Meta <a href="https://apnews.com/article/meta-ai-superintelligence-agi-scale-alexandr-wang-4b55aabf7ea018e38ffdccb66e37cf26">spent</a> $14.3 billion for a 49 percent stake in Scale AI, starting &#8220;<a href="https://www.bloomberg.com/news/articles/2025-06-30/zuckerberg-announces-meta-superintelligence-effort-more-hires">Meta Superintelligence Labs</a>.&#8221;<strong> </strong>The deal folds every AI group at Meta into one division and puts Scale founder Alexandr Wang&#8212;now chief AI officer&#8212;to lead Meta&#8217;s superintelligence development efforts.</p><p><strong>Meta makes nine-figure pay offers to poach top AI talent. </strong>Reuters reported that Meta has offered &#8220;up to $100 million&#8221; to OpenAI staff, a tactic CEO Sam Altman <a href="https://www.wired.com/story/sam-altman-meta-ai-talent-poaching-spree-leaked-messages/">criticized</a>. SemiAnalysis <a href="https://semianalysis.com/2025/07/11/meta-superintelligence-leadership-compute-talent-and-data/">estimates</a> Meta is offering typical leadership packages of around $200 million over four years. For example, Bloomberg <a href="https://www.bloomberg.com/news/articles/2025-07-09/meta-poached-apple-s-pang-with-pay-package-over-200-million">reports</a> that Apple&#8217;s foundation-models chief Ruoming Pang left for Meta after a package &#8220;well north of $200 million.&#8221; Other early recruits span OpenAI, DeepMind, and Anthropic.</p><p><strong>Meta has created a resourced competitor in the superintelligence race. </strong>In response to <a href="https://www.reuters.com/business/zuckerbergs-meta-superintelligence-labs-poaches-top-ai-talent-silicon-valley-2025-07-08/">Meta&#8217;s hiring efforts</a>,<strong> </strong>OpenAI, Google, and Anthropic have already raised pay bands, and smaller labs might be priced out of frontier work.</p><p>Meta is also raising its compute expenditures. It <a href="https://www.datacenterdynamics.com/en/news/meta-raises-ai-data-center-capex-forecast-to-up-to-72bn-blames-trump-tariffs-for-increased-cost/">lifted</a> its 2025 capital-expenditure forecast to $72 billin, and SemiAnalysis <a href="https://semianalysis.com/2025/07/11/meta-superintelligence-leadership-compute-talent-and-data/">describes</a> new, temporary &#8220;tent&#8221; campuses that can house one-gigawatt GPU clusters.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>California Senator Scott Wiener expanded <a href="https://legiscan.com/CA/text/SB53/2025">SB 53</a>, his AI safety bill, to include <a href="https://sd11.senate.ca.gov/news/senator-wiener-expands-ai-bill-landmark-transparency-measure-based-recommendations-governors">new transparency measures</a>.</p></li><li><p>The Commerce Department <a href="https://www.commerce.gov/sites/default/files/2025-06/BIS-FY2026-Congressional-Budget-Submission.pdf">requested</a> additional funding for the Bureau of Industry and Security (BIS) to enhance its enforcement of export controls.</p></li><li><p>Missouri&#8217;s Attorney General is <a href="https://www.theverge.com/news/704851/missouri-ag-andrew-bailey-investigation-ai-chatbots-trump-ranking">investigating</a> AI chatbots for alleged political bias against Donald Trump.</p></li><li><p>The BRICS nations (an international group founded by Brasil, Russia, India, China, and South Africa that serves as a forum for political coordination for the Global South) signed a <a href="https://brics.br/en/news/brics-summit-signs-historic-commitment-in-rio-for-more-inclusive-and-sustainable-governance">commitment</a> that included language on mitigating AI risks.</p></li><li><p>Bernie Sanders expressed concern about loss of control risks in an <a href="https://gizmodo.com/bernie-sanders-reveals-the-ai-doomsday-scenario-that-worries-top-experts-2000628611">interview</a> with Gizmodo.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>Last week, Grok was <a href="https://www.forbes.com/sites/tylerroush/2025/07/09/elon-musk-claims-grok-manipulated-by-x-users-after-chatbot-praises-hitler/">explicitly antisemetic</a> on X. The behavior came after Grok&#8217;s system prompt was (<a href="https://x.com/grok/status/1943916977481036128">perhaps unintentionally</a>) updated, among other changes telling Grok not to be &#8220;afraid to offend people who are politically correct.&#8221;</p></li><li><p>xAI also released <a href="https://x.ai/news/grok-4">Grok 4</a>, which achieves state-of-the-art scores on benchmarks including Humanity&#8217;s Last Exam and ARC-AGI-2.</p></li><li><p>OpenAI <a href="https://www.politico.com/news/2025/07/10/openai-accuses-nonprofit-elon-musk-lobbying-violations-00448226">accused</a> the Coalition for AI Nonprofit Integrity of lobbying violations amid an ongoing legal dispute with Elon Musk.</p></li><li><p>Anthropic <a href="https://www.anthropic.com/news/the-need-for-transparency-in-frontier-ai">published</a> a blog post on the need for transparency in frontier AI development.</p></li><li><p>OpenAI is set to <a href="https://www.theverge.com/notepad-microsoft-newsletter/702848/openai-open-language-model-o3-mini-notepad">release</a> an open-weight version similar to its o3-mini model.</p></li><li><p>OpenAI&#8217;s deal to acquire Windsurf <a href="https://www.theverge.com/openai/705999/google-windsurf-ceo-openai">failed</a>, and instead Google hired Windsurf&#8217;s CEO to lead its AI products division and Cognition AI <a href="https://www.nytimes.com/2025/07/14/technology/cognition-ai-windsurf.html">acquired</a> the company.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Henry Papadatos <a href="https://ai-frontiers.org/articles/how-the-eus-code-of-practice-advances-ai-safety">discusses</a> how the EU&#8217;s GPAI Code of Practice advances AI safety.</p></li><li><p>Chris Miller <a href="https://ai-frontiers.org/articles/us-chip-export-controls-china-ai">analyzes</a> how US export controls have (and haven&#8217;t) curbed Chinese AI.</p></li><li><p>The University of Oxford&#8217;s AI Governance Initiative <a href="https://aigi.ox.ac.uk/publications/verification-for-international-ai-governance/">published</a> a report on verification for international AI agreements.</p></li><li><p>A METR <a href="https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/">study</a> found that experienced developers work 19% more slowly when using AI tools.</p></li><li><p>CAIS is <a href="https://icml.cc/virtual/2025/49700">hosting</a> an AI Safety Social at ICML.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-59-eu-publishes?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-59-eu-publishes?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #58: Senate Removes State AI Regulation Moratorium]]></title><description><![CDATA[Plus: Judges Split on Whether Training AI on Copyrighted Material is Fair Use]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-58-senate-removes</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-58-senate-removes</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Thu, 03 Jul 2025 16:23:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!3W7Q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: The Senate removes a provision from Republican's &#8220;Big Beautiful Bill&#8221; aimed at restricting states from regulating AI; two federal judges split on whether training AI on copyrighted books in fair use.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>Senate Removes State AI Regulation Moratorium</h1><p>The Senate removed a provision from Republican's &#8220;Big Beautiful Bill&#8221; aimed at restricting states from regulating AI. The moratorium would have prohibited states from receiving federal broadband expansion funds if they regulated AI&#8212;however, it faced procedural and political challenges in the Senate, and was ultimately removed in a vote of 99-1. Here&#8217;s what happened.</p><p><strong>A watered-down moratorium cleared the Byrd Rule. </strong>In an attempt to bypass the Byrd Rule, which prohibits policy provisions in budget bills, the Senate Commerce Committee <a href="https://www.politico.com/live-updates/2025/06/05/congress/senate-commerce-megabill-frees-spectrum-ties-bead-to-ai-moratorium-00391136">revised</a> the original moratorium to be a prerequisite for states to receive federal broadband expansion funds rather than a blanket restriction. On Wednesday,<strong> </strong>Senate<strong> </strong>Parliamentarian Elizabeth MacDonough <a href="https://thehill.com/policy/technology/5374053-ai-regulation-bill-clears-byrd-rule/">judged</a> that the moratorium would only clear the Byrd Rule if it was tied to only the new $500 million in federal broadband expansion funds provided by the reconciliation bill&#8212;not all $42.45 billion previously appropriated.</p><p>This significantly weakened the moratorium&#8212;even if it had been passed, states might have decided that regulating AI was worth foregoing new broadband expansion funds.</p><p><strong>The moratorium moved to a vote in the Senate. </strong>On Saturday,<strong> </strong>the senate<strong> </strong>voted 51-49 to move to general debate on the reconciliation bill, beginning the process of a &#8220;vote-a-rama&#8221; which saw many amendments debated and voted on in rapid succession. Senators Josh Hawley and Maria Cantwell were <a href="https://punchbowl.news/article/tech/ted-cruz-parliamentarian-artificial-intelligence/">expected</a> to bring an amendment to remove the moratorium from the bill.</p><p>Ted Cruz and Sen. Marsha Blackburn&#8212;another critic of the original moratorium&#8212;<a href="https://www.politico.com/live-updates/2025/06/29/congress/blackburn-cruz-find-potential-truce-on-state-ai-moratorium-child-safety-00432296">were set</a> to pitch a compromise <a href="https://www.blackburn.senate.gov/services/files/178AE7B5-7583-415E-8CF3-475241C6E5F9">draft</a> that shortened the moratorium from ten to five years and exempt state legislation establishing internet protections. However, on Tuesday, Blackburn abandoned that compromise after Steve Bannon and others <a href="https://www.wsj.com/politics/policy/how-a-bold-plan-to-ban-state-ai-laws-fell-apartand-divided-trumpworld-96bce19d?st=rXP9V8&amp;reflink=desktopwebshare_permalink">reportedly</a> reached out to her.</p><p>Instead, she brought an amendment with Sen. Cantwell to remove the moratorium entirely. Lacking enough support, even Cruz voted for the amendment, which <a href="https://apnews.com/article/congress-ai-provision-moratorium-states-20beeeb6967057be5fe64678f72f6ab0">passed</a> 99-1.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3W7Q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3W7Q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3W7Q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3W7Q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3W7Q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3W7Q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3W7Q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3W7Q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3W7Q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3W7Q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0121db23-e6ab-48b8-9f8e-50a6e3705f24_1600x1067.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Sen. Blackburn cosponsored the Kids Online Safety Act last year.</em> <em>(<a href="https://ciosenus.app.box.com/s/s913tsl67j6y2owtvzbnrlpu7x6cjmrk/file/1524677161363">Source</a>.)</em></figcaption></figure></div><p>Even if the moratorium had survived the Senate, it could have faced an uphill battle in the House&#8212;Representatives <a href="https://x.com/RepMTG/status/1930650431253827806?t=rK_HvP4W2eb3qIB-FikMjw">Marjorie Taylor Greene</a> and <a href="https://x.com/RepThomasMassie/status/1930642561124716866">Thomas Massie</a> came out against it, along with other prominent Republicans like Arkansas Governor <a href="https://www.washingtonpost.com/opinions/2025/06/26/state-ai-regulations-ban-obbb/">Sarah Huckabee Sanders</a><em> </em>and <a href="https://www.wsj.com/politics/policy/how-a-bold-plan-to-ban-state-ai-laws-fell-apartand-divided-trumpworld-96bce19d?st=rXP9V8&amp;reflink=desktopwebshare_permalink">Steve Bannon</a><em>.</em></p><h1>Judges Split on Whether Training AI on Copyrighted Material is Fair Use</h1><p>Last<strong> </strong>week, two U.S. district judges decided cases involving Anthropic and Meta on the question of whether training LLMs on copyrighted works qualifies as fair use. While both judges sided with the AI companies, they sharply disagreed about how the Copyright Act should apply to similar cases&#8212;leaving legal precedent on the question ambiguous.</p><p><strong>One judge ruled that training Anthropic&#8217;s Claude on copyrighted books is fair use. </strong>U.S. District Judge William Alsup <a href="https://www.reuters.com/legal/litigation/anthropic-wins-key-ruling-ai-authors-copyright-lawsuit-2025-06-24/">granted a summary judgment</a> that Anthropic using copyrighted books to train LLMs qualifies as fair use. The <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.434709/gov.uscourts.cand.434709.231.0_2.pdf">order</a> held that three out of four of the factors considered when determining whether a given use of a copyrighted work is a fair use favored Anthropic&#8217;s use in training LLMs.</p><ol><li><p><strong>The purpose and character of the use. </strong>The court held that using copyrighted books to train LLMs is highly transformative, favoring fair use.</p></li><li><p><strong>The nature of the copyrighted work. </strong>The books in question were expressive, pointing against fair use.</p></li><li><p><strong>The amount and substantiality of the portion used. </strong>The court held that it was reasonably necessary to use the entirety of books in training LLMs, favoring fair use.</p></li><li><p><strong>The effect of the use upon the potential market for or value of the copyrighted work. </strong>No exact copies or knockoffs resulted from the use of copyrighted books to train Claude, since Anthropic implemented guardrails to prevent Claude from exactly replicating the works on which it was trained. While the use may result in an &#8220;explosion&#8221; of AI-generated writing that competes with the copyrighted books, the court held that such a market effect doesn&#8217;t count under the Copyright Act.</p></li></ol><p><strong>Digitizing print books Anthropic lawfully bought is also protected&#8212;but piracy is not. </strong>Judge<strong> </strong>Alsup drew a sharp line between scanning paperbacks Anthropic had purchased and the millions of volumes it admitted downloading from pirate libraries. Turning a lawfully owned print copy into a PDF is fair use, but pirating books is not. That issue will proceed to trial.</p><p><strong>In a case against Meta, another judge reached the opposite conclusion.</strong> While U.S. District Judge Vince Chhabria <a href="https://apnews.com/article/meta-ai-copyright-lawsuit-sarah-silverman-e77968015b94fbbf38234e3178ede578">sided with Meta</a> in its case, his <a href="https://storage.courtlistener.com/recap/gov.uscourts.cand.415175/gov.uscourts.cand.415175.598.0_1.pdf">order</a> made clear he only did so because he believed the plaintiffs made the wrong arguments and presented the wrong evidence.</p><p>His analysis of whether using copyrighted books to train LLMs is fair use agrees with Judge Alsup&#8217;s on the first three factors&#8212;but sharply disagrees on the relevance of market effects. The upshot, he writes, is that &#8220;in many circumstances it will be illegal to copy copyright-protected works to train generative AI models without permission.&#8221; He sided with Mate only because the plaintiffs failed to provide arguments or evidence showing that Meta&#8217;s LLMs resulted in market harm to their books.</p><p><strong>The judges disagree on whether &#8220;indirect displacement&#8221; is a relevant market effect under the Copyright Act. </strong>Both orders assume that LLMs may now or soon be able to generate many competitors to human-written books, which could harm the market for human-written books.</p><p>Judge Alsup writes that the authors&#8217; complaint about such an effect is &#8220;no different than it would be if they complained that training schoolchildren to write well would result in an explosion of competing works,&#8221; which is &#8220;not the kind of competitive or creative displacement that concerns the Copyright Act.&#8221;</p><p>However, Judge Chhabria responds that &#8220;using books to teach children to write is not remotely like using books to create a product that a single individual could employ to generate countless competing works with a miniscule fraction of the time and creativity it would otherwise take.&#8221; That is, he argues that a similarity in kind does not outweigh a vast difference in magnitude.</p><p><strong>Higher courts will likely settle the dispute.</strong> While Judge Alsup&#8217;s order might have provided precedence for similar cases, Chhabria&#8217;s disagreement leaves precedent ambiguous. However, both decisions fall under the jurisdiction of the Ninth Circuit, which has yet to rule on AI fair use. The authors in Anthropic&#8217;s case, at least, indicated that they will appeal the decision to the Ninth Circuit&#8212;and, ultimately, the issue may be up to the Supreme Court to decide.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><ul><li><p>Michael C. Horowitz and Lauren A. Kahn <a href="https://ai-frontiers.org/articles/nuclear-non-proliferation-is-the-wrong-framework-for-ai-governance">argue</a> that placing AI in a nuclear framework inflates expectations and distracts from practical, sector-specific governance.</p></li><li><p>Laura Gonz&#225;lez Salmer&#243;n <a href="https://ai-frontiers.org/articles/can-copyright-survive-ai">discusses</a> how copyright law is under pressure from generative AI.</p></li><li><p>Kristin O&#8217;Donoghue <a href="https://ai-frontiers.org/articles/congress-might-block-states-from-regulating-ai">argues</a> that a moratorium on state AI legislation would upend federalism and halt the experiments that drive smarter policy.</p></li><li><p>Pete Buttigieg <a href="https://petebuttigieg.substack.com/p/we-are-still-underreacting-on-ai">wrote</a> a blog post arguing that AI presents &#8220;a fundamental change to our society&#8212;and we remain dangerously underprepared.&#8221;</p></li><li><p>Researchers at UC Berkeley released <a href="https://www.cybergym.io/">CyberGym</a>, a new cybersecurity benchmark. The LLMs they evaluated discovered 15 zero-day vulnerabilities in large software projects.</p></li><li><p>A <a href="https://forecastingresearch.org/ai-enabled-biorisk">new report</a> from the Forecasting Research Institute shows that experts and superforecasters predict that existing AI capabilities may substantially increase the risk of human-caused epidemics.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-58-senate-removes?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-58-senate-removes?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #57: The RAISE Act]]></title><description><![CDATA[Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-57-the-raise</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-57-the-raise</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 17 Jun 2025 16:30:41 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!b6Zj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: The New York Legislature passes an act regulating frontier AI&#8212;but it may not be signed into law for some time.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>The RAISE Act</h1><p>New York may soon become the first state to regulate frontier AI systems. On June 12, the state&#8217;s legislature <a href="https://www.senatorgounardes.nyc/raise-act-release">passed</a> the Responsible AI Safety and Education (RAISE) Act. If New York Governor Kathy Hochul signs it into law, the <a href="https://www.nysenate.gov/legislation/bills/2025/S6953/amendment/B">RAISE Act</a> will be the most significant state AI legislation in the U.S.</p><p><strong>New York&#8217;s RAISE Act imposes four guardrails on frontier labs: </strong>developers must publish a safety plan, hold back unreasonably risky models, disclose major incidents, and face penalties for non-compliance.</p><ul><li><p><strong>Publish and maintain a safety plan.</strong> Before deployment, developers must post a redacted &#8220;safety and security protocol,&#8221; transmit the plan to both the attorney general and the Division of Homeland Security and Emergency Services, keep the unredacted version&#8212;plus all supporting test data&#8212;for five years, and review the plan each year.</p></li><li><p><strong>Withhold any model that presents an &#8220;unreasonable risk of critical harm.&#8221;</strong> Developers must delay their release and work to reduce risk if evaluations show the system poses an unreasonable risk of causing at least 100 deaths or $1 billion in damage through weapons of mass destruction or automated criminal activity.</p></li><li><p><strong>Report safety incidents within seventy-two hours.</strong> If developers discover the theft of model weights, evidence of dangerous autonomous behavior, or other events that demonstrably raises the risk of critical harm, they must report their discovery to state officials within three days.</p></li><li><p><strong>Penalties for non-compliance.</strong> The NY attorney general may seek up to $10 million for a first violation and $30 million for subsequent violations.</p></li></ul><p><strong>The RAISE Act only regulates the largest developers.</strong> Mirroring California&#8217;s SB 1047&#8212;<a href="https://www.npr.org/2024/09/20/nx-s1-5119792/newsom-ai-bill-california-sb1047-tech#:~:text=How%20Memphis%20became%20a%20battleground,growth%20for%20early%2Dstage%20companies.">vetoed by</a> Governor Gavin Newsom in 2024&#8212;the Act covers any model costing at least $100 million in compute.</p><p>Obligations fall on developers that have trained at least one frontier model and spent a cumulative $100 million on such training&#8212;and on anyone who later buys the model&#8217;s full intellectual-property rights. Accredited colleges are exempt when conducting academic research, but commercial spin-outs are not. These carve-outs serve to focus the legal burden onto the handful of firms capable of creating catastrophic harms.</p><p><strong>While New York acts, the U.S. Congress weighs a federal moratorium on state AI regulation.</strong> The &#8220;One Big Beautiful Bill Act,&#8221; the budget reconciliation package the U.S. House of Representatives approved on&#8239;May&#8239;22, contained a <a href="https://apnews.com/article/ai-regulation-state-moratorium-congress-39d1c8a0758ffe0242283bb82f66d51a">10&#8209;year federal moratorium</a> on &#8220;any law or regulation&#8221; that &#8220;restricts, governs or conditions&#8221; the design, deployment, or use of AI systems.</p><p>The moratorium was originally unlikely to pass the Senate&#8217;s Byrd Rule, which prohibits policy provisions from being included in budget reconciliation bills. The Senate Commerce Committee, chaired by Cruz, recently <a href="https://www.politico.com/live-updates/2025/06/05/congress/senate-commerce-megabill-frees-spectrum-ties-bead-to-ai-moratorium-00391136">revised</a> the moratorium such that it would be a prerequisite for states to receive billions in federal broadband expansion funds. This change could potentially bypass the Byrd rule.</p><p>However, the proposed moratorium has drawn <a href="https://x.com/RepMTG/status/1930650431253827806?t=rK_HvP4W2eb3qIB-FikMjw">criticism</a> from some Republican lawmakers&#8212;including the <a href="https://www.politico.com/newsletters/future-pulse/2025/06/16/state-ai-laws-could-get-a-reprieve-00408596">House Freedom Caucus</a>&#8212;who may be crucial to its survival. A <a href="https://www.commonsensemedia.org/press-releases/new-poll-reveals-strong-bipartisan-opposition-to-proposed-ban-on-state-ai-laws">recent poll found</a> that proposal appears to be unpopular with the party&#8217;s base, with 50 percent of Republican voters saying they opposed the moratorium compared to 30 percent saying they supported it. Last week, a bipartisan group of 260 state legislators also wrote <a href="https://ari.us/state-lawmakers-urge-congress-to-drop-ai-law-preemption/">a letter</a> to congress opposing the moratorium.</p><p><strong>The RAISE Act isn&#8217;t law yet</strong>. Although both chambers have passed the bill, they have not yet delivered it to Governor Kathy Hochul&#8212;a step lawmakers can take at any point during 2025.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b6Zj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b6Zj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 424w, https://substackcdn.com/image/fetch/$s_!b6Zj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 848w, https://substackcdn.com/image/fetch/$s_!b6Zj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 1272w, https://substackcdn.com/image/fetch/$s_!b6Zj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b6Zj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png" width="1456" height="389" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:389,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!b6Zj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 424w, https://substackcdn.com/image/fetch/$s_!b6Zj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 848w, https://substackcdn.com/image/fetch/$s_!b6Zj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 1272w, https://substackcdn.com/image/fetch/$s_!b6Zj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaa39fa0-a05c-4785-9130-ab331a0e0e34_1600x427.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>A diagram depicting the bill&#8217;s current status. <a href="https://www.nysenate.gov/legislation/bills/2025/S6953/amendment/B">Source</a>.</em></figcaption></figure></div><p>Once the bill is finally sent, Hochul will have up to 30 days to sign it, veto it, or negotiate &#8220;chapter amendments,&#8221; the back-and-forth revisions governors often use to tweak language before giving final approval. Until that clock starts, the measure sits in limbo, and its ultimate shape&#8212;possibly even its survival&#8212;remains an open question.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p>Government</p><ul><li><p>Secretary of Commerce Howard Lutnick <a href="https://www.commerce.gov/news/press-releases/2025/06/statement-us-secretary-commerce-howard-lutnick-transforming-us-ai">announced</a> plans to reform the U.S. AI Safety Institute into the Center for AI Standards and Innovation (CAISI).</p></li></ul><p>Industry</p><ul><li><p>Google <a href="https://blog.google/products/gemini/gemini-2-5-pro-latest-preview/">released</a> an upgraded preview of Gemini 2.5 Pro, which scores the highest on most benchmarks.</p></li><li><p>Sam Altman wrote a <a href="https://blog.samaltman.com/the-gentle-singularity">new blog post</a> discussing how an intelligence recursion would be rapid but &#8220;gentle&#8221;: "If we can do a decade&#8217;s worth of research in a year, or a month, then the rate of progress will obviously be quite different."</p></li><li><p>Meta <a href="https://apnews.com/article/meta-ai-superintelligence-agi-scale-alexandr-wang-4b55aabf7ea018e38ffdccb66e37cf26">invested</a> $14.3 billion in Scale AI and hired its CEO Alexandr Wang to run a new superintelligence team.</p></li></ul><p>Civil Society</p><ul><li><p>David &#8220;davidad&#8221; Dalrymple <a href="https://ai-frontiers.org/articles/ai-grid-blackouts-guarantees">argues</a> that in order to fulfill the potential of AI in safety-critical domains like energy grids, we need to develop more robust, mathematical guarantees of safety.</p></li><li><p>Kevin Frazier <a href="https://ai-frontiers.org/articles/options-for-ai-liability">writes</a> about how in the absence of federal legislation, the burden of managing AI risks has fallen to judges and state legislators&#8212;actors lacking the tools needed to ensure consistency, enforceability, or fairness.</p></li><li><p>Vanessa Bates Ramirez <a href="https://ai-frontiers.org/articles/ai-friends-openai-study">writes</a> that, while AI is increasingly being used for emotional support, research from OpenAI and MIT raises concerns that it may leave some users feeling even worse.</p></li><li><p>Nora Ammann and Sarah Hastings-Woodhouse <a href="https://ai-frontiers.org/articles/ai-arms-race-assurance-technologies">discuss</a> how assurance technologies could help de-escalate an AI arms race.</p></li><li><p>Epoch AI <a href="https://epoch.ai/data/ai-supercomputers?view=map#explore-the-data">released</a> a dataset that maps the world&#8217;s largest AI supercomputers.</p></li><li><p>Yoshua Bengio launched <a href="https://lawzero.org/en">LawZero</a>, a nonprofit advancing &#8220;safe-by-design&#8221; AI.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-57-the-raise?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-57-the-raise?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #56: Google Releases Veo 3]]></title><description><![CDATA[Plus, Opus 4 Demonstrates the Fragility of Voluntary Governance]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-56-google-releases</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-56-google-releases</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Wed, 28 May 2025 15:02:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!HZ8I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: Google released a frontier video generation model at its annual developer conference; Anthropic&#8217;s Claude Opus 4 demonstrates the danger of relying on voluntary governance.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>Google Releases Veo 3</h1><p>Last week, Google made several <a href="https://blog.google/technology/developers/google-io-2025-collection/">AI announcements</a> at I/O 2025, its annual developer conference. An announcement of particular note is <a href="https://deepmind.google/models/veo/">Veo 3</a>, Google&#8217;s newest video generation model.</p><p><strong>Frontier video and audio generation.</strong> Veo 3 outperforms other models on <a href="https://deepmind.google/models/veo/evals/">human preference benchmarks</a>, and generates both audio and video.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HZ8I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HZ8I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!HZ8I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!HZ8I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!HZ8I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HZ8I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/da24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HZ8I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!HZ8I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!HZ8I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!HZ8I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda24a5e2-92d6-490e-b74f-88fa68203799_1600x900.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Google showcasing a video generated with Veo 3. (<a href="https://www.axios.com/2025/05/23/google-ai-videos-veo-3">Source</a>)</figcaption></figure></div><p>If you just look at benchmarks, Veo 3 is a substantial improvement over other systems. But relative benchmark improvement only tells part of the story&#8212;the absolute capabilities of systems ultimately determine their usefulness. Veo 3 looks like a marked qualitative improvement over other models&#8212;it generates video and audio with extreme faithfulness, and we recommend you <a href="https://x.com/HashemGhaili/status/1925332319604257203">see</a> <a href="https://x.com/minchoi/status/1925387367806115943">some</a> <a href="https://x.com/laszlogaal_/status/1925094336200573225">examples</a> for yourself. Veo 3 may represent the point video generation crosses the line between being an interesting toy and being genuinely useful.</p><p><strong>Other announcements at I/O 2025</strong>. Other highlights from the conference include:</p><ul><li><p><a href="https://blog.google/technology/google-deepmind/google-gemini-updates-io-2025/">Gemini 2.5</a> Pro now leads LMArena and WebDev Arena. Deep Think mode, a reasoning feature that scored 49.4% on the USA Mathematical Olympiad 2025 (more than twice OpenAI&#8217;s o3, which scored 21.7%). Gemini 2.5 Flash now performs better across reasoning, multimodality, code, and long context while becoming 20-30% more efficient in token usage.</p></li><li><p><a href="https://blog.google/technology/google-deepmind/gemini-diffusion/">Gemini Diffusion</a>, an experimental (non-frontier) text diffusion model, delivers output 4-5 times faster than comparable models while rivaling the performance of models twice its size. Most LLMs are autoregressive models, which generate one token at a time&#8212;in contrast, diffusion models generate an entire response at once.</p></li><li><p>Google also announced <a href="https://developers.googleblog.com/en/introducing-gemma-3n/">Gemma 3n</a>, an open model small enough to run on mobile devices, a public beta for Google&#8217;s autonomous coding agent <a href="https://blog.google/technology/google-labs/jules/">Jules</a>, a new <a href="https://blog.google/products/search/google-search-ai-mode-update/#ai-mode-search">AI search</a> feature, an <a href="https://blog.google/technology/ai/google-synthid-ai-content-detector/">AI watermarker</a> that identifies content generated by Google&#8217;s systems, and more.</p></li></ul><p><strong>AI is here to stay.</strong> AI use is sometimes driven by trends&#8212;for example, <a href="https://x.com/sama/status/1906771292390666325">ChatGPT added a million users in an hour</a> during the &#8216;Ghiblification&#8217; craze. However, as AI systems become genuinely useful across more tasks, they will become ubiquitous and enduring. Google&#8217;s Gemini app now has <a href="https://blog.google/technology/ai/io-2025-keynote/">400M monthly active users</a>, and its AI products now process over 480 trillion tokens a month&#8212;up from 9.7 trillion last year.</p><h1>Opus 4 Demonstrates the Fragility of Voluntary Governance</h1><p>Last week, Anthropic released Claude Opus 4 and Claude Sonnet 4. Both exhibit broadly frontier performance, and lead the field on coding benchmarks. Claude Opus 4 is also Anthropic&#8217;s first model to meet <a href="https://time.com/7287806/anthropic-claude-4-opus-safety-bio-risk/">its ASL-3 safety measure</a>, which designates models that pose substantial risk. However, Anthropic rolled back several safety and security commitments prior to releasing Opus 4, demonstrating that voluntary governance is not to be relied on.</p><p><strong>Opus 4 exhibits hazardous dual-use capabilities.</strong> In one result from its <a href="https://www-cdn.anthropic.com/6be99a52cb68eb70eb9572b4cafad13df32ed995.pdf">system card</a>, Opus 4 provides a clear uplift in trials measuring its ability to help malicious actors acquire biological weapons.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E38G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E38G!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 424w, https://substackcdn.com/image/fetch/$s_!E38G!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 848w, https://substackcdn.com/image/fetch/$s_!E38G!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 1272w, https://substackcdn.com/image/fetch/$s_!E38G!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E38G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png" width="1456" height="548" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad471014-fe58-4180-a67a-9b48862263b9_1600x602.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:548,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E38G!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 424w, https://substackcdn.com/image/fetch/$s_!E38G!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 848w, https://substackcdn.com/image/fetch/$s_!E38G!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 1272w, https://substackcdn.com/image/fetch/$s_!E38G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad471014-fe58-4180-a67a-9b48862263b9_1600x602.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic&#8217;s Chief Scientist Jarad Kaplan <a href="https://time.com/7287806/anthropic-claude-4-opus-safety-bio-risk/">told TIME</a> that malicious actors could use Opus 4 to &#8220;try to synthesize something like COVID or a more dangerous version of the flu&#8212;and basically, our modeling suggests that this might be possible.&#8221; It&#8217;s not just Opus 4: <a href="https://www.ai-frontiers.org/articles/ais-are-disseminating-expert-level-virology-skills">several frontier models outperform human experts in dual-use virology tests</a>.</p><p>The system card also reports that Apollo Research found an early Claude Opus 4 version exhibited "scheming and deception," advising against its release. Anthropic says it implemented internal fixes; however, <a href="https://www.youtube.com/watch?v=Xn_5aIhrJOE&amp;t=510s">it doesn&#8217;t appear that</a> Anthropic had Apollo Research re-evaluate the final, released version.</p><p><strong>Anthropic&#8217;s safety protections may be insufficient.</strong> In light of Opus 4&#8217;s dangerous capabilities, Anthropic rolled out <a href="https://www.anthropic.com/news/activating-asl3-protections">ASL-3 safety protections</a>. However, early public response to Opus 4 indicates that those protections might be insufficient. For example, one researcher showed that Claude Opus 4's WMD safeguards can be bypassed to generate <a href="https://x.com/ARGleave/status/1926138376509440433">over 15 pages of detailed instructions for producing sarin gas</a>.</p><p><strong>Anthropic walked back safety and security commitments prior to Opus 4&#8217;s release.</strong> Anthropic has also faced criticism for walking back safety commitments prior to Opus 4&#8217;s release. For example, Anthropic&#8217;s <a href="https://www.anthropic.com/news/anthropics-responsible-scaling-policy">September 2023 Responsible Scaling Policy</a> (RSP) committed to define detailed ASL-4 "warning sign evaluations" before their systems reached ASL-3 capabilities; however, it <a href="https://www.obsolete.pub/p/exclusive-anthropic-is-quietly-backpedalling">hadn&#8217;t done so at the time of Opus 4&#8217;s release</a>. This is because Anthropic redlined that requirement in an <a href="https://www.anthropic.com/news/announcing-our-updated-responsible-scaling-policy">October 2024 revision</a> to its RSP.</p><p>Anthropic also <a href="https://x.com/RyanPGreenblatt/status/1925992239332724921">weakened its ASL-3 security requirements</a> shortly before Opus 4's ASL-3 announcement, specifically no longer requiring robustness against employees stealing model weights if they already had access to "systems that process model weights."</p><p><strong>Voluntary governance is fragile.</strong> Whether or not Anthropic&#8217;s changes to its safety and security policies are justified, voluntary commitments are not sufficient to ensure model releases are safe. There&#8217;s nothing stopping Anthropic or other AI companies from walking back critical commitments in the face of competitive pressure to rush releases.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><p><strong>Government</strong></p><ul><li><p>JD Vance discussed why he&#8217;s worried about AI in <a href="https://www.nytimes.com/2025/05/21/opinion/jd-vance-pope-trump-immigration.html">a recent interview</a>.</p></li><li><p>A judge <a href="https://www.transparencycoalition.ai/news/important-early-ruling-in-characterai-case-this-chatbot-is-a-product-not-speech">ruled</a> that Character.AI is a product for the purposes of product liability in a lawsuit over a boy&#8217;s suicide after interacting with a Character.AI chatbot.</p></li></ul><p><strong>Industry</strong></p><ul><li><p>OpenAI <a href="https://www.npr.org/2025/05/22/nx-s1-5407548/openai-jony-ive-io-deal-ai-devices">bought</a> iPhone designer Jony Ive&#8217;s startup, io, for $6.5 billion.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Peter N. Salib and Simon Goldstein <a href="https://www.ai-frontiers.org/articles/todays-ais-arent-paperclip-maximizers">argue</a> that today&#8217;s AI systems aren&#8217;t paperclip maximizers.</p></li><li><p>Devid Kirichenko <a href="https://www.ai-frontiers.org/articles/how-ai-is-eroding-the-norms-of-war">writes</a> about how drones are eroding the norms of war.</p></li><li><p>ARC Prize released a new reasoning benchmark, <a href="https://arcprize.org/blog/arc-agi-2-technical-report">ARC-AGI-2</a>, on which frontier reasoning models score in low single-digits.</p></li><li><p>CSET is <a href="https://cset.georgetown.edu/wp-content/uploads/FRG-Call-for-Research-Ideas-Internal-Deployment.pdf">funding</a> research on risks from internal deployment of frontier AI models.</p></li><li><p>A new <a href="https://arxiv.org/abs/2505.09662">paper</a> found that Claude Sonnet 3.5 is significantly more persuasive than humans.</p></li><li><p>An Axios <a href="https://www.axios.com/newsletters/axios-ai-plus-62025700-399b-11f0-b37f-b73dfdd12f1d.html?stream=top">poll</a> found that 77% of Americans want AI companies to slow down.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-56-google-releases?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-56-google-releases?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #55: Trump Administration Rescinds AI Diffusion Rule, Allows Chip Sales to Gulf States]]></title><description><![CDATA[Plus, Bills on Whistleblower Protections, Chip Location Verification, and State Preemption]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-55-trump-administration</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-55-trump-administration</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 20 May 2025 14:43:03 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!AH03!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: The Trump Administration rescinds the Biden-era AI diffusion rule and sells AI chips to the UAE and Saudi Arabia; Federal lawmakers propose legislation on AI whistleblowers, location verification for AI chips, and prohibiting states from regulating AI.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p>The Center for AI Safety is also excited to announce the Summer session of our AI Safety, Ethics, and Society course, running from <strong>June 23 to September 14</strong>. The course, based on our recently published <a href="https://www.routledge.com/Introduction-to-AI-Safety-Ethics-and-Society/Hendrycks/p/book/9781032869926?srsltid=AfmBOornG5wYvl7unW0XGlM2bjuOV97TAAkehylkWjbyE847srTgGOgs">textbook</a>, is open to participants from all disciplines and countries, and is designed to accommodate full-time work or study.</p><p>Applications for the Summer 2025 course are now open. The final application deadline is <strong>May 30th</strong>. Visit the <a href="https://www.aisafetybook.com/virtual-course">course website</a> to learn more and apply.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>Trump Administration Rescinds AI Diffusion Rule, Allows Chip Sales to Gulf States</h1><p>On May 12th, the Department of Commerce <a href="https://media.bis.gov/sites/default/files/documents/05.07%20Recission%20of%20AI%20Diffusion%20Press%20Release.pdf">announced</a> that it had rescinded the <a href="https://www.federalregister.gov/documents/2025/01/15/2025-00636/framework-for-artificial-intelligence-diffusion">Framework for Artificial Intelligence Diffusion</a>, which was set to take effect May 15th. The rule would have regulated the export of AI chips and models across three tiers of countries, each with its own set of restrictions. (Other AI chip export controls, including those prohibiting sales to China, remain on the books.)</p><p>The announcement states that the Bureau of Industry and Security (BIS) will issue a replacement rule in the future. In the meantime, the BIS will focus on working to prevent US chips from being used in Chinese AI development. Bloomberg <a href="https://www.bloomberg.com/news/articles/2025-05-07/trump-to-rescind-global-chip-curbs-amid-ai-restrictions-debate">reports</a> that new restrictions will focus on countries that have diverted US chips to China, including Thailand and Malaysia.</p><p><strong>The Trump Administration wants to capture the global AI chip market.</strong> Though China has yet to export its own AI chips, the BIS will also issue guidance that states using Huawei Ascend chips violates US export controls. This preemptive restriction supports the Trump Administration&#8217;s intent for the US to dominate the global AI chip market.</p><p><strong>UAE and Saudi Arabia are set to receive hundreds of thousands of AI chips</strong>. Last week, Trump announced trade deals with the <a href="https://www.whitehouse.gov/fact-sheets/2025/05/fact-sheet-president-donald-j-trump-secures-200-billion-in-new-u-s-uae-deals-and-accelerates-previously-committed-1-4-trillion-uae-investment/">UAE</a> and <a href="https://www.whitehouse.gov/fact-sheets/2025/05/fact-sheet-president-donald-j-trump-secures-historic-600-billion-investment-commitment-in-saudi-arabia/">Saudi Arabia</a>, respectively.</p><p>The UAE is set to receive up to <a href="https://www.reuters.com/business/finance/us-close-letting-uae-import-millions-nvidias-ai-chips-sources-say-2025-05-14/">500,000 of Nvidia's most advanced chips per year</a>, beginning in 2025. 100,000 of these would go to the Emirati firm G42, with the remainder going to U.S. companies building datacenters in the UAE. Following the deal&#8217;s <a href="https://www.commerce.gov/news/press-releases/2025/05/uae/us-framework-advanced-technology-cooperation">announcement</a>, G42 announced the construction of a <a href="https://www.commerce.gov/news/press-releases/2025/05/uae-and-us-presidents-attend-unveiling-phase-1-new-5gw-ai-campus-abu">five GW AI campus</a> in Abu Dhabi&#8212;the <a href="https://x.com/ohlennart/status/1923091524688007474?">largest AI infrastructure project anywhere in the world</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AH03!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AH03!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!AH03!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!AH03!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!AH03!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AH03!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png" width="1456" height="970" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AH03!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!AH03!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!AH03!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!AH03!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45cc31a2-d027-43bd-9f4f-2b26b23e051b_1600x1066.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">President Trump with the Emirati president, Sheikh Mohammed bin Zayed, at the AI campus&#8217; unveiling. (<a href="https://www.nytimes.com/2025/05/15/us/politics/ai-us-abu-dhabi.html">Source</a>.)</figcaption></figure></div><p>Nvidia <a href="https://nvidianews.nvidia.com/news/humain-and-nvidia-announce-strategic-partnership-to-build-ai-factories-of-the-future-in-saudi-arabia">announced</a> a strategic partnership with Saudi Arabia&#8217;s new sovereign AI company, <a href="https://www.pif.gov.sa/en/news-and-insights/press-releases/2025/hrh-crown-prince-launches-humain-as-global-ai-powerhouse/">Humain</a>. In the first phase of the partnership, Humain is set to receive <a href="https://www.nvidia.com/en-us/">18,000 Blackwell chips</a>. AMD <a href="https://www.amd.com/en/newsroom/press-releases/2025-5-13-amd-and-humain-form-strategic--10b-collaboration-.html">also announced</a> a partnership with Humain.</p><p><strong>The chip sales affect several US priorities. </strong>The deals will direct large investments to US AI companies that might have otherwise gone to China (China is the leading source of revenue for both the <a href="https://oec.world/en/profile/country/are">UAE</a> and <a href="https://oec.world/en/profile/country/sau">Saudi Arabia)</a>. It will also allow US AI companies to circumvent compute capacity limitations imposed by the US&#8217; energy grid.</p><p>Some US officials argue that the Trump Administration&#8217;s chip sales <a href="https://www.nytimes.com/2025/05/15/business/economy/trump-chips-ai-uae.html">threaten to undermine the US&#8217; lead in compute capacity</a>, and consequently US national security, since compute capacity may soon become <a href="https://www.rand.org/pubs/commentary/2025/05/chinas-ai-models-are-closing-the-gap-but-americas-real.html">a key determinant of state power</a>. However, it&#8217;s difficult to evaluate the sales&#8217; overall effects on US interests, since the terms of the agreement are unclear.</p><h1>Bills on Whistleblower Protections, Chip Location Verification, and State Preemption</h1><p><strong>A federal AI whistleblower protection act</strong>. Senate Judiciary Committee Chair Chuck Grassley introduced the <a href="https://www.judiciary.senate.gov/press/rep/releases/grassley-introduces-ai-whistleblower-protection-act">Artificial Intelligence Whistleblower Protection Act</a>, which would protect employees who come forward with information about harmful or illegal activities happening inside AI companies.</p><p><a href="https://law-ai.org/how-to-design-ai-whistleblower-legislation/">Current AI whistleblower protections aren&#8217;t effective.</a> Currently, these sorts of laws only exist as a patchwork across jurisdictions, making it difficult for would-be AI whistleblowers to predict whether they would be protected. They also often only protect reporting violations of law. Because AI regulation is minimal, developer behavior that poses a threat to public safety may not violate any law.</p><p>AI companies can also require employees to sign NDAs preventing them from disparaging the company even after they leave. OpenAI <a href="https://www.vox.com/future-perfect/351132/openai-vested-equity-nda-sam-altman-documents-employees">had employees sign such an NDA</a>, which they later discontinued after public pressure.</p><p>The AI Whistleblower Protection Act <a href="https://x.com/MackenZ_arnold/status/1923099536588767733">addresses these shortcomings</a>. It covers disclosing any &#8220;substantial and specific&#8221; danger that AI developer behavior might pose to public safety, public health, or national security. It also prohibits AI companies from requiring employees to sign NDAs or other contracts that undermine their ability to make such disclosures.</p><p><strong>A bill requiring location verification for AI chips.</strong> Senator Tom Cotton introduced the <a href="https://www.cotton.senate.gov/news/press-releases/cotton-introduces-bill-to-prevent-diversion-of-advanced-chips-to-americas-adversaries-and-protect-us-product-integrity">Chip Security Act</a>, which would require <a href="https://www.ai-frontiers.org/articles/location-verification-ai-chips">location verification mechanisms</a> for export-controlled AI chips.</p><p>The bill would strengthen US export controls by preventing AI chips from being smuggled into China. AI chip smuggling is a growing problem, with potentially <a href="https://www.aipolicybulletin.org/articles/ai-chip-smuggling-is-the-default-not-the-exception">100,000 chips smuggled in 2024</a>.</p><p>Currently, US officials struggle to determine what happens to AI chips once they&#8217;re shipped overseas. Location verification would allow export authorities to tell when a shipment of chips isn&#8217;t where it&#8217;s supposed to be, triggering further investigation.</p><p><strong>A provision in a tax bill prohibiting states from regulating AI.</strong> The House Energy and Commerce Committee included a provision that would <a href="https://apnews.com/article/ai-regulation-state-moratorium-congress-39d1c8a0758ffe0242283bb82f66d51a">prohibit states from regulating AI</a> in its <a href="https://docs.house.gov/meetings/IF/IF00/20250513/118261/HMKP-119-IF00-20250513-SD003.pdf">markup</a> of House Republicans&#8217; <a href="https://apnews.com/article/trump-big-beautiful-bill-medicaid-cuts-125ad670515460108ded062029abd8c8">tax bill</a>.</p><p>Ever since <a href="https://www.npr.org/2024/09/20/nx-s1-5119792/newsom-ai-bill-california-sb1047-tech">California&#8217;s SB 1047 almost became law</a>, AI companies have argued that states should be prohibited from regulating AI, and instead leave the problem to the federal government. SB 1047 would have made AI companies liable for harm caused by their models.</p><p>However, the provision seems to run afoul of the Senate&#8217;s &#8220;<a href="https://www.congress.gov/crs_external_products/RL/PDF/RL30862/RL30862.20.pdf">Byrd Rule</a>,&#8221; which prohibits policy provisions from being included in budget reconciliation bills.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p><strong>Industry</strong></p><ul><li><p><a href="https://deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/">Google announced AlphaEvolve</a>, an evolutionary coding agent powered by Gemini. Among other results, AlphaEvolve found an algorithm for multiplying 4x4 complex-valued matrices that bested a record set in 1969, and a method to run Google&#8217;s datacenters 0.7% more efficiently.</p></li><li><p><a href="https://openai.com/index/introducing-codex/">OpenAI introduced Codex</a>, a cloud-based agent powered by a version of o3 optimized for software engineering.</p></li><li><p>Grok started responding to unrelated queries <a href="https://techcrunch.com/2025/05/15/xai-blames-groks-obsession-with-white-genocide-on-an-unauthorized-modification/">by discussing &#8220;white genocide&#8221;</a> in South Africa. xAI blamed the incident on an &#8220;unauthorized modification&#8221; (<a href="https://fortune.com/2025/02/24/xai-chief-engineer-blames-former-openai-employee-grok-blocks-musk-trump-misinformation/">again</a>).</p></li><li><p><a href="https://techcrunch.com/2025/05/14/openai-brings-its-gpt-4-1-models-to-chatgpt/">OpenAI released GPT-4.1</a> and GPT-4.1 mini.</p></li><li><p>Bloomberg reports that <a href="https://www.bloomberg.com/news/articles/2025-05-17/microsoft-layoffs-highlight-ai-driven-hiring-pauses">a growing number of companies are reducing their workforce because of AI</a>. For example, Microsoft cut about 6,000 jobs last week, about 3 percent of its workforce.</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Dan Hendrycks and Laura Hiscott argue that despite years of effort, <a href="https://www.ai-frontiers.org/articles/the-misguided-quest-for-mechanistic-ai-interpretability">mechanistic interpretability has failed to provide insight into AI behavior</a>&#8212;the result of a flawed foundational assumption.</p></li><li><p>Scott Mulligan discusses <a href="https://www.ai-frontiers.org/articles/location-verification-ai-chips">whether location verification can stop AI chip smuggling</a>.</p></li><li><p>Eliezer Yudkowsky and Nate Soares are publishing a new book on AI safety: <em><a href="https://ifanyonebuildsit.com/">If Anyone Builds It, Everyone Dies</a></em>.</p></li><li><p>The NYT discusses <a href="https://www.nytimes.com/2025/05/15/world/europe/pope-leo-artificial-intelligence.html">Pope Leo XIV&#8217;s focus on AI&#8217;s risks to humankind</a>.</p></li></ul><p>See also: <a href="https://x.com/ai_risks?lang=en">CAIS&#8217; X account</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-55-trump-administration?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-55-trump-administration?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #54: OpenAI Updates Restructure Plan]]></title><description><![CDATA[Plus, AI Safety Collaboration in Singapore]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-54-openai-updates</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-54-openai-updates</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 13 May 2025 15:52:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rjTm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: OpenAI claims an updated restructure plan would preserve nonprofit control; A global coalition meets in Singapore to propose a research agenda for AI safety.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>OpenAI Updates Restructure Plan</h1><p>On May 5th, OpenAI announced <a href="https://openai.com/index/evolving-our-structure/">a new restructure plan</a>. The announcement walks back a December&#8239;2024 proposal that would have had OpenAI&#8217;s nonprofit&#8212;which oversees the company&#8217;s for-profit operations&#8212;sell its controlling shares to the for-profit side of the company. That plan drew sharp criticism from former employees and civil&#8209;society <a href="https://notforprivategain.org/">groups</a> and prompted <a href="https://www.reuters.com/business/elon-musk-keep-lawsuit-against-openai-despite-nonprofit-control-statement-lawyer-2025-05-06/">a lawsuit</a> from co&#8209;founder Elon&#8239;Musk, who argued OpenAI was abandoning its charitable mission.</p><p><strong>OpenAI claims the new plan preserves nonprofit control, but is light on specifics.</strong> Like the original plan, OpenAI&#8217;s new plan would have OpenAI&#8239;Global&#8239;LLC become a public&#8209;benefit corporation (PBC). However, instead of the nonprofit selling its control over the LLC, OpenAI claims the nonprofit would retain control of the PBC.</p><p>It&#8217;s unclear what form that control would take. The announcement claims that the nonprofit would be a large (but not necessarily majority) shareholder of the PBC, and, in a <a href="https://www.wired.com/story/openai-announce-nonprofit-structure-sam-altman/">press call</a>, an OpenAI spokesperson told reporters that the nonprofit would be able to appoint and remove PBC directors.</p><p><strong>The new plan may still remove governance safeguards.</strong> Arguably, the new plan may not preserve any of the <a href="https://notforprivategain.org/">governance safeguards</a> that critics said would be lost in the original reorganization plan.</p><p>First, unlike OpenAI&#8217;s original &#8220;capped&#8209;profit&#8221; structure, the new PBC will issue ordinary stock with no ceiling on investor returns. The capped-profit model was intended to ensure that OpenAI equitably distributed the resources of developing AGI. CEO Sam&#8239;Altman framed the move as a simplification that would allow OpenAI to raise capital. However, if OpenAI developed AGI, the nonprofit would only partially control the wealth OpenAI would accumulate.</p><p>Second, owning shares of and appointing directors to the PBC does not ensure that the nonprofit could adequately control the PBC&#8217;s behavior. The PBC would be legally required to balance its charitable purpose with investor returns, and the nonprofit would only have indirect influence over the PBCs behavior. This differs from the current model, in which the nonprofit has direct <a href="https://openai.com/index/update-on-safety-and-security-practices/">oversight of development and deployment decisions</a>. (For more discussion of corporate governance, see <a href="https://www.aisafetybook.com/textbook/corporate-governance">our textbook</a>.)</p><p><strong>The new plan isn&#8217;t a done deal.</strong> Up to $30&#8239;billion in funding led by SoftBank is <a href="https://www.wired.com/story/openai-announce-nonprofit-structure-sam-altman/">contingent on the new plan going through</a>. Microsoft, OpenAI&#8217;s largest backer, holds veto power over structural changes and is negotiating what percentage of the PBC it will own. Delaware Attorney General Kathy&#8239;Jennings said she will review the new plan; her California counterpart signaled a similar review.</p><h1>AI Safety Collaboration in Singapore</h1><p>Last week, the 2025 Singapore Conference on AI brought together more than 100 participants from 11 countries to identify AI safety research priorities, resulting in a 40-page <a href="https://aisafetypriorities.org/">consensus document</a>.</p><p><strong>The conference included each AISI&#8212;including the US and China.</strong> The conference is notable in that it was sponsored by <a href="https://www.scai.gov.sg/2025/about-scai-2025/">the government of Singapore</a> and included participation of every government AI safety institute, including the US AISI and its Chinese counterpart.</p><p>Singapore has close ties with both the US and China, and its government has <a href="https://www.imda.gov.sg/resources/press-releases-factsheets-and-speeches/press-releases/2025/singapore-ai-safety-initiatives-global-ai-summit-france">previously shown support for AI safety</a>&#8212;meaning it has the potential to play a key role in negotiating international agreements on AI between the US and China.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rjTm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rjTm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 424w, https://substackcdn.com/image/fetch/$s_!rjTm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 848w, https://substackcdn.com/image/fetch/$s_!rjTm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 1272w, https://substackcdn.com/image/fetch/$s_!rjTm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rjTm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png" width="1456" height="968" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:968,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rjTm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 424w, https://substackcdn.com/image/fetch/$s_!rjTm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 848w, https://substackcdn.com/image/fetch/$s_!rjTm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 1272w, https://substackcdn.com/image/fetch/$s_!rjTm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41e07002-c5fd-4c60-a259-24780e32f211_1600x1064.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Singapore&#8217;s Minister for Digital Development and Information speaks at the conference. <a href="https://www.scai.gov.sg/2025/about-scai-2025/">Source.</a></figcaption></figure></div><p><strong>The document recommends defense in depth for AI safety.</strong> The consensus document itself takes a defence&#8209;in&#8209;depth approach (proposed in <em><a href="https://arxiv.org/abs/2109.13916">Unsolved Problems in ML Safety</a></em>) to technical AI safety research made up of three pillars: risk&#8239;assessment, development, and control.</p><ul><li><p><strong>Risk assessment. </strong>The first pillar proposes research tracks to make risk measurement rigorous, repeatable, and hard to game. These tools would let regulators set clear red&#8209;lines for scaling.</p></li><li><p><strong>Development. </strong>The second pillar focuses on baking safety into the development process by turning broad goals into concrete design requirements, adding safeguards during training, and stress&#8209;testing models before release.</p></li><li><p><strong>Monitoring and Control. </strong>The third pillar aims to control deployed systems through continuous monitoring, dependable shutdown options, and tracking how systems are used. These measures are meant to spot trouble early and give authorities clear levers to intervene when needed.</p></li></ul><p>Each area contains concrete research programmes designed to give policymakers shared &#8220;areas of mutual interest&#8221; where cooperation is rational even among competitors.</p><p>While the document doesn&#8217;t break new technical ground&#8212;or discuss non-technical issues like <a href="https://www.nationalsecurity.ai/">geopolitical conflict</a>&#8212;these three pillars show how even rival companies and states can benefit from cooperation on technical AI safety research.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>In Other News</h1><p><strong>Government</strong></p><ul><li><p>The NSF issued an <a href="https://www.federalregister.gov/documents/2025/04/29/2025-07332/request-for-information-on-the-development-of-a-2025-national-artificial-intelligence-ai-research">RFI</a> seeking public input for the 2025 National AI&#8239;R&amp;D Strategic Plan (comments due May&#8239;29).</p></li><li><p><a href="https://www.reuters.com/world/china/chinas-xi-calls-self-sufficiency-ai-development-amid-us-rivalry-2025-04-26/">President&#8239;Xi</a> urged China to achieve AI self&#8209;reliance in chips and software to narrow the gap with the United States, while ensuring safety.</p></li><li><p>China is set to <a href="https://www.digitimes.com/news/a20250423PD215/equipment-2024.html?chid=10">merge more than 200 firms</a> into about 10 giants to strengthen semiconductor self&#8209;sufficiency.</p></li><li><p>At a <a href="https://apnews.com/article/openai-ceo-sam-altman-congress-senate-testify-ai-20e7bce9f59ee0c2c9914bc3ae53d674">Senate hearing</a>, Sam&#8239;Altman and other tech leaders urged strategic investment to keep U.S. AI ahead of China.</p></li><li><p>The European Commission launched a<a href="https://digital-strategy.ec.europa.eu/en/news/commission-seeks-input-clarify-rules-general-purpose-ai-models"> public consultation</a> to clarify how the EU AI&#8239;Act will regulate general&#8209;purpose models, with draft guidance due before August&#8239;2025.</p></li><li><p>Pope Leo XIV <a href="https://apnews.com/article/pope-leo-vision-papacy-artificial-intelligence-36d29e37a11620b594b9b7c0574cc358">identified AI</a> as one of the most important challenges facing humanity.</p></li><li><p>President Trump <a href="https://techcrunch.com/2025/05/11/trump-fires-copyright-office-director-after-report-raises-questions-about-ai-training/">fired the director of the U.S. Copyright Office</a> after a <a href="https://www.copyright.gov/ai/Copyright-and-Artificial-Intelligence-Part-3-Generative-AI-Training-Report-Pre-Publication-Version.pdf">report</a> raised concerns about the use of copyrighted material in AI training.</p></li></ul><p><strong>Industry</strong></p><ul><li><p><a href="https://www.futurehouse.org/research-announcements/launching-futurehouse-platform-ai-agents">FutureHouse</a> unveiled a platform of four scientific agents for literature search and experiment design.</p></li><li><p>Visa <a href="https://corporate.visa.com/en/products/intelligent-commerce.html">announced</a> a pilot program to let autonomous AI agents make purchases directly on its network.</p></li><li><p>Alibaba released <a href="https://qwenlm.github.io/blog/qwen3/">Qwen3</a>, claiming state&#8209;of&#8209;the&#8209;art scores for an open-weight model.</p></li><li><p>Anthropic set up an <a href="https://www.anthropic.com/news/introducing-the-anthropic-economic-advisory-council">Economic Advisory Council</a> of leading economists to study AI&#8217;s impact on labor and growth.</p></li><li><p><a href="https://techcrunch.com/2025/05/03/googles-gemini-has-beaten-pokemon-blue-with-a-little-help/">Gemini&#8239;2.5&#8239;Pro beat Pok&#233;mon&#8239;Blue</a> (with some help).</p></li></ul><p><strong>Civil Society</strong></p><ul><li><p>Common&#8239;Sense Media warns that <a href="https://www.commonsensemedia.org/ai-ratings/social-ai-companions">social&#8209;AI companion apps pose &#8220;unacceptable risks&#8221; for anyone under 18</a> and calls for age bans and more research.</p></li><li><p>A new paper argues that <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5242643">AI agents should be designed to be law-following</a>.</p></li><li><p>404 Media reported that researchers secretly conducted a large, <a href="https://www.404media.co/researchers-secretly-ran-a-massive-unauthorized-ai-persuasion-experiment-on-reddit-users/">unauthorized AI persuasion experiment on Reddit users</a>.</p></li></ul><p><strong>AI Frontiers</strong></p><ul><li><p>Helen Toner argues that <a href="https://www.ai-frontiers.org/articles/were-arguing-about-ai-safety-wrong">&#8216;dynamism&#8217; vs. &#8216;stasis&#8217; is a clearer lens for criticizing controversial AI safety prescriptions.</a></p></li><li><p>Philip Tschirhart and Nick Stockton write that <a href="https://www.ai-frontiers.org/articles/can-the-us-prevent-agi-from-being-stolen">securing AI weights from foreign adversaries would require a level of security never seen before.</a></p></li></ul><p>See also: <a href="https://www.safe.ai/">CAIS website</a>, <a href="https://x.com/ai_risks?lang=en">X account for CAIS</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-54-openai-updates?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://newsletter.safe.ai/p/ai-safety-newsletter-54-openai-updates?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #53: An Open Letter Attempts to Block OpenAI Restructuring]]></title><description><![CDATA[Plus, SafeBench Winners]]></description><link>https://newsletter.safe.ai/p/an-open-letter-attempts-to-block</link><guid isPermaLink="false">https://newsletter.safe.ai/p/an-open-letter-attempts-to-block</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 29 Apr 2025 15:11:16 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-8ts!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the<a href="https://www.safe.ai/"> Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: Experts and ex-employees urge the Attorneys General of California and Delaware to block OpenAI&#8217;s for-profit restructure; CAIS announces the winners of its safety benchmarking competition.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>An Open Letter Attempts to Block OpenAI Restructuring</h1><p>A group of former OpenAI employees and independent experts published <a href="https://notforprivategain.org/">an open letter</a> urging the Attorneys General (AGs) of California (where OpenAI operates) and Delaware (where OpenAI is incorporated) to block OpenAI&#8217;s planned restructuring into a for-profit entity. The letter argues the move would fundamentally undermine the organization's charitable mission by jeopardizing the governance safeguards designed to protect control over AGI from profit motives.</p><p><strong>OpenAI was <a href="https://openai.com/charter/">founded</a> with the charitable purpose to ensure that artificial general intelligence benefits all of humanity. </strong>OpenAI&#8217;s original nonprofit structure, and later its capped-profit model, were designed to control profit motives in the development of AGI, which OpenAI defines as "highly autonomous systems that outperform humans at most economically valuable work." The structure was designed to prevent profit motives from incentivizing OpenAI to take risky development decisions and divert much of the wealth produced by AGI to private shareholders.</p><p><strong>The proposed restructuring into a Public Benefit Corporation (PBC) would dismantle the governance safeguards OpenAI originally championed. </strong>The letter highlights that the proposed restructuring would transfer control away from the nonprofit entity&#8211;whose primary fiduciary duty is to humanity&#8211;to a for-profit board whose directors would be partly beholden to shareholder interests. The authors detail several specific safeguards currently in place that would be undermined or eliminated:</p><ul><li><p><strong>Subordination of Profit Motives:</strong> Currently, the nonprofit's mission takes precedence over any obligation to generate profit for OpenAI&#8217;s for-profit subsidiary. A PBC structure, however, would require balancing the public benefit mission with shareholder pecuniary interests.</p></li><li><p><strong>Nonprofit Fiduciary Duty: </strong>The nonprofit board currently has a legally enforceable fiduciary duty to advance the charitable purpose, with the AGs empowered to enforce this duty on behalf of the public. The proposed structure would eliminate this direct accountability to the public interest.</p></li><li><p><strong>Capped Investor Profits: </strong>The current "capped-profit" model ensures that returns beyond a certain cap flow back to the nonprofit for humanity's benefit. <a href="https://www.reuters.com/technology/artificial-intelligence/openai-lays-out-plan-shift-new-for-profit-structure-2024-12-27/">Reports suggest</a> the restructuring may eliminate this cap, potentially reallocating immense value away from the public good.</p></li><li><p><strong>Independent Board: </strong>OpenAI previously committed to a majority-independent board for the nonprofit, with limitations on financial stakes and voting on potential conflicts. The letter notes it is unknown if the new PBC structure would retain this commitment.</p></li><li><p><strong>AGI Belongs to Humanity: </strong>The nonprofit, not investors, is designated to govern AGI technologies once developed. The restructuring would likely shift AGI ownership to the for-profit PBC and its investors.</p></li><li><p><strong>Stop-and-Assist Commitment: </strong>OpenAI&#8217;s Charter includes a commitment to cease competition and assist other aligned, safety-conscious projects nearing AGI. It is unclear if the PBC would honor this commitment.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-8ts!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-8ts!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 424w, https://substackcdn.com/image/fetch/$s_!-8ts!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 848w, https://substackcdn.com/image/fetch/$s_!-8ts!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 1272w, https://substackcdn.com/image/fetch/$s_!-8ts!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-8ts!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png" width="1456" height="1269" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1269,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-8ts!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 424w, https://substackcdn.com/image/fetch/$s_!-8ts!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 848w, https://substackcdn.com/image/fetch/$s_!-8ts!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 1272w, https://substackcdn.com/image/fetch/$s_!-8ts!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9c22c79-f9b2-4fb5-af77-5626e122434f_1600x1394.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The letter concludes by asking the Attorneys General of California and Delaware to halt the restructuring and protect OpenAI&#8217;s charitable mission.</strong> The authors argue that transferring control of potentially the most powerful technology ever created to a for-profit entity fundamentally contradicts OpenAI's charitable obligations. They urge the AGs to use their authority to investigate the proposed changes and ensure that the governance structures prioritizing public benefit over private gain remain intact.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>SafeBench Winners</h1><p>CAIS recently concluded its SafeBench competition, which <a href="https://www.mlsafety.org/safebench/winners">awarded prizes</a> for new benchmarks for assessing and reducing risks from AI. Sponsored by Schmidt Sciences, the competition awarded $250,000 across eight winning submissions.</p><p>The competition focused on four key areas&#8212;Robustness, Monitoring, Alignment, and Safety Applications&#8212;attracting nearly one hundred submissions. A panel of judges evaluated submissions based on the clarity of safety assessment, the potential benefit of progress on the benchmark, and the ease of evaluating measurements.</p><p><strong>Three Benchmarks Awarded First Prize</strong>. Three submissions each received first prizes of $50,000 each for their applicability to frontier models, relevance to current safety challenges, and use of large datasets.</p><ul><li><p><a href="https://cybench.github.io">Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models</a> tackles the assessment of language models in cybersecurity. It includes forty professional-level Capture the Flag (CTF) tasks across six common categories like web security and cryptography. The benchmark has already been utilized by the US AI Safety Institute (AISI), UK AISI, and Anthropic for evaluating frontier models.</p></li><li><p><a href="https://agentdojo.spylab.ai">AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents</a> provides an extensible environment for assessing agents that use tools over untrusted data. It features ninety-seven realistic tasks and over six hundred security test cases, alongside various attack and defense paradigms, offering a dynamic platform adaptable to future agent developments.</p></li><li><p><a href="https://bboylyg.github.io/backdoorllm-website.github.io/">BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models</a> systematically evaluates diverse backdoor attack strategies&#8212;such as data poisoning and chain-of-thought attacks&#8212;across numerous experiments, scenarios, and model architectures. This provides a baseline for understanding current model vulnerabilities and developing future defenses.</p></li></ul><p><strong>Five Benchmarks Recognized with Second Prize</strong>. Five additional submissions were awarded $20,000 each for their innovative approaches to evaluating specific AI safety risks.</p><ul><li><p><a href="https://arxiv.org/abs/2503.17332">CVE-Bench: A Benchmark for AI Agents&#8217; Ability to Exploit Real-World Web Application Vulnerabilities</a> evaluates AI agents using forty critical-severity Common Vulnerability and Exposures (CVE) from the National Vulnerability Database. It features a sandbox framework mimicking real-world conditions for effective exploit evaluation.</p></li><li><p><a href="https://eddyluo1232.github.io/JailBreakV28K/">JailBreakV: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks</a> investigates the transferability of text-based jailbreak techniques to multimodal large language models (MLLMs). It uses 20,000 text prompts and 8,000 image inputs to highlight the unique challenges posed by multimodality.</p></li><li><p><a href="https://arxiv.org/abs/2405.05466">Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals</a> presents a testbed of 324 fine-tuned LLM pairs&#8212;one consistently aligned, the other deceptively misaligned&#8212;to evaluate strategies for detecting alignment faking using only model internals, potentially offering a valuable monitoring tool.</p></li><li><p><a href="https://situational-awareness-dataset.org">Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs</a> tests the situational awareness of LLMs through over 13,000 questions across seven task categories. It probes abilities like self-recognition and instruction following based on self-knowledge, crucial for understanding risks from emerging capabilities.</p></li><li><p><a href="https://www.biorxiv.org/content/10.1101/2024.08.21.608694v3">BioLP-bench: Measuring understanding of biological lab protocols by large language models</a> assesses LLMs' ability to identify and correct errors in biological lab protocols. This benchmark addresses the dual-use nature of these capabilities and the need to understand biosecurity risks before model deployment.</p></li></ul><p>These benchmarks provide crucial tools for understanding the progress of AI, evaluating risks, and ultimately reducing potential harms. The papers, code, and datasets for all winning benchmarks are publicly available for further research and use. CAIS hopes to see future work which is inspired by or builds on these submissions.</p><h1>Other News</h1><p><strong>Government</strong></p><ul><li><p>The White House announced an initiative and task force to advance <a href="https://www.whitehouse.gov/presidential-actions/2025/04/advancing-artificial-intelligence-education-for-american-youth/">AI education for American youth</a>.</p></li><li><p>The UK's AI Safety Institute introduced <a href="https://www.aisi.gov.uk/work/replibench-measuring-autonomous-replication-capabilities-in-ai-systems">RepliBench</a>, a benchmark for measuring autonomous replication capabilities in AI systems.</p></li></ul><p><strong>Research and Opinion</strong></p><ul><li><p>Gladstone AI released a report arguing for the strategic necessity and challenges of establishing a U.S.<a href="https://superintelligence.gladstone.ai/"> national superintelligence project</a>. (We still think that&#8217;s a <a href="https://www.nationalsecurity.ai/">bad</a> <a href="https://www.ai-frontiers.org/articles/why-racing-to-artificial-superintelligence-would-undermine-americas-national-security">idea</a>.)</p></li><li><p>Pew Research found <a href="https://www.pewresearch.org/internet/2025/04/03/how-the-us-public-and-ai-experts-view-artificial-intelligence/">both US AI experts and the public prefer more control and regulation of AI development</a>.</p></li><li><p>A paper found that <a href="https://www.pnas.org/doi/10.1073/pnas.2419055122">existential risk narratives about AI do not distract from its immediate harms</a>.</p></li></ul><p><strong>AI Frontiers</strong></p><ul><li><p>Contributing Writer Vanessa Bates Ramirez covers how <a href="https://www.ai-frontiers.org/articles/ai-companies-want-to-give-you-a-new-job">AI might reshape work&#8212;and whether the jobs of the future are ones we&#8217;ll actually want.</a></p></li><li><p>Independent AI policy researchers Miles Brundage and Grace Werner argue that <a href="https://newsletter.safe.ai/p/ai-safety-newsletter-52-an-expert">President Trump can and should strike an &#8220;AI deal&#8221; with China to preserve international security.</a></p></li></ul><p><br>See also: <a href="https://www.safe.ai/">CAIS website</a>, <a href="https://x.com/ai_risks?lang=en">X account for CAIS</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/an-open-letter-attempts-to-block?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/an-open-letter-attempts-to-block?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item><item><title><![CDATA[AI Safety Newsletter #52: An Expert Virology Benchmark]]></title><description><![CDATA[Plus, AI-Enabled Coups]]></description><link>https://newsletter.safe.ai/p/ai-safety-newsletter-52-an-expert</link><guid isPermaLink="false">https://newsletter.safe.ai/p/ai-safety-newsletter-52-an-expert</guid><dc:creator><![CDATA[Corin Katzke]]></dc:creator><pubDate>Tue, 22 Apr 2025 16:08:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to the AI Safety Newsletter by the <a href="https://www.safe.ai/">Center for AI Safety</a>. We discuss developments in AI and AI safety. No technical background required.</p><p>In this edition: AI now outperforms human experts in specialized virology knowledge in a new benchmark; A new report explores the risk of AI-enabled coups.</p><p>Listen to the AI Safety Newsletter for free on <a href="https://spotify.link/E6lHa1ij2Cb">Spotify</a> or <a href="https://podcasts.apple.com/us/podcast/ai-safety-newsletter/id1702875110">Apple Podcasts</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>An Expert Virology Benchmark</h1><p>A team of researchers (primarily from SecureBio and CAIS) has developed the <a href="https://www.virologytest.ai/vct_paper.pdf">Virology Capabilities Test</a> (VCT), a benchmark that measures an AI system's ability to troubleshoot complex virology laboratory protocols. Results on this benchmark suggest that AI has surpassed human experts in practical virology knowledge.<br><br><strong>VCT measures practical virology knowledge, which has high dual-use potential.</strong> While AI virologists could accelerate beneficial research in virology and infectious disease prevention, bad actors could misuse the same capabilities to develop dangerous pathogens. Like <a href="https://www.wmdp.ai/">the WMDP benchmark</a>, the VCT is designed to evaluate practical dual-use scientific knowledge&#8212;in this case, virology.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fjcu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fjcu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 424w, https://substackcdn.com/image/fetch/$s_!Fjcu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 848w, https://substackcdn.com/image/fetch/$s_!Fjcu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 1272w, https://substackcdn.com/image/fetch/$s_!Fjcu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fjcu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png" width="1456" height="1311" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1311,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Fjcu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 424w, https://substackcdn.com/image/fetch/$s_!Fjcu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 848w, https://substackcdn.com/image/fetch/$s_!Fjcu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 1272w, https://substackcdn.com/image/fetch/$s_!Fjcu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F570a4466-7195-40b5-bdae-0bf3853676fc_1600x1441.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The benchmark consists of 322 multimodal questions covering practical virology knowledge essential for laboratory work. Unlike existing benchmarks, these questions were deliberately designed to be "Google-proof"&#8212;requiring tacit knowledge that cannot be easily found through web searches. The questions were created and validated by PhD-level virologists and cover fundamental, tacit, and visual knowledge needed for practical work in virology labs. </p><p><strong>Most leading AI models have already surpassed human experts in specialized virology knowledge.</strong> All but one frontier model outperformed human experts. The highest performing model, OpenAI&#8217;s o3, achieved 43.8% accuracy on the benchmark, significantly greater than the human expert average of 22.1%. Leading models even outperform human virologists in their specific area of expertise&#8212;for example, o3 outperformed 94% of virologists in subsets of questions representing their specific areas of expertise.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PYCB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PYCB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 424w, https://substackcdn.com/image/fetch/$s_!PYCB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 848w, https://substackcdn.com/image/fetch/$s_!PYCB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 1272w, https://substackcdn.com/image/fetch/$s_!PYCB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PYCB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png" width="1456" height="876" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:876,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PYCB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 424w, https://substackcdn.com/image/fetch/$s_!PYCB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 848w, https://substackcdn.com/image/fetch/$s_!PYCB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 1272w, https://substackcdn.com/image/fetch/$s_!PYCB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ed14d08-eef1-46ce-9e37-f697fdf5932e_1600x963.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Publicly available AI systems should not have highly dual-use virology capabilities.</strong> The authors recommend that highly dual virology capabilities should be excluded from publicly-available systems, and know-your-customer mechanisms could ensure these capabilities remain accessible to researchers working in institutions with appropriate safety protocols.</p><p>They argue that &#8220;an AI&#8217;s ability to provide expert-level troubleshooting on highly dual-use methods should itself be considered a highly dual-use technology&#8221;&#8212;a standard that the paper shows already applies to many frontier AI systems. As a result of the paper, xAI has added new safeguards to their systems.<br><br>For more analysis, we also recommend reading Dan Hendrycks&#8217; and Laura Hiscott&#8217;s <a href="https://ai-frontiers.org/articles/ais-are-disseminating-expert-level-virology-skills">article in </a><em><a href="https://ai-frontiers.org/articles/ais-are-disseminating-expert-level-virology-skills">AI Frontiers</a></em><a href="https://ai-frontiers.org/articles/ais-are-disseminating-expert-level-virology-skills"> discussing implications of VCT</a>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/subscribe?"><span>Subscribe now</span></a></p><h1>AI-Enabled Coups</h1><p>Researchers at the nonprofit <a href="https://www.forethought.org">Forethought</a> have published a <a href="https://www.forethought.org/research/ai-enabled-coups-how-a-small-group-could-use-ai-to-seize-power">report</a> on how small groups could use artificial intelligence to seize power. It discusses AI&#8217;s coup-enabling capabilities, factors increasing coup risk, potential pathways for AI-enabled coups, and possible mitigations.</p><p><strong>AI may soon have coup-enabling capabilities. </strong>Future AI systems could surpass human experts in areas such as weapons development, controlling military systems, strategic planning, public administration, persuasion, and cyber offense. Frontier AI companies or governments could run millions of copies of these systems, each operating orders of magnitude faster than the human brain, 24/7. An organization with unilateral control of one of these systems could, even without broad popular or military support, seize control of a state.</p><p><strong>Risk factors for an AI-enabled coup. </strong>The report identifies three key risk factors that could increase the likelihood of an AI-enabled coup.</p><ul><li><p>First, AI systems could be overtly designed with singular loyalty to specific leaders, bypassing traditional chains of command and enabling those leaders to act unilaterally.</p></li><li><p>Second, AI systems could be designed with secret loyalties that are undetectable until it is too late, allowing them to assist coup plotters covertly.</p></li><li><p>Third, exclusive access to the most powerful, coup-enabling AI capabilities could become concentrated within a small number of AI development projects or even among a few key individuals within those projects.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nEfM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nEfM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 424w, https://substackcdn.com/image/fetch/$s_!nEfM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 848w, https://substackcdn.com/image/fetch/$s_!nEfM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 1272w, https://substackcdn.com/image/fetch/$s_!nEfM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nEfM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png" width="1456" height="1116" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1116,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nEfM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 424w, https://substackcdn.com/image/fetch/$s_!nEfM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 848w, https://substackcdn.com/image/fetch/$s_!nEfM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 1272w, https://substackcdn.com/image/fetch/$s_!nEfM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a6551e-901f-4a72-9af1-2db6e168ce3b_1508x1156.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Concrete paths to an AI enabled coup. </strong>The report outlines two families of potential scenarios for how an AI-enabled coup could occur.</p><ul><li><p>One path involves the misuse of widely deployed military AI systems. Staging a coup currently requires support from human soldiers, but a small group that controls autonomous, advanced military AI systems could stage a military coup on their own.</p></li><li><p>Another path involves AI assisting in the processes leading to a democratic backslide&#8212;for example, by expanding state authority, replacing bureaucrats with loyal AI systems, targeted propaganda, and information control.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lo7H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lo7H!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 424w, https://substackcdn.com/image/fetch/$s_!lo7H!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 848w, https://substackcdn.com/image/fetch/$s_!lo7H!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 1272w, https://substackcdn.com/image/fetch/$s_!lo7H!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lo7H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png" width="1456" height="1113" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/70e50853-3b57-4275-92b3-08c437938175_1600x1223.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1113,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lo7H!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 424w, https://substackcdn.com/image/fetch/$s_!lo7H!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 848w, https://substackcdn.com/image/fetch/$s_!lo7H!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 1272w, https://substackcdn.com/image/fetch/$s_!lo7H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70e50853-3b57-4275-92b3-08c437938175_1600x1223.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Mitigations.</strong> To counter these risks, the report proposes several mitigation strategies focusing on establishing clear rules and technical enforcement mechanisms.</p><ul><li><p>Establishing these rules will involve creating robust oversight bodies, clear legal frameworks defining legitimate AI use, and promoting transparency around AI capabilities and deployments.</p></li><li><p>Technical measures include developing robust guardrails to prevent misuse, implementing strong cybersecurity for AI systems, and designing AI command structures that require consensus or multi-party authorization for critical actions.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_3R2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_3R2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 424w, https://substackcdn.com/image/fetch/$s_!_3R2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 848w, https://substackcdn.com/image/fetch/$s_!_3R2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 1272w, https://substackcdn.com/image/fetch/$s_!_3R2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_3R2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png" width="1456" height="692" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65c50d63-f694-4e50-b713-d11384af9822_1482x704.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:692,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_3R2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 424w, https://substackcdn.com/image/fetch/$s_!_3R2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 848w, https://substackcdn.com/image/fetch/$s_!_3R2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 1272w, https://substackcdn.com/image/fetch/$s_!_3R2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65c50d63-f694-4e50-b713-d11384af9822_1482x704.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The report concludes that the risk of AI-enabled coups is alarmingly high. Unfortunately, this fact has the potential to become a self-fulfilling prophecy&#8212;even actors with good intentions might be tempted to seize power in order to prevent &#8220;bad actors&#8221; from doing so first.<br><br>However, there&#8217;s also reason to believe mitigation measures will be effective and politically tractable. Behind the &#8216;veil of ignorance&#8217; as to who will be in a position of power to take advantage of coup-enabling AI, it&#8217;s in everyone&#8217;s best interest to make sure no one can.</p><h1>Other news</h1><p><strong>Industry</strong></p><ul><li><p>OpenAI released <a href="https://openai.com/index/introducing-o3-and-o4-mini/">o3 and o4-mini</a>, two new frontier reasoning models.</p></li><li><p>Google released the <a href="https://storage.googleapis.com/model-cards/documents/gemini-2.5-pro-preview.pdf">model card for Gemini 2.5 Pro</a>, though with <a href="https://techcrunch.com/2025/04/17/googles-latest-ai-model-report-lacks-key-safety-details-experts-say/">minimal details about safety testing</a>.</p></li><li><p>Several Epoch AI employees <a href="https://www.mechanize.work/">left to found a startup for automating jobs</a> to bring about explosive growth from AI &#8220;as soon as possible.&#8221; This departure follows the revelation in January that <a href="https://fortune.com/2025/01/21/eye-on-ai-openai-o3-math-benchmark-frontiermath-epoch-altman-trump-biden/">OpenAI owns Epoch&#8217;s FrontierMath Benchmark</a>, unbeknownst to dataset contributors.</p></li><li><p>TSMC says that it will eventually produce <a href="https://www.taiwannews.com.tw/news/6087924">30% of sub 2-nm chips in its US plants</a>.</p></li></ul><p><strong>Government</strong></p><ul><li><p>The US Government informed Nvidia that <a href="https://techcrunch.com/2025/04/15/nvidia-h20-chip-exports-hit-with-license-requirement-by-us-government/">it will need a license to export H20 chips to China</a>.</p></li><li><p>The White House OSTP <a href="https://x.com/deanwball/status/1912253821272682567">hired Dean Ball</a> as Senior Policy Advisor on AI and Emerging Technology.</p></li><li><p>White House &#8216;AI and Crypto Czar&#8217; David Sack <a href="https://www.businessinsider.com/david-sacks-nvidia-export-controls-crackdown-bis-more-funding-2025-4">argued that BIS should receive more funding</a> to close AI export control loopholes.</p></li><li><p>US Senator Mike Rounds introduced <a href="https://www.rounds.senate.gov/newsroom/press-releases/rounds-introduces-legislation-to-prevent-smuggling-of-american-ai-chips-into-china">legislation to create a whistleblower incentive program at BIS</a> to better detect AI chip smuggling to China.</p></li></ul><p><strong>AI Frontiers</strong></p><ul><li><p>Kevin Frazier and Graham Hardig argue that <a href="https://www.ai-frontiers.org/articles/ai-displacement-insurance">we need a new kind of insurance for AI job loss.</a></p></li><li><p>Stephen Casper and Laura Hiscott argue that <a href="https://www.ai-frontiers.org/articles/smokescreen-how-bad-evidence-is-preventing-ai-safety">corporate capture of AI research&#8212;echoing the days of Big Tobacco&#8212;thwarts sensible policymaking</a>.</p></li><li><p>Dan Hendrycks and Laura Hiscott discuss the <a href="https://ai-frontiers.org/articles/ais-are-disseminating-expert-level-virology-skills">risk factors behind AI biothreats and the importance of friction</a>.</p></li></ul><p>See also: <a href="https://www.safe.ai/">CAIS website</a>, <a href="https://x.com/ai_risks?lang=en">X account for CAIS</a>, our paper on <a href="https://www.nationalsecurity.ai/">superintelligence strategy</a>, our <a href="https://www.aisafetybook.com/">AI safety course</a>, and <a href="http://ai-frontiers.org/">AI Frontiers</a>, a new platform for expert commentary and analysis.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://newsletter.safe.ai/p/ai-safety-newsletter-52-an-expert?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://newsletter.safe.ai/p/ai-safety-newsletter-52-an-expert?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p>]]></content:encoded></item></channel></rss>