Podcast AI Safety Newsletter by Center for AI Safety

Podcast details

Technical and general information from the podcast's RSS feed.

Site

RSS

Apple

Recent rankings

Latest chart positions across Apple Podcasts and Spotify rankings.

Shared links between episodes and podcasts

Links found in episode descriptions and other podcasts that share them.

See all

https://substackcdn.com/image/fetch/
1261 shares
https://substackcdn.com/image/fetch/w_1456
769 shares
https://substackcdn.com/image/fetch/w_848
341 shares

RSS feed quality and score

Technical evaluation of the podcast's RSS feed quality and structure.

See all

Publication history

Monthly episode publishing history over the past years.

Latest published episodes

Recent episodes with titles, durations, and descriptions.

See all

AISN #60: The AI Action Plan

jeudi 31 juillet 2025 • Duration 15:41

Also: ChatGPT Agent and IMO Gold.

In this edition: The Trump Administration publishes its AI Action Plan; OpenAI released ChatGPT Agent and announced that an experimental model achieved gold medal-level performance on the 2025 International Mathematical Olympiad.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

The AI Action Plan

On the 23rd, the White House released its AI Action Plan. The document is the outcome of a January executive order that required the President's Science Advisor, ‘AI and Crypto Czar’, and National Security Advisor (currently Michael Kratsios, David Sacks, and Marco Rubio) to submit a plan to “sustain and enhance America's global AI dominance in order to promote human flourishing, economic competitiveness, and national security.” President Trump also delivered an hour-long speech on the plan, and signed three executive orders beginning to implement some of its policies.

Trump displaying an executive order at the [...]

---

Outline:

(00:34) The AI Action Plan

(07:36) ChatGPT Agent and IMO Gold

(12:48) In Other News

---

First published:
July 31st, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-60-the-ai-action

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/$s_!_NBd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39879533-bbcb-4b77-a1b9-67d248591bf5_1446x852.png https://substackcdn.com/image/fetch/$s_!YR3_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32c045cf-daf7-4254-8cdc-4dd861f2c397_884x802.png https://substackcdn.com/image/fetch/$s_!yeVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf95488b-7af9-4342-aec3-fddfd3b5ee7c_1400x933.png

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

AISN #59: EU Publishes General-Purpose AI Code of Practice

mardi 15 juillet 2025 • Duration 09:23

Plus: Meta Superintelligence Labs.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

In this edition: The EU published a General-Purpose AI Code of Practice for AI providers, and Meta is spending billions revamping its superintelligence development efforts.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

EU Publishes General-Purpose AI Code of Practice

In June 2024, the EU adopted the AI Act, which remains the world's most significant law regulating AI systems. The Act bans some uses of AI like social scoring and predictive policing and limits other “high risk” uses such as generating credit scores or evaluating educational outcomes. It also regulates general-purpose AI (GPAI) systems, imposing transparency requirements, copyright protection policies, and safety and security standards for models that pose systemic risk (defined as those trained [...]

---

Outline:

(00:31) EU Publishes General-Purpose AI Code of Practice

(04:50) Meta Superintelligence Labs

(06:17) In Other News

---

First published:
July 15th, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-59-eu-publishes

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/$s_!glEy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd30e7d8d-65ae-4c7c-aa81-f7e56c8b8c96_1360x966.png

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

AISN #50: AI Action Plan Responses

lundi 31 mars 2025 • Duration 12:25

Plus, Detecting Misbehavior in Reasoning Models.

In this newsletter, we cover AI companies’ responses to the federal government's request for information on the development of an AI Action Plan. We also discuss an OpenAI paper on detecting misbehavior in reasoning models by monitoring their chains of thought.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

On January 23, President Trump signed an executive order giving his administration 180 days to develop an “AI Action Plan” to “enhance America's global AI dominance in order to promote human flourishing, economic competitiveness, and national security.”

Despite the rhetoric of the order, the Trump administration has yet to articulate many policy positions with respect to AI development and safety. In a recent interview, Ben Buchanan—Biden's AI advisor—interpreted the executive order as giving the administration time to develop its AI policies. The AI Action Plan will therefore likely [...]

---

First published:
March 31st, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-50-ai-action

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6544cd82-9ba4-472a-8183-d108be2c86ac_1537x675.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fcfe4f8-9b5c-4ce4-9611-683a441c230b_1600x956.png

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

AISN #49: AI Action Plan Responses

lundi 31 mars 2025 • Duration 12:25

Plus, Detecting Misbehavior in Reasoning Models.

In this newsletter, we cover AI companies’ responses to the federal government's request for information on the development of an AI Action Plan. We also discuss an OpenAI paper on detecting misbehavior in reasoning models by monitoring their chains of thought.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

On January 23, President Trump signed an executive order giving his administration 180 days to develop an “AI Action Plan” to “enhance America's global AI dominance in order to promote human flourishing, economic competitiveness, and national security.”

Despite the rhetoric of the order, the Trump administration has yet to articulate many policy positions with respect to AI development and safety. In a recent interview, Ben Buchanan—Biden's AI advisor—interpreted the executive order as giving the administration time to develop its AI policies. The AI Action Plan will therefore likely [...]

---

First published:
March 31st, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-49-ai-action

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2fcfe4f8-9b5c-4ce4-9611-683a441c230b_1600x956.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6544cd82-9ba4-472a-8183-d108be2c86ac_1537x675.png

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

AISN

jeudi 6 mars 2025 • Duration 11:31

Plus, Measuring AI Honesty.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this newsletter, we discuss two recent papers: a policy paper on national security strategy, and a technical paper on measuring honesty in AI systems.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

Superintelligence Strategy

CAIS director Dan Hendrycks, former Google CEO Eric Schmidt, and Scale AI CEO Alexandr Wang have authored a new paper, Superintelligence Strategy. The paper (and an in-depth expert version) argues that the development of superintelligence—AI systems that surpass humans in nearly every domain—is inescapably a matter of national security.

In this story, we introduce the paper's three-pronged strategy for national security in the age of advanced AI: deterrence, nonproliferation, and competitiveness.

Deterrence

The simultaneous power and danger of superintelligence presents [...]

---

Outline:

(00:20) Superintelligence Strategy

(01:09) Deterrence

(02:41) Nonproliferation

(04:04) Competitiveness

(05:33) Measuring AI Honesty

(09:24) Links

---

First published:
March 6th, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-49-superintelligence

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9ac746a-e95a-47f6-9d7a-2bb63ddcf744_1600x768.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4455070d-25de-4786-8540-3b221b8976dd_1600x876.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37b8b6f7-3ac8-41e2-a5b4-3cc7ed902c3e_1600x725.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ac21dab-6473-4436-880b-da868c9e5d9b_1600x738.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b74ae32-76b8-430f-92c9-2cf86e1ba710_1600x900.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4a71e77-48c9-49a6-a757-8cdbc28d19e8_1600x720.png

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Superintelligence Strategy: Expert Version

mercredi 5 mars 2025 • Duration

Superintelligence is destabilizing since it threatens other states’ survival—it could be weaponized, or states may lose control of it. Attempts to build superintelligence may face threats by rival states—creating a deterrence regime called Mutual Assured AI Malfunction (MAIM). In this paper, Dan Hendrycks, Eric Schmidt, and Alexandr Wang detail a strategy—focused on deterrence, nonproliferation, and competitiveness—for nations to navigate the risks of superintelligence.

Superintelligence Strategy: Standard Version

mercredi 5 mars 2025 • Duration

Superintelligence is destabilizing since it threatens other states’ survival—it could be weaponized, or states may lose control of it. Attempts to build superintelligence may face threats by rival states—creating a deterrence regime called Mutual Assured AI Malfunction (MAIM). In this paper, Dan Hendrycks, Eric Schmidt, and Alexandr Wang detail a strategy—focused on deterrence, nonproliferation, and competitiveness—for nations to navigate the risks of superintelligence.

AISN #48: Utility Engineering and EnigmaEval

mardi 18 février 2025 • Duration 08:56

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

In this newsletter, we explore two recent papers from CAIS. We’d also like to highlight that CAIS is hiring for editorial and writing roles, including for a new online platform for journalism and analysis regarding AI's impacts on national security, politics, and economics.

Utility Engineering

A common view is that large language models (LLMs) are highly capable but fundamentally passive tools, shaping their responses based on training data without intrinsic goals or values. However, a new paper from the Center for AI Safety challenges this assumption, showing that LLMs exhibit coherent and structured value systems.

Structured preferences emerge with scale. The paper introduces Utility Engineering, a framework for analyzing and controlling AI [...]

---

Outline:

(00:26) Utility Engineering

(04:48) EnigmaEval

---

First published:
February 18th, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-48-utility-engineering

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fbfb7e4-413d-4552-ad61-2dd0ccd7d309_1600x1223.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ea44b62-5e2b-43de-b9de-02ee70db25ef_1600x576.png https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67fb642-4cce-463b-aed5-26777d393977_1600x588.jpeg https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f8e4ae6-7a37-4377-9f3d-a41efb1cbd7b_1072x782.jpeg

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

AISN #47: Reasoning Models

jeudi 6 février 2025 • Duration 09:00

Plus, State-Sponsored AI Cyberattacks.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

Reasoning Models

DeepSeek-R1 has been one of the most significant model releases since ChatGPT. After its release, the DeepSeek's app quickly rose to the top of Apple's most downloaded chart and NVIDIA saw a 17% stock decline. In this story, we cover DeepSeek-R1, OpenAI's o3-mini and Deep Research, and the policy implications of reasoning models.

DeepSeek-R1 is a frontier reasoning model. DeepSeek-R1 builds on the company's previous model, DeepSeek-V3, by adding reasoning capabilities through reinforcement learning training. R1 exhibits frontier-level capabilities in mathematics, coding, and scientific reasoning—comparable to OpenAI's o1. DeepSeek-R1 also scored 9.4% on Humanity's Last Exam—at the time of its release, the highest of any publicly available system.

DeepSeek reports spending only about $6 million on the computing power needed to train V3—however, that number doesn’t include the full [...]

---

Outline:

(00:13) Reasoning Models

(04:58) State-Sponsored AI Cyberattacks

(06:51) Links

---

First published:
February 6th, 2025

Source:
https://newsletter.safe.ai/p/ai-safety-newsletter-47-reasoning

---

Want more? Check out our ML Safety Newsletter for technical safety research.

Narrated by TYPE III AUDIO.

---

Images from the article:

https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F872ba487-5b6a-484d-a542-4173781925fd_1600x1170.png

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

AISN #46: The Transition

jeudi 23 janvier 2025 • Duration 11:20

Plus, Humanity's Last Exam, and the AI Safety, Ethics, and Society Course.

Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts.

The Transition

The transition from the Biden to Trump administrations saw a flurry of executive activity on AI policy, with Biden signing several last-minute executive orders and Trump revoking Biden's 2023 executive order on AI risk. In this story, we review the state of play.

Trump signing first-day executive orders. Source.

The AI Diffusion Framework. The final weeks of the Biden Administration saw three major actions related to AI policy. First, the Bureau of Industry and Security released its Framework for Artificial Intelligence Diffusion, which updates the US’ AI-related export controls. The rule establishes three tiers of countries 1) US allies, 2) most other countries, and 3) arms-embargoed countries.