Back

Explore every episode of the podcast Joe Carlsmith Audio

Dive into the complete episode list for Joe Carlsmith Audio. Each episode is cataloged with detailed descriptions, making it easy to find and explore specific topics. Keep track of all episodes from your favorite podcast and never miss a moment of insightful content.

Rows per page:

1–50 of 71

TitlePub. DateDuration
Introduction and summary for "Otherness and control in the age of AGI"21 Jun 202400:12:23

This is the introduction and summary for my series "Otherness and control in the age of AGI." 

Text version here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

Second half of full audio for "Otherness and control in the age of AGI"18 Jun 202404:11:02

Second half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power. 

First half here: https://joecarlsmithaudio.buzzsprout.com/2034731/15266490-first-half-of-full-audio-for-otherness-and-control-in-the-age-of-agi

PDF of the full series here: https://jc.gatspress.com/pdf/otherness_full.pdf
Summary of the series here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

When "yang" goes wrong08 Jan 202400:21:32

On the connection between deep atheism and seeking control. 

Text version here: https://joecarlsmith.com/2024/01/08/when-yang-goes-wrong

This essay is part of a series of essays called "Otherness and control in the age of AGI." I'm hoping the individual essays can be read fairly well on their own, but see here for brief summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

Deep atheism and AI risk04 Jan 202400:46:59

On a certain kind of fundamental mistrust towards Nature. 

Text version here: https://joecarlsmith.com/2024/01/04/deep-atheism-and-ai-risk

This is the second essay in my series “Otherness and control in the age of AGI. I’m hoping that the individual essays can be read fairly well on their own, but see here for brief summaries of the essays released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

Gentleness and the artificial Other02 Jan 202400:22:39

AIs as fellow creatures. And on getting eaten. 

Link: https://joecarlsmith.com/2024/01/02/gentleness-and-the-artificial-other

This is the first essay in a series of essays that I’m calling “Otherness and control in the age of AGI.” See here for more about the series as a whole: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi.

In search of benevolence (or: what should you get Clippy for Christmas?)27 Dec 202300:52:52

What is altruism towards a paperclipper? Can you paint with all the colors of the wind at once? 

(This is a recording of an essay originally published in 2021. Text here: https://joecarlsmith.com/2021/07/19/in-search-of-benevolence-or-what-should-you-get-clippy-for-christmas)

Empirical work that might shed light on scheming (Section 6 of "Scheming AIs")16 Nov 202300:28:00

This is section 6 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Summing up "Scheming AIs" (Section 5)16 Nov 202300:15:46

This is section 5 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Speed arguments against scheming (Section 4.4-4.7 of "Scheming AIs")16 Nov 202300:15:19

This is section 4.4 through 4.7 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Simplicity arguments for scheming (Section 4.3 of "Scheming AIs")16 Nov 202300:19:37

This is section 4.3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

The counting argument for scheming (Sections 4.1 and 4.2 of "Scheming AIs")16 Nov 202300:10:40

This is sections 4.1 and 4.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Arguments for/against scheming that focus on the path SGD takes (Section 3 of "Scheming AIs")16 Nov 202300:29:03

This is section 3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

First half of full audio for "Otherness and control in the age of AGI"17 Jun 202403:07:29

First half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power. 

Second half here: https://joecarlsmithaudio.buzzsprout.com/2034731/15272132-second-half-of-full-audio-for-otherness-and-control-in-the-age-of-agi

PDF of the full series here: https://jc.gatspress.com/pdf/otherness_full.pdf
Summary of the series here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

Non-classic stories about scheming (Section 2.3.2 of "Scheming AIs")16 Nov 202300:24:34

This is section 2.3.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Does scheming lead to adequate future empowerment? (Section 2.3.1.2 of "Scheming AIs")16 Nov 202300:22:54

This is section 2.3.1.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

The goal-guarding hypothesis (Section 2.3.1.1 of "Scheming AIs")16 Nov 202300:19:11

This is section 2.3.1.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

How useful for alignment-relevant work are AIs with short-term goals? (Section 2.2.4.3 of "Scheming AIs")16 Nov 202300:09:21

This is section 2.2.4.3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Is scheming more likely if you train models to have long-term goals? (Sections 2.2.4.1-2.2.4.2 of "Scheming AIs")16 Nov 202300:09:01

This is sections 2.2.4.1-2.2.4.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

"Clean" vs. "messy" goal-directedness (Section 2.2.3 of "Scheming AIs")16 Nov 202300:16:44

This is section 2.2.3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Two sources of beyond-episode goals (Section 2.2.2 of "Scheming AIs")16 Nov 202300:21:25

This is section 2.2.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Two concepts of an "episode" (Section 2.2.1 of "Scheming AIs")16 Nov 202300:12:08

This is section 2.2.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Situational awareness (Section 2.1 of "Scheming AIs")16 Nov 202300:09:27

This is section 2.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

On "slack" in training (Section 1.5 of "Scheming AIs")16 Nov 202300:07:12

This is section 1.5 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Loving a world you don't trust17 Jun 202401:03:54

Garden, campfire, healing water.

Text version here: https://joecarlsmith.com/2024/06/18/loving-a-world-you-dont-trust

This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

Why focus on schemers in particular? (Sections 1.3-1.4 of "Scheming AIs")16 Nov 202300:31:17

This is sections 1.3-1.4 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

A taxonomy of non-schemer models (Section 1.2 of "Scheming AIs")16 Nov 202300:11:20

This is section 1.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Varieties of fake alignment (Section 1.1 of "Scheming AIs")16 Nov 202300:17:54

This is section 1.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” 

Text of the report here: https://arxiv.org/abs/2311.08379
 
Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
 
Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Full audio for "Scheming AIs: Will AIs fake alignment during training in order to get power?"15 Nov 202306:13:17

This is the full audio for my report "Scheming AIs: Will AIs fake alignment during training in order to get power?"

(I’m also posting audio for individual sections of the report on this podcast, but the ordering was getting messed up on various podcast apps, and I think some people might want one big audio file regardless, so here it is. I’m going to be posting the individual sections one by one, in the right order, over the coming days. )

Full text of the report here: https://arxiv.org/abs/2311.08379
Summary here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Introduction and summary of "Scheming AIs: Will AIs fake alignment during training in order to get power?"14 Nov 202300:56:32

This is a recording of the introductory section of my report "Scheming AIs: Will AIs fake alignment during training in order to get power?".  This section includes a summary of the full report. The summary covers most of the main points and technical terminology, and I'm hoping that it will provide much of the context necessary to understand individual sections of the report on their own. (Note: the text of the report itself may not be public by the time this episode goes live.)

In memory of Louise Glück15 Oct 202300:21:22

"It was, she said, a great discovery, albeit my real life."

On the limits of idealized values12 May 202301:00:14

Contra some meta-ethical views, you can't forever aim to approximate the self you would become in idealized conditions. You have to actively create yourself, often in the here and now. 

Originally published in 2021. Text version here: https://joecarlsmith.com/2021/06/21/on-the-limits-of-idealized-values

Predictable updating about AI risk08 May 202301:03:14

How worried about AI risk will we feel in the future, when we can see advanced machine intelligence up close? We should worry accordingly now. Text version here: https://joecarlsmith.com/2023/05/08/predictable-updating-about-ai-risk 

Existential Risk from Power-Seeking AI (shorter version)19 Mar 202300:55:03

A shorter version of my report on existential risk from power-seeking AI. Forthcoming in an essay collection from Oxford University Press. Text version here: https://jc.gatspress.com/pdf/existential_risk_and_powerseeking_ai.pdf

Problems of evil05 Mar 202300:35:42

Is everything holy? Can reality, in itself, be worthy of reverence? Text version here: https://joecarlsmith.com/2021/04/19/problems-of-evil 

On attunement25 Mar 202400:44:14

Examining a certain kind of meaning-laden receptivity to the world.

Text version here: https://joecarlsmith.com/2024/03/25/on-attunement

This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

(Though: note that I haven't put the summary post on the podcast yet.)

Seeing more whole17 Feb 202300:52:26

On looking out of your own eyes. Text version at joecarlsmith.com.

Why should ethical anti-realists do ethics?16 Feb 202300:53:29

Who needs a system if you're free? Text version at https://joecarlsmith.com/2023/02/16/why-should-ethical-anti-realists-do-ethics 

Is Power-Seeking AI an Existential Risk?25 Jan 202303:21:02

Audio version of my report on existential risk from power-seeking AI. Text here: https://arxiv.org/pdf/2206.13353.pdf. Narration by Type III audio. 

On sincerity23 Dec 202201:35:02

Nearby is the country they call life. Text version at: https://joecarlsmith.com/2022/12/23/on-sincerity

Against meta-ethical hedonism01 Dec 202201:02:29

Can the epistemology of consciousness save moral realism and redeem experience machines? No.

Against the normative realist's wager09 Oct 202200:42:48

If your find a button that gives you a hundred dollars if a certain controversial meta-ethical view is true, but you and your family get burned alive if that view is false, should you press the button? No.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

On infinite ethics05 Oct 202201:25:05

Infinities puncture the dream of a simple, bullet-biting utilitarianism. But they're everyone's problem.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

Actually possible: thoughts on Utopia05 Oct 202200:28:39

Life in the future could be profoundly good. I think this is an extremely important fact, and one that often goes under-estimated.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

Against neutrality about creating happy lives05 Oct 202200:23:19

Making happy people is good. Just ask the golden rule.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

On future people, looking back at 21st century longtermism05 Oct 202200:25:28

I find imagining future people looking back on present-day longtermism (the view that positively influencing the long-term future should be a key moral priority) a helpful intuition pump, especially re: a certain kind of “holy sh**” reaction to existential risk, and to the possible size and quality of the future at stake.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

On green21 Mar 202401:15:13

Examining a philosophical vibe that I think contrasts in interesting ways with "deep atheism."

Text version here: https://joecarlsmith.com/2024/03/21/on-green

This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

(Though: note that I haven't put the summary post on the podcast yet.)

Can you control the past?05 Oct 202201:17:03

Sometimes, you can “control” events you have no causal interaction with (for example, if you're a deterministic software twin).

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

Killing the ants05 Oct 202200:15:08

If you kill something, look it in the eyes as you do.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

On clinging05 Oct 202200:17:48

How can "non-attachment" be compatible with care? We need to distinguish between caring and clinging.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

Thoughts on being mortal05 Oct 202200:12:44

You can't keep any of it. The only thing to do is to give it away on purpose.

Text version here.

Edited for Joe Carlsmith by TYPE III AUDIO.

© My Podcast Data