Explore every episode of the podcast Joe Carlsmith Audio
| Title | Pub. Date | Duration | |
|---|---|---|---|
| Introduction and summary for "Otherness and control in the age of AGI" | 21 Jun 2024 | 00:12:23 | |
This is the introduction and summary for my series "Otherness and control in the age of AGI." | |||
| Second half of full audio for "Otherness and control in the age of AGI" | 18 Jun 2024 | 04:11:02 | |
Second half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power. | |||
| When "yang" goes wrong | 08 Jan 2024 | 00:21:32 | |
On the connection between deep atheism and seeking control. | |||
| Deep atheism and AI risk | 04 Jan 2024 | 00:46:59 | |
On a certain kind of fundamental mistrust towards Nature. | |||
| Gentleness and the artificial Other | 02 Jan 2024 | 00:22:39 | |
AIs as fellow creatures. And on getting eaten. | |||
| In search of benevolence (or: what should you get Clippy for Christmas?) | 27 Dec 2023 | 00:52:52 | |
What is altruism towards a paperclipper? Can you paint with all the colors of the wind at once? | |||
| Empirical work that might shed light on scheming (Section 6 of "Scheming AIs") | 16 Nov 2023 | 00:28:00 | |
This is section 6 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Summing up "Scheming AIs" (Section 5) | 16 Nov 2023 | 00:15:46 | |
This is section 5 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Speed arguments against scheming (Section 4.4-4.7 of "Scheming AIs") | 16 Nov 2023 | 00:15:19 | |
These are sections 4.4 through 4.7 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Simplicity arguments for scheming (Section 4.3 of "Scheming AIs") | 16 Nov 2023 | 00:19:37 | |
This is section 4.3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| The counting argument for scheming (Sections 4.1 and 4.2 of "Scheming AIs") | 16 Nov 2023 | 00:10:40 | |
These are sections 4.1 and 4.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Arguments for/against scheming that focus on the path SGD takes (Section 3 of "Scheming AIs") | 16 Nov 2023 | 00:29:03 | |
This is section 3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| First half of full audio for "Otherness and control in the age of AGI" | 17 Jun 2024 | 03:07:29 | |
First half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power. | |||
| Non-classic stories about scheming (Section 2.3.2 of "Scheming AIs") | 16 Nov 2023 | 00:24:34 | |
This is section 2.3.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Does scheming lead to adequate future empowerment? (Section 2.3.1.2 of "Scheming AIs") | 16 Nov 2023 | 00:22:54 | |
This is section 2.3.1.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| The goal-guarding hypothesis (Section 2.3.1.1 of "Scheming AIs") | 16 Nov 2023 | 00:19:11 | |
This is section 2.3.1.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| How useful for alignment-relevant work are AIs with short-term goals? (Section 2.2.4.3 of "Scheming AIs") | 16 Nov 2023 | 00:09:21 | |
This is section 2.2.4.3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Is scheming more likely if you train models to have long-term goals? (Sections 2.2.4.1-2.2.4.2 of "Scheming AIs") | 16 Nov 2023 | 00:09:01 | |
These are sections 2.2.4.1-2.2.4.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| "Clean" vs. "messy" goal-directedness (Section 2.2.3 of "Scheming AIs") | 16 Nov 2023 | 00:16:44 | |
This is section 2.2.3 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Two sources of beyond-episode goals (Section 2.2.2 of "Scheming AIs") | 16 Nov 2023 | 00:21:25 | |
This is section 2.2.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Two concepts of an "episode" (Section 2.2.1 of "Scheming AIs") | 16 Nov 2023 | 00:12:08 | |
This is section 2.2.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Situational awareness (Section 2.1 of "Scheming AIs") | 16 Nov 2023 | 00:09:27 | |
This is section 2.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| On "slack" in training (Section 1.5 of "Scheming AIs") | 16 Nov 2023 | 00:07:12 | |
This is section 1.5 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Loving a world you don't trust | 17 Jun 2024 | 01:03:54 | |
Garden, campfire, healing water. | |||
| Why focus on schemers in particular? (Sections 1.3-1.4 of "Scheming AIs") | 16 Nov 2023 | 00:31:17 | |
These are sections 1.3-1.4 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| A taxonomy of non-schemer models (Section 1.2 of "Scheming AIs") | 16 Nov 2023 | 00:11:20 | |
This is section 1.2 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Varieties of fake alignment (Section 1.1 of "Scheming AIs") | 16 Nov 2023 | 00:17:54 | |
This is section 1.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 | |||
| Full audio for "Scheming AIs: Will AIs fake alignment during training in order to get power?" | 15 Nov 2023 | 06:13:17 | |
This is the full audio for my report "Scheming AIs: Will AIs fake alignment during training in order to get power?" | |||
| Introduction and summary of "Scheming AIs: Will AIs fake alignment during training in order to get power?" | 14 Nov 2023 | 00:56:32 | |
This is a recording of the introductory section of my report "Scheming AIs: Will AIs fake alignment during training in order to get power?". This section includes a summary of the full report. The summary covers most of the main points and technical terminology, and I'm hoping that it will provide much of the context necessary to understand individual sections of the report on their own. (Note: the text of the report itself may not be public by the time this episode goes live.) | |||
| In memory of Louise Glück | 15 Oct 2023 | 00:21:22 | |
"It was, she said, a great discovery, albeit my real life." | |||
| On the limits of idealized values | 12 May 2023 | 01:00:14 | |
Contra some meta-ethical views, you can't forever aim to approximate the self you would become in idealized conditions. You have to actively create yourself, often in the here and now. | |||
| Predictable updating about AI risk | 08 May 2023 | 01:03:14 | |
How worried about AI risk will we feel in the future, when we can see advanced machine intelligence up close? We should worry accordingly now. Text version here: https://joecarlsmith.com/2023/05/08/predictable-updating-about-ai-risk | |||
| Existential Risk from Power-Seeking AI (shorter version) | 19 Mar 2023 | 00:55:03 | |
A shorter version of my report on existential risk from power-seeking AI. Forthcoming in an essay collection from Oxford University Press. Text version here: https://jc.gatspress.com/pdf/existential_risk_and_powerseeking_ai.pdf | |||
| Problems of evil | 05 Mar 2023 | 00:35:42 | |
Is everything holy? Can reality, in itself, be worthy of reverence? Text version here: https://joecarlsmith.com/2021/04/19/problems-of-evil | |||
| On attunement | 25 Mar 2024 | 00:44:14 | |
Examining a certain kind of meaning-laden receptivity to the world. | |||
| Seeing more whole | 17 Feb 2023 | 00:52:26 | |
On looking out of your own eyes. Text version at joecarlsmith.com. | |||
| Why should ethical anti-realists do ethics? | 16 Feb 2023 | 00:53:29 | |
Who needs a system if you're free? Text version at https://joecarlsmith.com/2023/02/16/why-should-ethical-anti-realists-do-ethics | |||
| Is Power-Seeking AI an Existential Risk? | 25 Jan 2023 | 03:21:02 | |
Audio version of my report on existential risk from power-seeking AI. Text here: https://arxiv.org/pdf/2206.13353.pdf. Narration by Type III audio. | |||
| On sincerity | 23 Dec 2022 | 01:35:02 | |
Nearby is the country they call life. Text version at: https://joecarlsmith.com/2022/12/23/on-sincerity | |||
| Against meta-ethical hedonism | 01 Dec 2022 | 01:02:29 | |
Can the epistemology of consciousness save moral realism and redeem experience machines? No. | |||
| Against the normative realist's wager | 09 Oct 2022 | 00:42:48 | |
If you find a button that gives you a hundred dollars if a certain controversial meta-ethical view is true, but you and your family get burned alive if that view is false, should you press the button? No. | |||
| On infinite ethics | 05 Oct 2022 | 01:25:05 | |
Infinities puncture the dream of a simple, bullet-biting utilitarianism. But they're everyone's problem. | |||
| Actually possible: thoughts on Utopia | 05 Oct 2022 | 00:28:39 | |
Life in the future could be profoundly good. I think this is an extremely important fact, and one that often goes under-estimated. | |||
| Against neutrality about creating happy lives | 05 Oct 2022 | 00:23:19 | |
Making happy people is good. Just ask the golden rule. | |||
| On future people, looking back at 21st century longtermism | 05 Oct 2022 | 00:25:28 | |
I find imagining future people looking back on present-day longtermism (the view that positively influencing the long-term future should be a key moral priority) a helpful intuition pump, especially re: a certain kind of “holy sh**” reaction to existential risk, and to the possible size and quality of the future at stake. | |||
| On green | 21 Mar 2024 | 01:15:13 | |
Examining a philosophical vibe that I think contrasts in interesting ways with "deep atheism." | |||
| Can you control the past? | 05 Oct 2022 | 01:17:03 | |
Sometimes, you can “control” events you have no causal interaction with (for example, if you're a deterministic software twin). | |||
| Killing the ants | 05 Oct 2022 | 00:15:08 | |
If you kill something, look it in the eyes as you do. | |||
| On clinging | 05 Oct 2022 | 00:17:48 | |
How can "non-attachment" be compatible with care? We need to distinguish between caring and clinging. | |||
| Thoughts on being mortal | 05 Oct 2022 | 00:12:44 | |
You can't keep any of it. The only thing to do is to give it away on purpose. | |||