PyTorch Developer Podcast – Details, episodes & analysis
Podcast details
Technical and general information from the podcast's RSS feed.


Recent rankings
Latest chart positions across Apple Podcasts and Spotify rankings.
Apple Podcasts
🇩🇪 Germany - technology
25/03/2025#74🇺🇸 USA - technology
25/03/2025#69🇫🇷 France - technology
25/03/2025#85🇬🇧 Great Britain - technology
24/03/2025#70🇩🇪 Germany - technology
24/03/2025#46🇺🇸 USA - technology
24/03/2025#54🇫🇷 France - technology
24/03/2025#53🇬🇧 Great Britain - technology
23/03/2025#62🇩🇪 Germany - technology
23/03/2025#56🇺🇸 USA - technology
23/03/2025#60
Spotify
No recent rankings available
Shared links between episodes and podcasts
Links found in episode descriptions and other podcasts that share them.
See allRSS feed quality and score
Technical evaluation of the podcast's RSS feed quality and structure.
See allScore global : 42%
Publication history
Monthly episode publishing history over the past years.
Compiler collectives
Episode 83
dimanche 4 août 2024 • Duration 16:33
Compiler collectives are a PT2 feature where by compiler instances across multiple ranks use NCCL collectives to communicate information to other instances. This is used to ensure we consistently decide if inputs or static or dynamic across all ranks. See also PR at https://github.com/pytorch/pytorch/pull/130935
TORCH_TRACE and tlparse
Episode 82
lundi 29 avril 2024 • Duration 15:28
Inductor - Define-by-run IR
Episode 73
mercredi 24 janvier 2024 • Duration 12:06
Unsigned integers
Episode 72
mercredi 17 janvier 2024 • Duration 13:07
Inductor - IR
Episode 71
mardi 16 janvier 2024 • Duration 18:00
Inductor IR is an intermediate representation that lives between ATen FX graphs and the final Triton code generated by Inductor. It was designed to faithfully represent PyTorch semantics and accordingly models views, mutation and striding. When you write a lowering from ATen operators to Inductor IR, you get a TensorBox for each Tensor argument which contains a reference to the underlying IR (via StorageBox, and then a Buffer/ComputedBuffer) that says how the Tensor was computed. The inner computation is represented via define-by-run, which allows for compact definition of IR representation, while still allowing you to extract an FX graph out if you desire. Scheduling then takes buffers of inductor IR and decides what can be fused. Inductor IR may have too many nodes, this would be a good thing to refactor in the future.
Dynamo - VariableTracker
Episode 70
vendredi 12 janvier 2024 • Duration 15:55
I talk about VariableTracker in Dynamo. VariableTracker is Dynamo's representation of the Python. I talk about some recent changes, namely eager guards and mutable VT. I also tell you how to find the functionality you care about in VariableTracker (https://docs.google.com/document/d/1XDPNK3iNNShg07jRXDOrMk2V_i66u1hEbPltcsxE-3E/edit#heading=h.i6v7gqw5byv6).
Unbacked SymInts
Episode 69
mardi 21 février 2023 • Duration 21:31
This podcast goes over the basics of unbacked SymInts. You might want to listen to this one before listening to https://pytorch-dev-podcast.simplecast.com/episodes/zero-one-specialization Some questions we answer (h/t from Gregory Chanan):
- Are unbacked symints only for export? Because otherwise I could just break / wait for the actual size. But maybe I can save some retracing / graph breaks perf if I have them too? So the correct statement is "primarily" for export?
- Why am I looking into the broadcasting code at all? Naively, I would expect the export graph to be just a list of ATen ops strung together. Why do I recurse that far down? Why can't I annotate DONT_TRACE_ME_BRO?
- How does 0/1 specialization fit into this? I understand we may want to 0/1 specialize in a dynamic shape regime in "eager" mode (is there a better term?), but that doesn't seem to matter for export?
- So far we've mainly been talking about how to handle our own library code. There is a worry about pushing complicated constraints downstream, similar to torchscript. What constraints does this actually push?
Zero-one specialization
Episode 68
lundi 20 février 2023 • Duration 21:07
Mikey Dagistes joins me to ask some questions about the recent recent composability sync https://www.youtube.com/watch?v=NJV7YFbtoR4 where we discussed 0/1 specialization and its implications on export in PT2. What's the fuss all about? What do I need to understand about PT2 to understand why 0/1 specialization is a thing?
torchdynamo
Episode 67
mardi 6 décembre 2022 • Duration 25:35
What is torchdynamo? From a bird's eye view, what exactly does it do? What are some important things to know about it? How does it differ from other graph capture mechanisms?
For more reading, check out https://docs.google.com/document/d/13K03JN4gkbr40UMiW4nbZYtsw8NngQwrTRnL3knetGM/edit#
PyTorch 2.0
Episode 66
dimanche 4 décembre 2022 • Duration 17:51
- Soumith's keynote on PT2.0: https://youtu.be/vbtGZL7IrAw?t=1037
- PT2 Manifesto: https://docs.google.com/document/d/1tlgPcR2YmC3PcQuYDPUORFmEaBPQEmo8dsh4eUjnlyI/edit#
- PT2 Architecture: https://docs.google.com/document/d/1wpv8D2iwGkKjWyKof9gFdTf8ISszKbq1tsMVm-3hSuU/edit#



