Argmax – Details, episodes & analysis
Podcast details
Technical and general information from the podcast's RSS feed.


A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm
Recent rankings
Latest chart positions across Apple Podcasts and Spotify rankings.
Apple Podcasts
🇩🇪 Germany - mathematics
09/06/2026#28🇺🇸 USA - mathematics
09/06/2026#51🇩🇪 Germany - mathematics
08/06/2026#27🇺🇸 USA - mathematics
08/06/2026#44🇩🇪 Germany - mathematics
07/06/2026#24🇺🇸 USA - mathematics
07/06/2026#57🇩🇪 Germany - mathematics
05/06/2026#25🇩🇪 Germany - mathematics
04/06/2026#24🇩🇪 Germany - mathematics
03/06/2026#23🇩🇪 Germany - mathematics
02/06/2026#22
Spotify
No recent rankings available
Shared links between episodes and podcasts
Links found in episode descriptions and other podcasts that share them.
See allRSS feed quality and score
Technical evaluation of the podcast's RSS feed quality and structure.
See allScore global : 38%
Publication history
Monthly episode publishing history over the past years.
LoRA
Season 2 · Episode 1
samedi 2 septembre 2023 • Duration 01:02:56
We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y
15: InstructGPT
Season 1 · Episode 15
mardi 28 mars 2023 • Duration 57:27
In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.
6: Deep Reinforcement Learning at the Edge of the Statistical Precipice
Season 1 · Episode 6
lundi 6 juin 2022 • Duration 01:01:08
We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility.
5: QMIX
Season 1 · Episode 5
mardi 26 avril 2022 • Duration 42:06
We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL.
4: Can Neural Nets Learn the Same Model Twice?
Season 1 · Episode 4
mercredi 6 avril 2022 • Duration 55:23
Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility
and Double Descent from the Decision Boundary Perspective (https://arxiv.org/pdf/2203.08124.pdf)
Summary:
A discussion of reproducibility and double descent through visualizations of decision boundaries.
Highlights of the discussion:
- Relationship between model performance and reproducibility
- Which models are robust and reproducible
- How they calculate the various scores
3: VICReg
Season 1 · Episode 3
lundi 21 mars 2022 • Duration 44:46
Todays paper: VICReg (https://arxiv.org/abs/2105.04906)
Summary of the paper
VICReg prevents representation collapse using a mixture of variance, invariance and covariance when calculating the loss. It does not require negative samples and achieves great performance on downstream tasks.
Highlights of discussion
- The VICReg architecture (Figure 1)
- Sensitivity to hyperparameters (Table 7)
- Top 5 metric usefulness
2: data2vec
Season 1 · Episode 2
lundi 7 mars 2022 • Duration 53:23
Todays paper: data2vec (https://arxiv.org/abs/2202.03555)
Summary of the paper
A multimodal SSL algorithm that predicts latent representation of different types of input.
Highlights of discussion
- What are the motivations of SSL and multimodal
- How does the student teacher learning work?
- What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.
1: Reward is Enough
Season 1 · Episode 1
lundi 21 février 2022 • Duration 54:36
This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.
Todays paper: Reward is Enough
Summary of the paper
The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.
Highlights of discussion
- High level overview of Reinforcement Learning
- How evolution can be encoded as a reward maximization problem
- What is the one reward signal we are trying to optimize?
14: Whisper
Season 1 · Episode 14
vendredi 17 mars 2023 • Duration 49:14
13: AlphaTensor
Season 1 · Episode 13
samedi 11 mars 2023 • Duration 49:05
We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.



