Argmax – Details, episodes & analysis

Podcast details

Technical and general information from the podcast's RSS feed.

Argmax

Vahe Hagopian, Taka Hasegawa, Farrukh Rahman

Science

Frequency: 1 episode/60d. Total Eps: 17

A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm

Site

RSS

Apple

Recent rankings

Latest chart positions across Apple Podcasts and Spotify rankings.

Apple Podcasts

🇩🇪 Germany - mathematics
09/06/2026
#28
🇺🇸 USA - mathematics
09/06/2026
#51
🇩🇪 Germany - mathematics
08/06/2026
#27
🇺🇸 USA - mathematics
08/06/2026
#44
🇩🇪 Germany - mathematics
07/06/2026
#24
🇺🇸 USA - mathematics
07/06/2026
#57
🇩🇪 Germany - mathematics
05/06/2026
#25
🇩🇪 Germany - mathematics
04/06/2026
#24
🇩🇪 Germany - mathematics
03/06/2026
#23
🇩🇪 Germany - mathematics
02/06/2026
#22

Spotify

No recent rankings available

Shared links between episodes and podcasts

Links found in episode descriptions and other podcasts that share them.

See all

https://www.sciencedirect.com/science/article/pii/S0004370221000862
4 shares
https://arxiv.org/abs/2105.04906
3 shares
https://www.nature.com/articles/s41586-021-04357-7
2 shares

https://www.youtube.com/@argmaxfm
1 share
https://youtu.be/jPCV4GKX9Dw
1 share
https://youtu.be/lLzHr0VFi3Y
1 share

RSS feed quality and score

Technical evaluation of the podcast's RSS feed quality and structure.

See all

RSS feed quality

To improve

Score global : 38%

Publication history

Monthly episode publishing history over the past years.

Year

Episodes published by month in

Latest published episodes

Recent episodes with titles, durations, and descriptions.

See all

LoRA

Season 2 · Episode 1

samedi 2 septembre 2023 • Duration 01:02:56

We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y

15: InstructGPT

Season 1 · Episode 15

mardi 28 mars 2023 • Duration 57:27

In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.

6: Deep Reinforcement Learning at the Edge of the Statistical Precipice

Season 1 · Episode 6

lundi 6 juin 2022 • Duration 01:01:08

We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility.

5: QMIX

Season 1 · Episode 5

mardi 26 avril 2022 • Duration 42:06

We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL.

4: Can Neural Nets Learn the Same Model Twice?

Season 1 · Episode 4

mercredi 6 avril 2022 • Duration 55:23

Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility
and Double Descent from the Decision Boundary Perspective (https://arxiv.org/pdf/2203.08124.pdf)

Summary:
A discussion of reproducibility and double descent through visualizations of decision boundaries.

Highlights of the discussion:

Relationship between model performance and reproducibility
Which models are robust and reproducible
How they calculate the various scores

3: VICReg

Season 1 · Episode 3

lundi 21 mars 2022 • Duration 44:46

Todays paper: VICReg (https://arxiv.org/abs/2105.04906)

Summary of the paper
VICReg prevents representation collapse using a mixture of variance, invariance and covariance when calculating the loss. It does not require negative samples and achieves great performance on downstream tasks.

Highlights of discussion

The VICReg architecture (Figure 1)
Sensitivity to hyperparameters (Table 7)
Top 5 metric usefulness

2: data2vec

Season 1 · Episode 2

lundi 7 mars 2022 • Duration 53:23

Todays paper: data2vec (https://arxiv.org/abs/2202.03555)

Summary of the paper
A multimodal SSL algorithm that predicts latent representation of different types of input.

Highlights of discussion

What are the motivations of SSL and multimodal
How does the student teacher learning work?
What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.

1: Reward is Enough

Season 1 · Episode 1

lundi 21 février 2022 • Duration 54:36

This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.

Todays paper: Reward is Enough

Summary of the paper
The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.

Highlights of discussion

High level overview of Reinforcement Learning
How evolution can be encoded as a reward maximization problem
What is the one reward signal we are trying to optimize?

14: Whisper

Season 1 · Episode 14

vendredi 17 mars 2023 • Duration 49:14

This week we talk about Whisper. It is a weakly supervised speech recognition model.

13: AlphaTensor

Season 1 · Episode 13

samedi 11 mars 2023 • Duration 49:05

We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.

Argmax – Details, episodes & analysis

Podcast details

Recent rankings

Apple Podcasts

Spotify

Shared links between episodes and podcasts

Other12

Youtube3

RSS feed quality and score

Publication history

Latest published episodes

Similar podcasts and content

Related Shows Based on Content Similarities