Argmax – Détails, épisodes et analyse

Détails du podcast

Informations techniques et générales issues du flux RSS du podcast.

Argmax

Vahe Hagopian, Taka Hasegawa, Farrukh Rahman

Sciences

Fréquence : 1 épisode/60j. Total Éps: 17

A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm

Site

RSS

Apple

Classements récents

Dernières positions dans les classements Apple Podcasts et Spotify.

Apple Podcasts

🇩🇪 Allemagne - mathematics
09/06/2026
#28
🇺🇸 États-Unis - mathematics
09/06/2026
#51
🇩🇪 Allemagne - mathematics
08/06/2026
#27
🇺🇸 États-Unis - mathematics
08/06/2026
#44
🇩🇪 Allemagne - mathematics
07/06/2026
#24
🇺🇸 États-Unis - mathematics
07/06/2026
#57
🇩🇪 Allemagne - mathematics
05/06/2026
#25
🇩🇪 Allemagne - mathematics
04/06/2026
#24
🇩🇪 Allemagne - mathematics
03/06/2026
#23
🇩🇪 Allemagne - mathematics
02/06/2026
#22

Spotify

Aucun classement récent disponible

Liens partagés entre épisodes et podcasts

Liens présents dans les descriptions d'épisodes et autres podcasts les utilisant également.

See all

https://www.sciencedirect.com/science/article/pii/S0004370221000862
4 partages
https://arxiv.org/abs/2105.04906
3 partages
https://www.nature.com/articles/s41586-021-04357-7
2 partages

https://www.youtube.com/@argmaxfm
1 partage
https://youtu.be/jPCV4GKX9Dw
1 partage
https://youtu.be/lLzHr0VFi3Y
1 partage

Qualité et score du flux RSS

Évaluation technique de la qualité et de la structure du flux RSS.

See all

Qualité du flux RSS

À améliorer

Score global : 38%

Historique des publications

Répartition mensuelle des publications d'épisodes au fil des années.

Year

Episodes published by month in

Derniers épisodes publiés

Liste des épisodes récents, avec titres, durées et descriptions.

See all

LoRA

Saison 2 · Épisode 1

samedi 2 septembre 2023 • Durée 01:02:56

We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y

15: InstructGPT

Saison 1 · Épisode 15

mardi 28 mars 2023 • Durée 57:27

In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.

6: Deep Reinforcement Learning at the Edge of the Statistical Precipice

Saison 1 · Épisode 6

lundi 6 juin 2022 • Durée 01:01:08

We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility.

5: QMIX

Saison 1 · Épisode 5

mardi 26 avril 2022 • Durée 42:06

We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL.

4: Can Neural Nets Learn the Same Model Twice?

Saison 1 · Épisode 4

mercredi 6 avril 2022 • Durée 55:23

Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility
and Double Descent from the Decision Boundary Perspective (https://arxiv.org/pdf/2203.08124.pdf)

Summary:
A discussion of reproducibility and double descent through visualizations of decision boundaries.

Highlights of the discussion:

Relationship between model performance and reproducibility
Which models are robust and reproducible
How they calculate the various scores

3: VICReg

Saison 1 · Épisode 3

lundi 21 mars 2022 • Durée 44:46

Todays paper: VICReg (https://arxiv.org/abs/2105.04906)

Summary of the paper
VICReg prevents representation collapse using a mixture of variance, invariance and covariance when calculating the loss. It does not require negative samples and achieves great performance on downstream tasks.

Highlights of discussion

The VICReg architecture (Figure 1)
Sensitivity to hyperparameters (Table 7)
Top 5 metric usefulness

2: data2vec

Saison 1 · Épisode 2

lundi 7 mars 2022 • Durée 53:23

Todays paper: data2vec (https://arxiv.org/abs/2202.03555)

Summary of the paper
A multimodal SSL algorithm that predicts latent representation of different types of input.

Highlights of discussion

What are the motivations of SSL and multimodal
How does the student teacher learning work?
What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.

1: Reward is Enough

Saison 1 · Épisode 1

lundi 21 février 2022 • Durée 54:36

This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.

Todays paper: Reward is Enough

Summary of the paper
The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.

Highlights of discussion

High level overview of Reinforcement Learning
How evolution can be encoded as a reward maximization problem
What is the one reward signal we are trying to optimize?

14: Whisper

Saison 1 · Épisode 14

vendredi 17 mars 2023 • Durée 49:14

This week we talk about Whisper. It is a weakly supervised speech recognition model.

13: AlphaTensor

Saison 1 · Épisode 13

samedi 11 mars 2023 • Durée 49:05

We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.

Podcasts Similaires Basées sur le Contenu

Découvrez des podcasts liées à Argmax. Explorez des podcasts avec des thèmes, sujets, et formats similaires. Ces similarités sont calculées grâce à des données tangibles, pas d'extrapolations !

About us Privacy Policy

Argmax – Détails, épisodes et analyse

Détails du podcast

Classements récents

Apple Podcasts

Spotify

Liens partagés entre épisodes et podcasts

Other12

Youtube3

Qualité et score du flux RSS

Historique des publications

Derniers épisodes publiés

Podcasts et contenus similaires

Podcasts Similaires Basées sur le Contenu