Back

Explore every episode of the podcast Mixture of Experts

Dive into the complete episode list for Mixture of Experts. Each episode is cataloged with detailed descriptions, making it easy to find and explore specific topics. Keep track of all episodes from your favorite podcast and never miss a moment of insightful content.

Rows per page:

1–50 of 99

TitlePub. DateDuration
Introducing Mixture of Experts Podcast07 Jun 202400:01:11

Introducing the Mixture of Experts podcast, your weekly deep dive into the ever-evolving landscape of artificial intelligence—bringing you insightful discussions on the latest AI trends, innovations, and the impact on business. You will hear from a panel of researchers, engineers, data scientists, ethics experts, veteran product leaders and more! Tune in weekly to stay ahead of the AI wave.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 29: Scaling AI, agent-led future, and race to AGI15 Nov 202400:39:11

Is 2024 the year scaling AI officially breaks? In Episode 29 of Mixture of Experts, host Tim Hwang is joined by Anthony Annunziata, Kate Soule and Naveen Rao. First, the experts discuss whether we are living in a post scale world. Next, we can’t have an episode without chatting AI agents, but what does the future hold for this technology? Finally, is AGI here to stay? Tune-in to this week’s Mixture of Experts to find out.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 20: Apple Intelligence, Reflection 70B, open-source AI agents, and LLM research ideas13 Sep 202400:38:29

Can Apple Intelligence compete with the AI market offerings? In Episode 20 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Kate Soule and Maya Murad. Today, the experts chat Apple Intelligence, the performance of Reflection’s 70B, and a new paper released on LLMs generating novel research ideas. Additionally, IBM soft launched the Bee Agent Framework to help build agentic workflows with leading open-source and proprietary models. Tune-in to hear our expert panel break down this week’s AI news.



The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 19: NEO 1X robot, OpenAI chips, The AI Scientist, and the future of prompt engineering06 Sep 202400:37:58

Will prompt engineering ever die? In Episode 19 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Kate Soule and Shobhit Varshney. Today, the experts chat the future of prompt engineering, a new paper released about The AI Scientist, NEO 1X’s humanoid robot, and OpenAI’s in-house AI chips. Will AI takeover scientific discovery? Will everyone have at home AI assistants? Why is OpenAI investing in chip production? Tune-in for our expert’s takes.


0:00 - Intro

1:17 - Future of Prompt Engineering

11:18 - NEO 1X Robot

21:56 - AI for Scientific Discovery

31:48 - OpenAI's in-house Chips


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 18: Cursor hype, Perplexity introduces ads, and AI at the US Open30 Aug 202400:40:29

Is search less trustworthy? In Episode 18 of Mixture of Experts, host Tim Hwang is joined by the IBM Fellows—Aaron Baughman, Kush Varshney, and Trent Gray-Donald. Today, the experts chat how AI is being integrated at the US Open. Next, the Perplexity is introducing ads in Q4, what is the affect on search? Finally, what's all the hype with Cursor? Tune-in to today’s episode for all this and more.


0:01 - Intro

00:59 - AI at the US Open

13:35 - Paid search in Perplexity

24:12 - Cursor hype!

35:53 - IBM Fellows


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.




Episode 17: Agent Q, no AI in art, and AMD acquires ZT Systems23 Aug 202400:46:56

What’s new with AI agents? In Episode 17 of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Skyler Speakman, and Volkmar Uhlig. Today, the experts chat Agent Q and the improvements in reasoning and planning. Next, the CEO of Procreate came out with a statement that there will be no gen AI integrated into their products—can art avoid the AI wave? Finally, AMD acquired ZT Systems, can they now compete with NVIDIA? All this and more on today’s episode.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Segments:

0:01 — Intro

00:51 — Agent Q

14:21 — No AI in Art

29:12 — AMD Acquires ZT Systems

Episode 16: Cost of a Data Breach 2024 and OpenAI's Project Strawberry16 Aug 202400:22:56

Is OpenAI about to release their biggest AI project? In Episode 16 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kate Soule, and Shobhit Varshney. Today, the experts chat IBM’s 2024 Cost of a Data Breach Report and analyze how gen AI could reduce the cost of cyber threats. Next, rumors are circulating the internet about OpenAI dropping “Project Strawberry,” what they internally reference as a “level 2” model. Are the rumors true? Tune-in for more.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Segments:

0:01 — Intro

00:52 — Cost of a Data Breach 2024

12:33— Project Strawberry

Episode 15: OpenAI Structured Outputs, character.ai “acquisition,” and is it an AI bubble?09 Aug 202400:32:08

Is it an AI bubble? In Episode 15 of Mixture of Experts, host Tim Hwang is joined by our veteran panel: Marina Danilevsky, Kush Varshney, and Shobhit Varshney. Today, the experts chat the stock market crash and the involvement of AI companies. Then, OpenAI released Structured Outputs, and analyze how this can support enterprise implementation of AI. Finally, Google "acquires" character.ai, does this make any sense? Tune-in for the breakdown.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


0:01 — Intro

1:07 — AI Bubble?

11:49 — Structured Outputs

22:41 — character.ai "Acquisition"

Episode 14: SAM 2, friend.com and will gen AI projects be abandoned?02 Aug 202400:28:48

Meta releases SAM 2! In Episode 14 of Mixture of Experts, host Tim Hwang is joined by Ambhi Ganesan, Kate Soule and Vagner Santana. Today, the experts chat the next generation of Meta’s Segment Anything Model (SAM). Then, another AI companion attempt via friend.com, we analyze if startups effectively compete in the AI hardware space. Finally, we get expert opinions on various topics: Will gen AI projects be abandoned? Which is bigger—9.11 or 9.9? Tune-in today to find out.


0:01 — Intro

1:00 — SAM 2

10:49 — Friend.com

20:53 — Abandoned gen AI projects

25:38 — Which is bigger—9.11 or 9.9?


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 13: Meta's Llama 3.1, Mistral Large 2, and big interest in small models26 Jul 202400:20:20

Meta strikes back with the launch of Llama 3.1! In Episode 13 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Shobhit Varshney, and Maryam Ashoori. Today, the experts analyze the business of AI in relation to the launch of Llama 3.1, including Llama 405B. Then, Mistral Large 2 sparks conversation about the open-source wave. Finally, the experts talk GPT 4o-mini and the model price war. Are little models having their moment? Tune-in to find out.


0:01 — Intro

1:33 — Llama 3.1 and Mistral Large 2

10:08 — Are Little Models Having a Moment?


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 12: Goldman Sachs Gen AI report, Claude 2.0 Engineer, and RIAA lawsuits19 Jul 202400:31:15

Will modern AI break the music industry? In Episode 12 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Marina Danilevsky, and Brent Smolinski. Today, we review Goldman Sachs’ report on investment in Gen AI, “too much spend, too little benefit.” Next, the experts break down Claude 2.0 Engineer and the future of coding agents. Finally, the Recording Industry Association of America (RIAA) files lawsuits against two generative music companies. Tune-in to hear our expert takes!


0:01 — Intro

1:48 — Goldman Sachs Gen AI Spend Report

12:30 — Claude Engineer 2.0

21:02 — RIAA vs. Suno / Udio


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 11: AI at Wimbledon, ChatGPT for coding, and scaling with AI personas12 Jul 202400:41:21

It's Wimbledon finals week! In Episode 11 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Kaoutar El Maghraoui, and Skyler Speakman. Today, we review how AI is providing insights throughout one of the most prestigious tennis tournaments and the future of AI in sports. Next, the experts break down the quality of ChatGPT for coding. Finally, how did scaling synthetic data create one billion personas?


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 28: SearchGPT, from Naptime to Big Sleep, and GitHub Octoverse updates08 Nov 202400:39:49

Could AI wipe out software engineers? In Episode 28 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Kaoutar El Maghraoui, and Shobhit Varshney. First, the experts discuss GitHub reporting a rise of developers driven by AI code assistant tools. Next, Big Sleep finds a vulnerability in SQLite, what is the future for these kinds of AI agents? Finally, OpenAI released SearchGPT, what is the future of AI search? Tune-in today to find out!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Episode 10: AI Hardware: Training, Inference, Devices and Model Optimization03 Jul 202400:38:26

In Episode 10 of Mixture of Experts we are talking all hardware all the time. Guest host Bryan Casey is joined by Volkmar Uhlig, Chris Hay, and Kaoutar El Maghraoui to explore the intricacies of AI hardware. Is Apple creating a pattern for the industry with their on device and cloud architecture? Tune in to hear the experts debate the details.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 9: Claude 3.5 Sonnet, BIRD-SQL, and the latest in AI Slop28 Jun 202400:39:03

Is shrimp Jesus the best use case of AI content creation? In Episode 9 of Mixture of Experts, guest host Bryan Casey is joined by Shobhit Varshney, Marina Danilevsky, and Michael Glass. The experts analyze both the release Anthropic’s Claude 3.5 and BIRD-SQL. Finally, we talk the latest in AI slop and how it is affecting content creation.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 8: NVIDIA’s Nemotron-4 340B models, Safe Superintelligence Inc. and AI agents21 Jun 202400:43:47

Is there a new major player in the AI space? In Episode 8 of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Kate Soule, and Maya Murad. First, the experts react to NVIDIA’s Nemotron-4 340B model launch and the future of LLM training. Next, new developments in enterprise agents create a great discussion around the reality of AI agents. Finally, we discuss the launch of a new AI company, Safe Superintelligence Inc.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Apple's WWDC24 reactions and mechanistic interpretability14 Jun 202400:39:41

Is Apple late to the AI game? In Episode 7 of Mixture of Experts, host Tim Hwang is joined by Shobhit Varshney, Skyler Speakman, and Kaoutar El Maghraoui. Today, the experts react to Apple’s slew of AI announcements at WWDC24. Then, part 2 on interpretability this week, as OpenAI released their study mechanistic interpretability.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

AI safety, RAG benchmarking and responsible AI at ACM FAccT Conference07 Jun 202400:40:29

What’s the future of AGI? In Episode 6 of Mixture of Experts, host Tim Hwang is joined by Vagner Figueredo de Santana, Marina Danilesky, and Shobhit Varshney. Today, the experts unpack Leopold Aschenbrenner’s AI safety screed, Situational Awareness. We also break down the state of responsible AI amid the annual ACM Fairness, Accountability, and Transparency (FAccT) Conference. Finally, we chat about RAG benchmarking and what it tells us about the industry as a whole.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Google’s AI Overviews, Golden Gate Claude, the "whale computer" and scaling laws31 May 202400:44:20

How is the market reacting to Google's AI overviews? In Episode 5 of Mixture of Experts, Bryan Casey, our guest host, is joined by Kate Soule, Chris Hay, and Skyler Speakman. Today, our experts revisit a conversation from a previous episode around Google’s AI Overviews and the market reaction. Additionally, they break down Anthropic’s Golden Gate Claude. Finally, what is the “whale computer” and how does it relate to scaling laws?


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Scarlett Johansson, FMTI and Think 202424 May 202400:38:26

What’s going on between Scarlet Johansson and OpenAI? In Episode 4 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Marina Danilevsky, and Armand Ruiz. Kate explains the future of FMTI, Marina highlights innovations driving the open-source community, and Armand dives into the latest from IBM’s THINK 2024 event.


Subscribe for AI updates:https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120

Learn more about artificial intelligence: https://www.ibm.com/think/artificial-intelligence


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

GPT-4o, AI overviews and our multimodal future17 May 202400:40:58

In Episode 3 of Mixture of Experts, host Tim Hwang is joined by Shobhit Varshney, Chris Hay, and Bryan Casey for the OpenAI vs. Google showdown. Shobhit analyzes the showcase demos released by OpenAI and Google. Chris breaks down latency and cost in relation to GPT-4 and Gemini 1.5 Flash. Finally, after years of people proclaiming the death of search, Bryan answers the big question: are LLMs forcing the death of Google search?


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

The state of open source, InspectorRAGet, and what’s going on with Kolmogorov-Arnold Networks10 May 202400:46:14

In Episode 2 of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Marina Danilevsky, and David Cox. This week, the three AI experts weigh in on the explosion of open source technology and identify how it will shape the market. Kush and Tim produce the single most easy explanation of what’s going on with Kolmogorov-Arnold Networks and why it matters. Finally, we kick it back to the 90s with Inspector RAGet!

The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Rabbit AI hiccups, GPT-2 chatbot, and OpenAI's licensing deal with the Financial Times03 May 202400:41:39

In the inaugural episode of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Shobhit Varshney, and Chris Hay. The three AI experts debate the pros and cons of Rabbit’s R1 device. They also unpack GPT-2’s potential evolution and OpenAI’s licensing deal with the Financial Times. Finally, what do Sam Altman and Taylor Swift have in common? Join us to find out!



The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 27: The future of agents, AI energy consumption, Anthropic's computer use, and Google watermarking AI01 Nov 202400:32:59

Agents, agents, and more agents! In Episode 27 of Mixture of Experts, host Tim Hwang is joined by Volkmar Uhlig and Vyoma Gajjar. First, the experts chat about Mark Benioff’s spicy tweet, and what this means for the future of AI agents. Next, how much energy is needed to power AI models, and should we be concerned? Then, the experts debrief Anthropic’s release of computer use. Finally, Google is integrating SynthID-Text into Gemini to help watermark AI-generated text, do we need this feature? Learn more on today’s Mixture of Experts.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 26: Granite 3.0, NVIDIA’s Nemotron AI model, and Perplexity’s fundraising25 Oct 202400:37:18

Can chat replace search? In Episode 26 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Petros Zerfos for IBM TechXchange week! First, the experts describe how the team created the Granite 3.0 models. Next, NVIDIA enters the open source model game, what does this mean for the competition? Finally, Perplexity AI is seeking over double their valuation in new funding rounds, what does this mean for start-ups? All that and more on today’s episode.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Episode 25: Machines of Loving Grace, Entropix, AI and elections, GSM8K18 Oct 202400:41:17

Can AI solve infectious disease? In Episode 25 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Maya Murad, and Ruben Boonen. Today we analyze some papers. First, the experts dissect Machines of Loving Grace, a 15,000 word essay written by Anthropic’s CEO making some major AI predictions. Then, Apple generated a new benchmark based of GSM8K in a recent paper, the findings were intriguing. Next, we talk Entropix, a sampler intending to replicate chain of thought features. Finally, OpenAI disclosed they are seeing an increase in AI models faking articles, what can we do to fix this? All this and more, on today’s Mixture of Experts.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 24: AI in the Nobels, DGX B200 arrival, and Unstructured’s $40M funding round11 Oct 202400:37:19

Could AI win a Nobel Prize in the future? In Episode 24 of Mixture of Experts, host Tim Hwang is joined by Chris Hay and Edward Calvesbert. First, the experts debrief the ‘Godfather of AI’ sharing a Nobel Prize. Next, we talk AI platforms and the hype around DGX B200. Finally, unstructured data is becoming usable for LLMs, why are companies like NVIDIA so interested in this data? Tune-in today to find out!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 23: NotebookLM, OpenAI DevDay, and will AI prevent phishing attacks?04 Oct 202400:39:15

Will DeepDive replace the Mixture of Experts podcast? In Episode 23, host Tim Hwang is joined by IBM Researchers Marina Danilevsky, Nathalie Baracaldo and Vagner Santana to dissect this week’s AI news. First, the experts talk about the hype around Google’s NotebookLM, specifically regarding the DeepDive podcast feature. Next, OpenAI DevDay sparks some interesting conversation around vision fine-tuning and multimodality. Finally, it’s Cybersecurity Awareness Month and IBM X-Force released the Cloud Threat Landscape Report. Will AI prevent phishing attacks? Tune-in to this week’s episode to learn more!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 22: Llama 3.2, AI Snake Oil, and gen AI for sustainability27 Sep 202400:33:51

Meta releases Llama 3.2! In Episode 22 of Mixture of Experts, host Tim Hwang is joined by Maryam Ashoori, Skyler Speakman, and Shobhit Varshney to debrief an exciting week of AI news. First, Meta is back with the release of Llama 3.2, and lightweight (1B/3B) models. Next, it’s Climate Week NYC, we chat the use of gen AI in achieving sustainable development goals. Specifically, IBM and NASA’s AI model for weather and climate. Finally, the book version of “AI Snake Oil” officially dropped and the authors claim they will be wrong in 2.5 years. What do our experts think? Tune-in today to find out!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 21: OpenAI o1 preview, Agentforce, AI in fantasy football, and machine unlearning20 Sep 202400:47:45

Strawberry is officially here! In Episode 21 of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Nathalie Baracaldo, and Aaron Baughman to chat about the hype around OpenAI’s o1 preview. Additionally, we cover AI agents again, with the launch of Agentforce. Next, Aaron—our resident AI in sports expert analyzes the AI powered insights for fantasy football. Finally, what is “machine unlearning,” and why should we care? All this and more, on today’s episode of Mixture of Experts.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Episode 39: DeepSeek-R1, Mistral IPO, FrontierMath controversy, and IDC code assistant report24 Jan 202500:39:45

What does the future hold for DeepSeek? In episode 39 of Mixture of Experts, join host Tim Hwang along with experts Abraham Daniels, Kaoutar El Maghraoui and Skyler Speakman to discuss the release of DeepSeek-R1. Next, Mistral indicates going IPO. Then, FrontierMath’s new benchmark is particularly difficult, the experts debrief. Finally, IDC released a report on code assistants, what do we need to know about generalist and specialized coding assistants? Tune-in to this week’s episode to find out. 


00:01 – Intro  

01:08 – DeepSeek-R1 

14:08 – Mistral indicates IPO 

20:54 – FrontierMath controversy 

30:04 -- IDC code assistants report 


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 

Episode 38: Anthropic valuation rumors, Microsoft CoreAI, NotebookLM upgrades, and AI agents in finance17 Jan 202500:44:49

What would you do with $2 billion? In episode 38 of Mixture of Experts, join host Tim Hwang along with experts Chris Hay, Kaoutar El Maghraoui and Vyoma Gajjar to discuss the Anthropic valuation rumors. Next, Microsoft CEO Nadella created a new CoreAI group to build and run apps for customers. Then, NotebookLM upgraded some of its features, including podcast intervention. Finally, AI agents are making their way into the financial services industry. Can an agent invest all of your money? Tune-in to this week’s episode to find out. 


00:01 -- What would you do with $2 billion? 

00:51 -- Anthropic valuation 

12:14 -- Microsoft CoreAI 

25:01 -- NotebookLM upgrades 

35:17 -- AI agents in finance 


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 

Episode 37: CES 2025, NVIDIA DIGITS, Apple Intelligence fails, and Sam Altman’s reflections10 Jan 202500:35:38

What’s the most exciting CES AI announcement? In episode 37 of Mixture of Experts, host Tim Hwang is joined by Skyler Speakman, Volkmar Uhlig and Shobhit Varshney to debrief CES 2025. Specifically, the experts dive into NVIDIA’S Project DIGITS, among other announcements from the AI hardware giant. Next, a new enterprise AI development survey came out that detailing how developers really feel about AI implementation. Then, Apple Intelligence experienced some major hallucination fails, what does this tell us about Apple’s stake in the AI game? Finally, Sam Altman of OpenAI released a reflection blog, what does he say about the future of AI? All that and more on today’s Mixture of Experts.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 36: OpenAI o3, DeepSeek-V3, and the Brundage/Marcus AI bet03 Jan 202500:39:19

Is deep learning hitting a wall? It’s 2025 and Mixture of Experts is back and better than ever. In episode 36, host Tim Hwang is joined by Chris Hay, Kate Soule and Kush Varshney to debrief one of the biggest releases of 2024, OpenAI o3. Next, DeepSeek-V3 is here! Finally, will AI exist in 2027? The experts dissect the AI bet between Miles Brundage and Gary Marcus. All that and more on the first Mixture of Experts of 2025.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


00:00 — Intro

00:49 — OpenAI o3

14:40 — DeepSeek-V3

28:00 — The Brundage/Marcus bet

Episode 35: 2024 Rewind: Breakthroughs in AI models, agents, hardware and products27 Dec 202401:01:43

Will 2025 be the year of AI agents? In Episode 35 of Mixture of Experts, host Tim Hwang is joined by some show veterans to debrief 2024 in AI. This week, we review AI models, agents, hardware and product releases with some of the top industry experts. What was the best model of 2024? Is NVIDIA king? What are some of the AI trends in 2025? All that and more on this special edition of Mixture of Experts.


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 34: Granite 3.1, NVIDIA Jetson, stealing AI models, and is pre-training over?20 Dec 202400:40:30

Is pre-training a thing of the past? In Episode 34 of Mixture of Experts, host Tim Hwang is joined by Abraham Daniels, Vagner Santana and Volkmar Uhlig to debrief this week in AI. First, OpenAI cofounder Ilya Sutskever said that “peak data” was achieved, does this mean there is no longer a need to model pre-training? Next, IBM released Granite 3.1 with a slew of features, we cover them all. Then, there is a new way to steal AI models, how do we protect against model exfiltration. Finally, can NVIDIA Jetson for AI developers really increase hardware accessibility? Tune-in for more!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


00:01 — Intro

00:49— Is pre-training over?

10:25 — Granite 3.1

22:23 — AI model stealing

33:38—NVIDIA Jetson

Episode 33: 12 Days of OpenAI, NeurIPS, ARC Prize, and Llama 3.3 70B13 Dec 202400:40:50

Is o1 Pro worth the cost? In Episode 33 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Kate Soule and Vyoma Gajjar. First, the experts debrief the 12 Days of OpenAI. Next, we review some of the top papers in NeurIPS, how are the experts keeping up with all these research papers? Then, we are back with another benchmark, can ARC Prize make AGI more tractable? Finally, Meta announced the launch of Llama 3.3 70B with the promise of 405B performance, can we have our cake and eat it too? Find out more on today’s Mixture of Experts!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 32: Inside AWS re:Invent 2024, LLM Flowbreaking, and David Mayer06 Dec 202400:37:55

What’s the mystery behind the name ChatGPT refuses to discuss? In Episode 32 of Mixture of Experts host Tim Hwang dives into the hottest topics shaping the AI landscape with an all-star panel: Aaron Baughman, Vagner Figueredo de Santana, and Shobhit Varshney. First, they disect the biggest announcements and takeaways from AWS re:Invent 2024, Amazon’s premier AI event. Next, they talk about overcoming architectural vulnerabilities in AI systems, and finally, they uncover the curious case of a name ChatGPT won’t discuss—and the questions this raises about privacy and transparency in AI. Get ready for an episode packed with insights, debates, and forward-thinking perspectives!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 31: AI in education: Safety, literacy, and predictions27 Nov 202400:36:31

How much future learning will be done with an AI assistant? In Episode 31 of Mixture of Experts, host Tim Hwang is joined by Phaedra Boinodiris, Marina Danilevsky and Skyler Speakman for the AI in education special episode. First, the experts give an update on the state of AI within education. Next, we cover concerns around AI safety and literacy, what do students and teachers need to be aware of? Finally, the panel gives their predictions on what the future of education holds as it relates to AI. Tune-in to this special episode for an in-depth analysis!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 30: “Near-infinite memory,” Microsoft Ignite, FrontierMath, and AlphaFold322 Nov 202400:43:12

Should your AI assistant remember everything about you? In Episode 30 of Mixture of Experts, host Tim Hwang is joined by Vagner Santana, Vyoma Gajjar and Shobhit Varshney. First, the experts breakdown claims of “near-infinite memory” within AI models. Next, Shobhit is fresh off the plane from Microsoft Ignite, he shares some of the exciting new announcements following the event. Then, a new benchmark has entered the chat, what do we know about FrontierMath? Finally, AlphaFold3 is now more open, why does this matter? Find out more on today’s episode!


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 40: DeepSeek facts vs hype, model distillation, and open source competition31 Jan 202500:39:17

Let’s bust some early myths about DeepSeek. In episode 40 of Mixture of Experts, join host Tim Hwang along with experts Aaron Baughman, Chris Hay and Kate Soule. Last week, we covered the release of DeepSeek-R1; now that the entire world is up to speed, let’s separate the facts from the hype. Next, what is model distillation and why does it matter for competition in AI? Finally, Sam Altman among other tech CEOs shared his response to DeepSeek. Will R1 radically change the open-source strategy of other tech giants? Find out all this and more on Mixture of Experts.


00:01 – Intro

00:41 – DeepSeek facts vs hype

21:00 – Model distillation

31:21 – Open source and OpenAI


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Episode 43: Deep Research, OpenAI inference chip, small VLMs, and AI agent job posting21 Feb 202500:45:51

What is all the hype around Deep Research? In episode 43 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Volkmar Uhlig and Shobhit Varshney. This week, we discuss reasoning model features coming out of companies like OpenAI’s Deep Research, Google Gemini, Perplexity, xAI’s Grok-3 and more! Next, OpenAI is rumored to release an inference chip, but how likely is this to be a success in the AI chip game? Then, we analyze the capabilities of small vision-language models (VLMs). Finally, a startup, Firecrawl, released a job posting in search of an AI agent. Is this the future for AI tools in the workforce? Tune-in to today’s Mixture of Experts to find out.


00:01 – Intro

00:35 – Deep Research

11:58 – OpenAI inference chip

22:17 – Small VLMs

32:31 – AI agent job posting


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Episode 42: Paris AI Summit, Altman's "Three Observations," and Anthropic's Economic Index14 Feb 202500:39:56

Live from Paris, Tim Hwang is at the AI Action Summit 2025. In episode 42 of Mixture of Experts, we welcome Anastasia Stasenko, CEO and Co-Founder of pleias along with our veteran experts Marina Danilevsky and Chris Hay. Last week, we touched on some potential conversations at the Paris AI Summit, this week we recap what actually happened. Is AI safety improving Globally? Next, for our paper of the week, we breakdown s1: Simple test-time scaling. Then, Sam Altman is back with another blog, “Three Observations,” what do our experts have to say? Finally, what can we learn from Anthropic’s Economic Index? All that and more on today’s Mixture of Experts. 


00:01 – Intro 

00:42 – Paris AI Summit 

11:10 – s1: Simple test-time scaling 

19:32 – Sam Altman’s “Three Observations” 

30:41 – Anthropic’s Economic Index 


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 


Resources:

Read the paper about s1: Simple test-time scaling: https://arxiv.org/abs/2501.19393

Read Sam Altman's "Three Observations": https://blog.samaltman.com/three-observations

Read Anthropic's Economic Index: https://www.anthropic.com/economic-index

Read more about AGI: https://www.ibm.com/think/topics/artificial-general-intelligence

Episode 41: OpenAI deep research, o3-mini, AI Action Summit, and Anthropic’s Constitutional Classifiers07 Feb 202500:38:08

What does Sam Altman have up his sleeve? In episode 41 of Mixture of Experts, join host Tim Hwang along with experts Nathalie Baracaldo, Marina Danilevsky and Chris Hay. Last week, we covered all things DeepSeek, and this week OpenAI has some new releases to share. Today, the experts dissect deep research and o3-mini. Next, our host Tim Hwang is travelling to AI Action Summit, he asks our experts what we can expect coming out of the event. Then, we talk about Anthropic’s Constitutional Classifiers. Finally, Microsoft is creating a unit to study AI’s impact, what does this mean? Find out all this and more on Mixture of Experts.


00:01 – intro

00:41 – Open AI deep research and o3-mini

13:51 – AI Action Summit

20:17 – Anthropic’s Constitutional Classifiers

28:54 – Microsoft AI Impact team


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.


Bonus: OpenAI GPT-4.5: And the future of pre-training is...01 Mar 202500:24:07

Is pre-training dead? In this bonus episode of Mixture of Experts, guest host Bryan Casey is joined by Kate Soule and Chris Hay. On Thursday, Sam Altman dropped GPT-4.5 just after we wrapped our weekly recording. We got a few of our veteran experts on the podcast to analyze OpenAI’s largest and “best” chat model yet. What’s the hype? Tune-in to this bonus episode to find out! 


00:01 – Intro  

00:25 – GPT-4.5 


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 

Episode 44: Claude 3.7 Sonnet, BeeAI agents, Granite 3.2, and emergent misalignment28 Feb 202500:39:45

Granite 3.2 is officially here! In episode 44 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Maya Murad and Kaoutar El Maghraoui to debrief a few big AI announcements. Last week we covered small vision-language models (VLMs), and this week Granite 3.2 dropped with  new VLMs, enhanced reasoning capabilities, and more! Kate takes us under the hood to understand the new features and how they were created. Next, Anthropic dropped a new intelligence model, Claude 3.7 Sonnet, and a new agentic coding tool, Claude Code. Why did Anthropic release these separately? Then, as we cannot have an episode without covering agents, Maya takes us through the new BeeAI agents! Finally, can fine tuning on a malicious task lead to much broader misalignment? Our experts analyze a new paper released on ‘Emergent misalignment.’ All that and more on this week's episode! 


00:01 – Intro  

00:41 – Claude 3.7 Sonnet 

11:58 – BeeAI agents  

20:11– Granite 3.2 

29:23 – Emergent misalignment 


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 

Quantum leap, Model Context Protocol, CoreWeave IPO and an AI voice companion07 Mar 202500:45:25

When can we expect quantum to reach consumer devices? In episode 45 of Mixture of Experts, host Tim Hwang is joined by special guest, Blake Johnson, to debrief the quantum noise in the news. Blake helps us understand the intersection between quantum and AI and how far we are from this technology. Then, veteran experts Chris Hay and Volkmar Uhlig hash out some other news in AI this week. We cover Anthropic’s Model Context Protocol, CoreWeave filing for an IPO and Sesame AI’s new voice companion. All that and more on today’s Mixture of Experts! 


00:01 – Intro  

01:06 – Quantum leap 

20:08 -- Model Context Protocol 

28:24 -- CoreWeave IPO 

40:12 -- Sesame AI voice companion 


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 

OpenAI goes open, Anthropic on interpretability, Apple Intelligence updates and Amazon AI agents04 Apr 202500:43:25

Will OpenAI be fully open source by 2027? In episode 49 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Ash Minhas and Chris Hay to analyze Sam Altman’s latest move towards open source. Next, we explore Anthropic's mechanistic interpretability results and the progress the AI research community is making. Then, can Apple catch up? We analyze the latest critiques on Apple Intelligence. Finally, Amazon enters the chat with AI agents. How does this elevate the competition? All that and more on today’s Mixture of Experts.


00:01 -- Introduction

00:48 -- OpenAI goes open  

11:36 -- Anthropic interpretability results 

24:55 -- Daring Fireball on Apple Intelligence 

34:22 -- Amazon’s AI agents


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.



Subscribe for AI updates: https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120

Learn more about artificial intelligence https://www.ibm.com/think/artificial-intelligence

Visit Mixture of Experts podcast page to learn more AI content https://www.ibm.com/think/podcasts/mixture-of-experts

DeepSeek-V3-0324, Gemini Canvas and GPT-4o image generation28 Mar 202500:41:43

What’s the best open-source model? In episode 48 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Skyler Speakman to explore the future of open-source AI models. First, we chat about the release of DeepSeek-V3-0324. Then, more announcements coming out of Google including Gemini Canvas and Gemini 2.5. Next, Extropic has entered the chat with a thermodynamic chip. Finally, AI image generation is on the rise as OpenAI released GPT-4o image generation. All that, and more on today’s Mixture of Experts.


00:01 – Intro

00:42– DeepSeek-V3-0324

09:48 – Gemini 2.5 and Canvas

21:27– Extropic’s thermodynamic chip

30:20 – OpenAI image generation


The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

NVIDIA GTC, Baidu reasoning models, and Gemini AI image generation21 Mar 202500:39:16

What’s the most exciting announcement coming out of NVIDIA GTC? In episode 47 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kaoutar El Maghraoui and Vyoma Gajjar. First, we dive into the latest announcements from NVIDIA GTC, including the Groot N1 model for humanoid robotics. Next, Baidu released some new AI reasoning models, and they’re not open source? Then, for our paper of the week we discuss the flaws of Chain-of-Thought reasoning. Finally, Gemini Flash 2.0 has released image generation models for developer experimentation., Iis Google catching up on the AI game? Tune -in to today’s Mixture of Experts to find out! 

 

00:01 – Intro  

01:27– NVIDIA GTC 

14:18– New Baidu AI models 

21:19– Chain-of-Thought reasoning 

32:18 – Gemini image generation 

 

The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 

© My Podcast Data