Explore every episode of the podcast Mixture of Experts
| Title | Pub. Date | Duration | |
|---|---|---|---|
| Introducing Mixture of Experts Podcast | 07 Jun 2024 | 00:01:11 | |
Introducing the Mixture of Experts podcast, your weekly deep dive into the ever-evolving landscape of artificial intelligence—bringing you insightful discussions on the latest AI trends, innovations, and the impact on business. You will hear from a panel of researchers, engineers, data scientists, ethics experts, veteran product leaders and more! Tune in weekly to stay ahead of the AI wave. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 29: Scaling AI, agent-led future, and race to AGI | 15 Nov 2024 | 00:39:11 | |
Is 2024 the year scaling AI officially breaks? In Episode 29 of Mixture of Experts, host Tim Hwang is joined by Anthony Annunziata, Kate Soule and Naveen Rao. First, the experts discuss whether we are living in a post scale world. Next, we can’t have an episode without chatting AI agents, but what does the future hold for this technology? Finally, is AGI here to stay? Tune-in to this week’s Mixture of Experts to find out. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 20: Apple Intelligence, Reflection 70B, open-source AI agents, and LLM research ideas | 13 Sep 2024 | 00:38:29 | |
Can Apple Intelligence compete with the AI market offerings? In Episode 20 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Kate Soule and Maya Murad. Today, the experts chat Apple Intelligence, the performance of Reflection’s 70B, and a new paper released on LLMs generating novel research ideas. Additionally, IBM soft launched the Bee Agent Framework to help build agentic workflows with leading open-source and proprietary models. Tune-in to hear our expert panel break down this week’s AI news. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 19: NEO 1X robot, OpenAI chips, The AI Scientist, and the future of prompt engineering | 06 Sep 2024 | 00:37:58 | |
Will prompt engineering ever die? In Episode 19 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Kate Soule and Shobhit Varshney. Today, the experts chat the future of prompt engineering, a new paper released about The AI Scientist, NEO 1X’s humanoid robot, and OpenAI’s in-house AI chips. Will AI takeover scientific discovery? Will everyone have at home AI assistants? Why is OpenAI investing in chip production? Tune-in for our expert’s takes. 0:00 - Intro 1:17 - Future of Prompt Engineering 11:18 - NEO 1X Robot 21:56 - AI for Scientific Discovery 31:48 - OpenAI's in-house Chips The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 18: Cursor hype, Perplexity introduces ads, and AI at the US Open | 30 Aug 2024 | 00:40:29 | |
Is search less trustworthy? In Episode 18 of Mixture of Experts, host Tim Hwang is joined by the IBM Fellows—Aaron Baughman, Kush Varshney, and Trent Gray-Donald. Today, the experts chat how AI is being integrated at the US Open. Next, the Perplexity is introducing ads in Q4, what is the affect on search? Finally, what's all the hype with Cursor? Tune-in to today’s episode for all this and more. 0:01 - Intro 00:59 - AI at the US Open 13:35 - Paid search in Perplexity 24:12 - Cursor hype! 35:53 - IBM Fellows The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 17: Agent Q, no AI in art, and AMD acquires ZT Systems | 23 Aug 2024 | 00:46:56 | |
What’s new with AI agents? In Episode 17 of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Skyler Speakman, and Volkmar Uhlig. Today, the experts chat Agent Q and the improvements in reasoning and planning. Next, the CEO of Procreate came out with a statement that there will be no gen AI integrated into their products—can art avoid the AI wave? Finally, AMD acquired ZT Systems, can they now compete with NVIDIA? All this and more on today’s episode. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Segments: 0:01 — Intro 00:51 — Agent Q 14:21 — No AI in Art 29:12 — AMD Acquires ZT Systems | |||
| Episode 16: Cost of a Data Breach 2024 and OpenAI's Project Strawberry | 16 Aug 2024 | 00:22:56 | |
Is OpenAI about to release their biggest AI project? In Episode 16 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kate Soule, and Shobhit Varshney. Today, the experts chat IBM’s 2024 Cost of a Data Breach Report and analyze how gen AI could reduce the cost of cyber threats. Next, rumors are circulating the internet about OpenAI dropping “Project Strawberry,” what they internally reference as a “level 2” model. Are the rumors true? Tune-in for more. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Segments: 0:01 — Intro 00:52 — Cost of a Data Breach 2024 12:33— Project Strawberry | |||
| Episode 15: OpenAI Structured Outputs, character.ai “acquisition,” and is it an AI bubble? | 09 Aug 2024 | 00:32:08 | |
Is it an AI bubble? In Episode 15 of Mixture of Experts, host Tim Hwang is joined by our veteran panel: Marina Danilevsky, Kush Varshney, and Shobhit Varshney. Today, the experts chat the stock market crash and the involvement of AI companies. Then, OpenAI released Structured Outputs, and analyze how this can support enterprise implementation of AI. Finally, Google "acquires" character.ai, does this make any sense? Tune-in for the breakdown. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 0:01 — Intro 1:07 — AI Bubble? 11:49 — Structured Outputs 22:41 — character.ai "Acquisition" | |||
| Episode 14: SAM 2, friend.com and will gen AI projects be abandoned? | 02 Aug 2024 | 00:28:48 | |
Meta releases SAM 2! In Episode 14 of Mixture of Experts, host Tim Hwang is joined by Ambhi Ganesan, Kate Soule and Vagner Santana. Today, the experts chat the next generation of Meta’s Segment Anything Model (SAM). Then, another AI companion attempt via friend.com, we analyze if startups effectively compete in the AI hardware space. Finally, we get expert opinions on various topics: Will gen AI projects be abandoned? Which is bigger—9.11 or 9.9? Tune-in today to find out. 0:01 — Intro 1:00 — SAM 2 10:49 — Friend.com 20:53 — Abandoned gen AI projects 25:38 — Which is bigger—9.11 or 9.9? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 13: Meta's Llama 3.1, Mistral Large 2, and big interest in small models | 26 Jul 2024 | 00:20:20 | |
Meta strikes back with the launch of Llama 3.1! In Episode 13 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Shobhit Varshney, and Maryam Ashoori. Today, the experts analyze the business of AI in relation to the launch of Llama 3.1, including Llama 405B. Then, Mistral Large 2 sparks conversation about the open-source wave. Finally, the experts talk GPT 4o-mini and the model price war. Are little models having their moment? Tune-in to find out. 0:01 — Intro 1:33 — Llama 3.1 and Mistral Large 2 10:08 — Are Little Models Having a Moment? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 12: Goldman Sachs Gen AI report, Claude 2.0 Engineer, and RIAA lawsuits | 19 Jul 2024 | 00:31:15 | |
Will modern AI break the music industry? In Episode 12 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Marina Danilevsky, and Brent Smolinski. Today, we review Goldman Sachs’ report on investment in Gen AI, “too much spend, too little benefit.” Next, the experts break down Claude 2.0 Engineer and the future of coding agents. Finally, the Recording Industry Association of America (RIAA) files lawsuits against two generative music companies. Tune-in to hear our expert takes! 0:01 — Intro 1:48 — Goldman Sachs Gen AI Spend Report 12:30 — Claude Engineer 2.0 21:02 — RIAA vs. Suno / Udio The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 11: AI at Wimbledon, ChatGPT for coding, and scaling with AI personas | 12 Jul 2024 | 00:41:21 | |
It's Wimbledon finals week! In Episode 11 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Kaoutar El Maghraoui, and Skyler Speakman. Today, we review how AI is providing insights throughout one of the most prestigious tennis tournaments and the future of AI in sports. Next, the experts break down the quality of ChatGPT for coding. Finally, how did scaling synthetic data create one billion personas? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 28: SearchGPT, from Naptime to Big Sleep, and GitHub Octoverse updates | 08 Nov 2024 | 00:39:49 | |
Could AI wipe out software engineers? In Episode 28 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Kaoutar El Maghraoui, and Shobhit Varshney. First, the experts discuss GitHub reporting a rise of developers driven by AI code assistant tools. Next, Big Sleep finds a vulnerability in SQLite, what is the future for these kinds of AI agents? Finally, OpenAI released SearchGPT, what is the future of AI search? Tune-in today to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 10: AI Hardware: Training, Inference, Devices and Model Optimization | 03 Jul 2024 | 00:38:26 | |
In Episode 10 of Mixture of Experts we are talking all hardware all the time. Guest host Bryan Casey is joined by Volkmar Uhlig, Chris Hay, and Kaoutar El Maghraoui to explore the intricacies of AI hardware. Is Apple creating a pattern for the industry with their on device and cloud architecture? Tune in to hear the experts debate the details. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 9: Claude 3.5 Sonnet, BIRD-SQL, and the latest in AI Slop | 28 Jun 2024 | 00:39:03 | |
Is shrimp Jesus the best use case of AI content creation? In Episode 9 of Mixture of Experts, guest host Bryan Casey is joined by Shobhit Varshney, Marina Danilevsky, and Michael Glass. The experts analyze both the release Anthropic’s Claude 3.5 and BIRD-SQL. Finally, we talk the latest in AI slop and how it is affecting content creation. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 8: NVIDIA’s Nemotron-4 340B models, Safe Superintelligence Inc. and AI agents | 21 Jun 2024 | 00:43:47 | |
Is there a new major player in the AI space? In Episode 8 of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Kate Soule, and Maya Murad. First, the experts react to NVIDIA’s Nemotron-4 340B model launch and the future of LLM training. Next, new developments in enterprise agents create a great discussion around the reality of AI agents. Finally, we discuss the launch of a new AI company, Safe Superintelligence Inc. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Apple's WWDC24 reactions and mechanistic interpretability | 14 Jun 2024 | 00:39:41 | |
Is Apple late to the AI game? In Episode 7 of Mixture of Experts, host Tim Hwang is joined by Shobhit Varshney, Skyler Speakman, and Kaoutar El Maghraoui. Today, the experts react to Apple’s slew of AI announcements at WWDC24. Then, part 2 on interpretability this week, as OpenAI released their study mechanistic interpretability. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| AI safety, RAG benchmarking and responsible AI at ACM FAccT Conference | 07 Jun 2024 | 00:40:29 | |
What’s the future of AGI? In Episode 6 of Mixture of Experts, host Tim Hwang is joined by Vagner Figueredo de Santana, Marina Danilesky, and Shobhit Varshney. Today, the experts unpack Leopold Aschenbrenner’s AI safety screed, Situational Awareness. We also break down the state of responsible AI amid the annual ACM Fairness, Accountability, and Transparency (FAccT) Conference. Finally, we chat about RAG benchmarking and what it tells us about the industry as a whole. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Google’s AI Overviews, Golden Gate Claude, the "whale computer" and scaling laws | 31 May 2024 | 00:44:20 | |
How is the market reacting to Google's AI overviews? In Episode 5 of Mixture of Experts, Bryan Casey, our guest host, is joined by Kate Soule, Chris Hay, and Skyler Speakman. Today, our experts revisit a conversation from a previous episode around Google’s AI Overviews and the market reaction. Additionally, they break down Anthropic’s Golden Gate Claude. Finally, what is the “whale computer” and how does it relate to scaling laws? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Scarlett Johansson, FMTI and Think 2024 | 24 May 2024 | 00:38:26 | |
What’s going on between Scarlet Johansson and OpenAI? In Episode 4 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Marina Danilevsky, and Armand Ruiz. Kate explains the future of FMTI, Marina highlights innovations driving the open-source community, and Armand dives into the latest from IBM’s THINK 2024 event. Subscribe for AI updates:https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence: https://www.ibm.com/think/artificial-intelligence The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| GPT-4o, AI overviews and our multimodal future | 17 May 2024 | 00:40:58 | |
In Episode 3 of Mixture of Experts, host Tim Hwang is joined by Shobhit Varshney, Chris Hay, and Bryan Casey for the OpenAI vs. Google showdown. Shobhit analyzes the showcase demos released by OpenAI and Google. Chris breaks down latency and cost in relation to GPT-4 and Gemini 1.5 Flash. Finally, after years of people proclaiming the death of search, Bryan answers the big question: are LLMs forcing the death of Google search? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| The state of open source, InspectorRAGet, and what’s going on with Kolmogorov-Arnold Networks | 10 May 2024 | 00:46:14 | |
In Episode 2 of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Marina Danilevsky, and David Cox. This week, the three AI experts weigh in on the explosion of open source technology and identify how it will shape the market. Kush and Tim produce the single most easy explanation of what’s going on with Kolmogorov-Arnold Networks and why it matters. Finally, we kick it back to the 90s with Inspector RAGet! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Rabbit AI hiccups, GPT-2 chatbot, and OpenAI's licensing deal with the Financial Times | 03 May 2024 | 00:41:39 | |
In the inaugural episode of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Shobhit Varshney, and Chris Hay. The three AI experts debate the pros and cons of Rabbit’s R1 device. They also unpack GPT-2’s potential evolution and OpenAI’s licensing deal with the Financial Times. Finally, what do Sam Altman and Taylor Swift have in common? Join us to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 27: The future of agents, AI energy consumption, Anthropic's computer use, and Google watermarking AI | 01 Nov 2024 | 00:32:59 | |
Agents, agents, and more agents! In Episode 27 of Mixture of Experts, host Tim Hwang is joined by Volkmar Uhlig and Vyoma Gajjar. First, the experts chat about Mark Benioff’s spicy tweet, and what this means for the future of AI agents. Next, how much energy is needed to power AI models, and should we be concerned? Then, the experts debrief Anthropic’s release of computer use. Finally, Google is integrating SynthID-Text into Gemini to help watermark AI-generated text, do we need this feature? Learn more on today’s Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 26: Granite 3.0, NVIDIA’s Nemotron AI model, and Perplexity’s fundraising | 25 Oct 2024 | 00:37:18 | |
Can chat replace search? In Episode 26 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Petros Zerfos for IBM TechXchange week! First, the experts describe how the team created the Granite 3.0 models. Next, NVIDIA enters the open source model game, what does this mean for the competition? Finally, Perplexity AI is seeking over double their valuation in new funding rounds, what does this mean for start-ups? All that and more on today’s episode. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 25: Machines of Loving Grace, Entropix, AI and elections, GSM8K | 18 Oct 2024 | 00:41:17 | |
Can AI solve infectious disease? In Episode 25 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Maya Murad, and Ruben Boonen. Today we analyze some papers. First, the experts dissect Machines of Loving Grace, a 15,000 word essay written by Anthropic’s CEO making some major AI predictions. Then, Apple generated a new benchmark based of GSM8K in a recent paper, the findings were intriguing. Next, we talk Entropix, a sampler intending to replicate chain of thought features. Finally, OpenAI disclosed they are seeing an increase in AI models faking articles, what can we do to fix this? All this and more, on today’s Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 24: AI in the Nobels, DGX B200 arrival, and Unstructured’s $40M funding round | 11 Oct 2024 | 00:37:19 | |
Could AI win a Nobel Prize in the future? In Episode 24 of Mixture of Experts, host Tim Hwang is joined by Chris Hay and Edward Calvesbert. First, the experts debrief the ‘Godfather of AI’ sharing a Nobel Prize. Next, we talk AI platforms and the hype around DGX B200. Finally, unstructured data is becoming usable for LLMs, why are companies like NVIDIA so interested in this data? Tune-in today to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 23: NotebookLM, OpenAI DevDay, and will AI prevent phishing attacks? | 04 Oct 2024 | 00:39:15 | |
Will DeepDive replace the Mixture of Experts podcast? In Episode 23, host Tim Hwang is joined by IBM Researchers Marina Danilevsky, Nathalie Baracaldo and Vagner Santana to dissect this week’s AI news. First, the experts talk about the hype around Google’s NotebookLM, specifically regarding the DeepDive podcast feature. Next, OpenAI DevDay sparks some interesting conversation around vision fine-tuning and multimodality. Finally, it’s Cybersecurity Awareness Month and IBM X-Force released the Cloud Threat Landscape Report. Will AI prevent phishing attacks? Tune-in to this week’s episode to learn more! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 22: Llama 3.2, AI Snake Oil, and gen AI for sustainability | 27 Sep 2024 | 00:33:51 | |
Meta releases Llama 3.2! In Episode 22 of Mixture of Experts, host Tim Hwang is joined by Maryam Ashoori, Skyler Speakman, and Shobhit Varshney to debrief an exciting week of AI news. First, Meta is back with the release of Llama 3.2, and lightweight (1B/3B) models. Next, it’s Climate Week NYC, we chat the use of gen AI in achieving sustainable development goals. Specifically, IBM and NASA’s AI model for weather and climate. Finally, the book version of “AI Snake Oil” officially dropped and the authors claim they will be wrong in 2.5 years. What do our experts think? Tune-in today to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 21: OpenAI o1 preview, Agentforce, AI in fantasy football, and machine unlearning | 20 Sep 2024 | 00:47:45 | |
Strawberry is officially here! In Episode 21 of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Nathalie Baracaldo, and Aaron Baughman to chat about the hype around OpenAI’s o1 preview. Additionally, we cover AI agents again, with the launch of Agentforce. Next, Aaron—our resident AI in sports expert analyzes the AI powered insights for fantasy football. Finally, what is “machine unlearning,” and why should we care? All this and more, on today’s episode of Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 39: DeepSeek-R1, Mistral IPO, FrontierMath controversy, and IDC code assistant report | 24 Jan 2025 | 00:39:45 | |
What does the future hold for DeepSeek? In episode 39 of Mixture of Experts, join host Tim Hwang along with experts Abraham Daniels, Kaoutar El Maghraoui and Skyler Speakman to discuss the release of DeepSeek-R1. Next, Mistral indicates going IPO. Then, FrontierMath’s new benchmark is particularly difficult, the experts debrief. Finally, IDC released a report on code assistants, what do we need to know about generalist and specialized coding assistants? Tune-in to this week’s episode to find out. 00:01 – Intro 01:08 – DeepSeek-R1 14:08 – Mistral indicates IPO 20:54 – FrontierMath controversy 30:04 -- IDC code assistants report The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 38: Anthropic valuation rumors, Microsoft CoreAI, NotebookLM upgrades, and AI agents in finance | 17 Jan 2025 | 00:44:49 | |
What would you do with $2 billion? In episode 38 of Mixture of Experts, join host Tim Hwang along with experts Chris Hay, Kaoutar El Maghraoui and Vyoma Gajjar to discuss the Anthropic valuation rumors. Next, Microsoft CEO Nadella created a new CoreAI group to build and run apps for customers. Then, NotebookLM upgraded some of its features, including podcast intervention. Finally, AI agents are making their way into the financial services industry. Can an agent invest all of your money? Tune-in to this week’s episode to find out. 00:01 -- What would you do with $2 billion? 00:51 -- Anthropic valuation 12:14 -- Microsoft CoreAI 25:01 -- NotebookLM upgrades 35:17 -- AI agents in finance The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 37: CES 2025, NVIDIA DIGITS, Apple Intelligence fails, and Sam Altman’s reflections | 10 Jan 2025 | 00:35:38 | |
What’s the most exciting CES AI announcement? In episode 37 of Mixture of Experts, host Tim Hwang is joined by Skyler Speakman, Volkmar Uhlig and Shobhit Varshney to debrief CES 2025. Specifically, the experts dive into NVIDIA’S Project DIGITS, among other announcements from the AI hardware giant. Next, a new enterprise AI development survey came out that detailing how developers really feel about AI implementation. Then, Apple Intelligence experienced some major hallucination fails, what does this tell us about Apple’s stake in the AI game? Finally, Sam Altman of OpenAI released a reflection blog, what does he say about the future of AI? All that and more on today’s Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 36: OpenAI o3, DeepSeek-V3, and the Brundage/Marcus AI bet | 03 Jan 2025 | 00:39:19 | |
Is deep learning hitting a wall? It’s 2025 and Mixture of Experts is back and better than ever. In episode 36, host Tim Hwang is joined by Chris Hay, Kate Soule and Kush Varshney to debrief one of the biggest releases of 2024, OpenAI o3. Next, DeepSeek-V3 is here! Finally, will AI exist in 2027? The experts dissect the AI bet between Miles Brundage and Gary Marcus. All that and more on the first Mixture of Experts of 2025. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 00:00 — Intro 00:49 — OpenAI o3 14:40 — DeepSeek-V3 28:00 — The Brundage/Marcus bet | |||
| Episode 35: 2024 Rewind: Breakthroughs in AI models, agents, hardware and products | 27 Dec 2024 | 01:01:43 | |
Will 2025 be the year of AI agents? In Episode 35 of Mixture of Experts, host Tim Hwang is joined by some show veterans to debrief 2024 in AI. This week, we review AI models, agents, hardware and product releases with some of the top industry experts. What was the best model of 2024? Is NVIDIA king? What are some of the AI trends in 2025? All that and more on this special edition of Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 34: Granite 3.1, NVIDIA Jetson, stealing AI models, and is pre-training over? | 20 Dec 2024 | 00:40:30 | |
Is pre-training a thing of the past? In Episode 34 of Mixture of Experts, host Tim Hwang is joined by Abraham Daniels, Vagner Santana and Volkmar Uhlig to debrief this week in AI. First, OpenAI cofounder Ilya Sutskever said that “peak data” was achieved, does this mean there is no longer a need to model pre-training? Next, IBM released Granite 3.1 with a slew of features, we cover them all. Then, there is a new way to steal AI models, how do we protect against model exfiltration. Finally, can NVIDIA Jetson for AI developers really increase hardware accessibility? Tune-in for more! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 00:01 — Intro 00:49— Is pre-training over? 10:25 — Granite 3.1 22:23 — AI model stealing 33:38—NVIDIA Jetson | |||
| Episode 33: 12 Days of OpenAI, NeurIPS, ARC Prize, and Llama 3.3 70B | 13 Dec 2024 | 00:40:50 | |
Is o1 Pro worth the cost? In Episode 33 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Kate Soule and Vyoma Gajjar. First, the experts debrief the 12 Days of OpenAI. Next, we review some of the top papers in NeurIPS, how are the experts keeping up with all these research papers? Then, we are back with another benchmark, can ARC Prize make AGI more tractable? Finally, Meta announced the launch of Llama 3.3 70B with the promise of 405B performance, can we have our cake and eat it too? Find out more on today’s Mixture of Experts! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 32: Inside AWS re:Invent 2024, LLM Flowbreaking, and David Mayer | 06 Dec 2024 | 00:37:55 | |
What’s the mystery behind the name ChatGPT refuses to discuss? In Episode 32 of Mixture of Experts host Tim Hwang dives into the hottest topics shaping the AI landscape with an all-star panel: Aaron Baughman, Vagner Figueredo de Santana, and Shobhit Varshney. First, they disect the biggest announcements and takeaways from AWS re:Invent 2024, Amazon’s premier AI event. Next, they talk about overcoming architectural vulnerabilities in AI systems, and finally, they uncover the curious case of a name ChatGPT won’t discuss—and the questions this raises about privacy and transparency in AI. Get ready for an episode packed with insights, debates, and forward-thinking perspectives! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 31: AI in education: Safety, literacy, and predictions | 27 Nov 2024 | 00:36:31 | |
How much future learning will be done with an AI assistant? In Episode 31 of Mixture of Experts, host Tim Hwang is joined by Phaedra Boinodiris, Marina Danilevsky and Skyler Speakman for the AI in education special episode. First, the experts give an update on the state of AI within education. Next, we cover concerns around AI safety and literacy, what do students and teachers need to be aware of? Finally, the panel gives their predictions on what the future of education holds as it relates to AI. Tune-in to this special episode for an in-depth analysis! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 30: “Near-infinite memory,” Microsoft Ignite, FrontierMath, and AlphaFold3 | 22 Nov 2024 | 00:43:12 | |
Should your AI assistant remember everything about you? In Episode 30 of Mixture of Experts, host Tim Hwang is joined by Vagner Santana, Vyoma Gajjar and Shobhit Varshney. First, the experts breakdown claims of “near-infinite memory” within AI models. Next, Shobhit is fresh off the plane from Microsoft Ignite, he shares some of the exciting new announcements following the event. Then, a new benchmark has entered the chat, what do we know about FrontierMath? Finally, AlphaFold3 is now more open, why does this matter? Find out more on today’s episode! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 40: DeepSeek facts vs hype, model distillation, and open source competition | 31 Jan 2025 | 00:39:17 | |
Let’s bust some early myths about DeepSeek. In episode 40 of Mixture of Experts, join host Tim Hwang along with experts Aaron Baughman, Chris Hay and Kate Soule. Last week, we covered the release of DeepSeek-R1; now that the entire world is up to speed, let’s separate the facts from the hype. Next, what is model distillation and why does it matter for competition in AI? Finally, Sam Altman among other tech CEOs shared his response to DeepSeek. Will R1 radically change the open-source strategy of other tech giants? Find out all this and more on Mixture of Experts. 00:01 – Intro 00:41 – DeepSeek facts vs hype 21:00 – Model distillation 31:21 – Open source and OpenAI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 43: Deep Research, OpenAI inference chip, small VLMs, and AI agent job posting | 21 Feb 2025 | 00:45:51 | |
What is all the hype around Deep Research? In episode 43 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Volkmar Uhlig and Shobhit Varshney. This week, we discuss reasoning model features coming out of companies like OpenAI’s Deep Research, Google Gemini, Perplexity, xAI’s Grok-3 and more! Next, OpenAI is rumored to release an inference chip, but how likely is this to be a success in the AI chip game? Then, we analyze the capabilities of small vision-language models (VLMs). Finally, a startup, Firecrawl, released a job posting in search of an AI agent. Is this the future for AI tools in the workforce? Tune-in to today’s Mixture of Experts to find out. 00:01 – Intro 00:35 – Deep Research 11:58 – OpenAI inference chip 22:17 – Small VLMs 32:31 – AI agent job posting The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 42: Paris AI Summit, Altman's "Three Observations," and Anthropic's Economic Index | 14 Feb 2025 | 00:39:56 | |
Live from Paris, Tim Hwang is at the AI Action Summit 2025. In episode 42 of Mixture of Experts, we welcome Anastasia Stasenko, CEO and Co-Founder of pleias along with our veteran experts Marina Danilevsky and Chris Hay. Last week, we touched on some potential conversations at the Paris AI Summit, this week we recap what actually happened. Is AI safety improving Globally? Next, for our paper of the week, we breakdown s1: Simple test-time scaling. Then, Sam Altman is back with another blog, “Three Observations,” what do our experts have to say? Finally, what can we learn from Anthropic’s Economic Index? All that and more on today’s Mixture of Experts. 00:01 – Intro 00:42 – Paris AI Summit 11:10 – s1: Simple test-time scaling 19:32 – Sam Altman’s “Three Observations” 30:41 – Anthropic’s Economic Index The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Resources: Read the paper about s1: Simple test-time scaling: https://arxiv.org/abs/2501.19393 Read Sam Altman's "Three Observations": https://blog.samaltman.com/three-observations Read Anthropic's Economic Index: https://www.anthropic.com/economic-index Read more about AGI: https://www.ibm.com/think/topics/artificial-general-intelligence | |||
| Episode 41: OpenAI deep research, o3-mini, AI Action Summit, and Anthropic’s Constitutional Classifiers | 07 Feb 2025 | 00:38:08 | |
What does Sam Altman have up his sleeve? In episode 41 of Mixture of Experts, join host Tim Hwang along with experts Nathalie Baracaldo, Marina Danilevsky and Chris Hay. Last week, we covered all things DeepSeek, and this week OpenAI has some new releases to share. Today, the experts dissect deep research and o3-mini. Next, our host Tim Hwang is travelling to AI Action Summit, he asks our experts what we can expect coming out of the event. Then, we talk about Anthropic’s Constitutional Classifiers. Finally, Microsoft is creating a unit to study AI’s impact, what does this mean? Find out all this and more on Mixture of Experts. 00:01 – intro 00:41 – Open AI deep research and o3-mini 13:51 – AI Action Summit 20:17 – Anthropic’s Constitutional Classifiers 28:54 – Microsoft AI Impact team The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.
| |||
| Bonus: OpenAI GPT-4.5: And the future of pre-training is... | 01 Mar 2025 | 00:24:07 | |
Is pre-training dead? In this bonus episode of Mixture of Experts, guest host Bryan Casey is joined by Kate Soule and Chris Hay. On Thursday, Sam Altman dropped GPT-4.5 just after we wrapped our weekly recording. We got a few of our veteran experts on the podcast to analyze OpenAI’s largest and “best” chat model yet. What’s the hype? Tune-in to this bonus episode to find out! 00:01 – Intro 00:25 – GPT-4.5 The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Episode 44: Claude 3.7 Sonnet, BeeAI agents, Granite 3.2, and emergent misalignment | 28 Feb 2025 | 00:39:45 | |
Granite 3.2 is officially here! In episode 44 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Maya Murad and Kaoutar El Maghraoui to debrief a few big AI announcements. Last week we covered small vision-language models (VLMs), and this week Granite 3.2 dropped with new VLMs, enhanced reasoning capabilities, and more! Kate takes us under the hood to understand the new features and how they were created. Next, Anthropic dropped a new intelligence model, Claude 3.7 Sonnet, and a new agentic coding tool, Claude Code. Why did Anthropic release these separately? Then, as we cannot have an episode without covering agents, Maya takes us through the new BeeAI agents! Finally, can fine tuning on a malicious task lead to much broader misalignment? Our experts analyze a new paper released on ‘Emergent misalignment.’ All that and more on this week's episode! 00:01 – Intro 00:41 – Claude 3.7 Sonnet 11:58 – BeeAI agents 20:11– Granite 3.2 29:23 – Emergent misalignment The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| Quantum leap, Model Context Protocol, CoreWeave IPO and an AI voice companion | 07 Mar 2025 | 00:45:25 | |
When can we expect quantum to reach consumer devices? In episode 45 of Mixture of Experts, host Tim Hwang is joined by special guest, Blake Johnson, to debrief the quantum noise in the news. Blake helps us understand the intersection between quantum and AI and how far we are from this technology. Then, veteran experts Chris Hay and Volkmar Uhlig hash out some other news in AI this week. We cover Anthropic’s Model Context Protocol, CoreWeave filing for an IPO and Sesame AI’s new voice companion. All that and more on today’s Mixture of Experts! 00:01 – Intro 01:06 – Quantum leap 20:08 -- Model Context Protocol 28:24 -- CoreWeave IPO 40:12 -- Sesame AI voice companion The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| OpenAI goes open, Anthropic on interpretability, Apple Intelligence updates and Amazon AI agents | 04 Apr 2025 | 00:43:25 | |
Will OpenAI be fully open source by 2027? In episode 49 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Ash Minhas and Chris Hay to analyze Sam Altman’s latest move towards open source. Next, we explore Anthropic's mechanistic interpretability results and the progress the AI research community is making. Then, can Apple catch up? We analyze the latest critiques on Apple Intelligence. Finally, Amazon enters the chat with AI agents. How does this elevate the competition? All that and more on today’s Mixture of Experts. 00:01 -- Introduction 00:48 -- OpenAI goes open 11:36 -- Anthropic interpretability results 24:55 -- Daring Fireball on Apple Intelligence 34:22 -- Amazon’s AI agents The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates: https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to learn more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts | |||
| DeepSeek-V3-0324, Gemini Canvas and GPT-4o image generation | 28 Mar 2025 | 00:41:43 | |
What’s the best open-source model? In episode 48 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Skyler Speakman to explore the future of open-source AI models. First, we chat about the release of DeepSeek-V3-0324. Then, more announcements coming out of Google including Gemini Canvas and Gemini 2.5. Next, Extropic has entered the chat with a thermodynamic chip. Finally, AI image generation is on the rise as OpenAI released GPT-4o image generation. All that, and more on today’s Mixture of Experts. 00:01 – Intro 00:42– DeepSeek-V3-0324 09:48 – Gemini 2.5 and Canvas 21:27– Extropic’s thermodynamic chip 30:20 – OpenAI image generation The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||
| NVIDIA GTC, Baidu reasoning models, and Gemini AI image generation | 21 Mar 2025 | 00:39:16 | |
What’s the most exciting announcement coming out of NVIDIA GTC? In episode 47 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kaoutar El Maghraoui and Vyoma Gajjar. First, we dive into the latest announcements from NVIDIA GTC, including the Groot N1 model for humanoid robotics. Next, Baidu released some new AI reasoning models, and they’re not open source? Then, for our paper of the week we discuss the flaws of Chain-of-Thought reasoning. Finally, Gemini Flash 2.0 has released image generation models for developer experimentation., Iis Google catching up on the AI game? Tune -in to today’s Mixture of Experts to find out!
00:01 – Intro 01:27– NVIDIA GTC 14:18– New Baidu AI models 21:19– Chain-of-Thought reasoning 32:18 – Gemini image generation
The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. | |||