Explore every episode of the podcast Google AI: Release Notes
| Title | Pub. Date | Duration | |
|---|---|---|---|
| Launching Gemini 2.5 | 28 Mar 2025 | 00:27:55 | |
Tulsee Doshi, Head of Product for Gemini Models joins host Logan Kilpatrick for an in-depth discussion on the latest Gemini 2.5 Pro experimental launch. Gemini 2.5 is a well-rounded, multimodal thinking model, designed to tackle increasingly complex problems. From enhanced reasoning to advanced coding, Gemini 2.5 can create impressive web applications and agentic code applications. Learn about the process of building Gemini 2.5 Pro experimental, the improvements made across the stack, and what’s next for Gemini 2.5.
Chapters: 0:00 - Introduction
Resources:
| |||
| Gemini app: Canvas, Deep Research and Personalization | 20 Mar 2025 | 00:36:53 | |
Dave Citron, Senior Director Product Management, joins host Logan Kilpatrick for an in-depth discussion on the latest Gemini updates and demos. Learn more about Canvas for collaborative content creation, enhanced Deep Research with Thinking Models and Audio Overview and a new personalization feature. 0:00 - Introduction | |||
| Developing Google DeepMind's Thinking Models | 24 Feb 2025 | 01:03:32 | |
Jack Rae, Principal Scientist at Google DeepMind, joins host Logan Kilpatrick for an in-depth discussion on the development of Google’s thinking models. Learn more about practical applications of thinking models, the impact of increased 'thinking time' on model performance and the key role of long context. 01:14 - Defining Thinking Models | |||
| Behind the Scenes of Gemini 2.0 | 11 Dec 2024 | 00:35:18 | |
Tulsee Doshi, Gemini model product lead, joins host Logan Kilpatrick to go behind the scenes of Gemini 2.0, taking a deep dive into the model's multimodal capabilities and native tool use, and Google's approach to shipping experimental models.
Watch on YouTube: https://www.youtube.com/watch?v=L7dw799vu5o
Chapters:
Meet Tulsee Doshi
Gemini's Progress Over the Past Year
Introducing Gemini 2.0
Shipping Experimental Models
Gemini 2.0’s Native Tool Use
Function Calling
Multimodal Agents
Rapid Fire Questions
| |||
| Smaller, Faster, Cheaper & The Story of Flash 8B | 05 Dec 2024 | 00:43:20 | |
Logan Kilpatrick sits down with Emanuel Taropa, a key figure in the development of Gemini to delve into the cutting edge of AI. Taropa provides insights into the technical challenges and triumphs of building and deploying large language models, focusing on the recent release of the Flash 8B Gemini model.
Their conversation covers everything from the intricacies of model architecture and training to the practical challenges of shipping AI models at scale, and even speculates on the future of AI.
| |||
| Deep Dive into Long Context | 02 May 2025 | 00:59:32 | |
Explore the synergy between long context models and Retrieval Augmented Generation (RAG) in this episode of Release Notes. Join Google DeepMind's Nikolay Savinov as he discusses the importance of large context windows, how they enable Al agents, and what's next in the field. Chapters: | |||
| Google I/O 2025 Recap with Josh Woodward and Tulsee Doshi | 22 May 2025 | 00:40:15 | |
Learn more
Chapters
| |||
| Building Gemini's Coding Capabilities | 16 Jun 2025 | 01:00:27 | |
Connie Fan, Product Lead for Gemini's coding capabilities, and Danny Tarlow, Research Lead for Gemini's coding capabilities, join host Logan Kilpatrick for an in-depth discussion on how the team built one of the world's leading AI coding models. Learn more about the early goals that shaped Gemini's approach to code, the rise of 'vibe coding' and its impact on development, strategies for tackling large codebases with long context and agents, and the future of programming languages in the age of AI. Watch on YouTube: https://www.youtube.com/watch?v=jwbG_m-X-gE Chapters: 0:00 - Intro
| |||
| Sergey Brin on the Future of AI & Gemini | 16 Jun 2025 | 00:27:19 | |
A conversation with Sergey Brin, co-founder of Google and computer scientist working on Gemini, in reaction to a year of progress with Gemini. Watch on YouTube: https://www.youtube.com/watch?v=o7U4DV9Fkc0 0:20 - Initial reactions to I/O
| |||
| Gemini's Multimodality | 02 Jul 2025 | 00:44:17 | |
Ani Baddepudi, Gemini Model Behavior Product Lead, joins host Logan Kilpatrick for a deep dive into Gemini's multimodal capabilities. Their conversation explores why Gemini was built as a natively multimodal model from day one, the future of proactive AI assistants, and how we are moving towards a world where "everything is vision." Learn about the differences between video and image understanding and token representations, higher FPS video sampling, and more.
Chapters: 0:00 - Intro
| |||
| Demis Hassabis on shipping momentum, better evals and world models | 11 Aug 2025 | 00:31:09 | |
Demis Hassabis, CEO of Google DeepMind, sits down with host Logan Kilpatrick. In this episode, learn about the evolution from game-playing AI to today's thinking models, how projects like Genie 3 are building world models to help AI understand reality and why new testing grounds like Kaggle’s Game Arena are needed to evaluate progress on the path to AGI. Watch on YouTube: https://www.youtube.com/watch?v=njDochQ2zHs
| |||
| Building real-time voice applications with Live API | 06 Aug 2025 | 00:40:14 | |
Shrestha Basu Mallick, one of the product leads for the Gemini API, joins host Logan Kilpatrick for a deep dive of Gemini Live API, Google’s real-time, multimodal interface for developers. Learn about how native audio alongside new capabilities like proactive audio and async function calling unlocks the unique power of audio as an interface. Watch on YouTube: https://www.youtube.com/watch?v=4xlwlU6h-wM
| |||
| Building a frontier AI search experience | 23 Jul 2025 | 00:43:16 | |
Robby Stein, VP of Product for Google Search, joins host Logan Kilpatrick to explore how Search is evolving into a frontier AI product. Their conversation covers the shift from simple keywords to complex, conversational queries, the rise of agentic capabilities that can take action on your behalf, and the vision to help billions of users truly "ask anything." Learn more about the technology behind AI Overviews, AI Mode, Deep Search, and the future of multimodal interaction. Chapters
| |||
| Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy | 26 Nov 2025 | 00:27:34 | |
Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Google’s long-term bets on infrastructure, what it’s actually like to ship SOTA models, and the rise of vibe coding. Sundar also shares his personal launch day rituals and thoughts on future moonshots like putting data centers in space. Watch on YouTube: https://www.youtube.com/watch?v=iFqDyWFuw1c | |||
| Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model | 26 Nov 2025 | 00:36:24 | |
Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advanced visual reasoning and multi-turn generation capabilities, and how this next-gen tool enables complex image edits and real-world applications. In this episode, we discuss how user feedback and continuous benchmarking drive model improvements, ensuring a superior experience for developers. Watch on YouTube: https://www.youtube.com/watch?v=hk6gwiZmSWA | |||
| Koray Kavukcuoglu: “This Is How We Are Going to Build AGI” | 25 Nov 2025 | 00:48:44 | |
Join Logan Kilpatrick and Koray Kavukcuoglu, CTO of Google DeepMind and Chief AI Architect of Google, as they discuss Gemini 3 and the state of AI! Their conversation includes the reception of Gemini 3, the ongoing advancements in AI research, and the role of benchmarks in pushing new frontiers. They explore critical areas for Gemini's focus, emphasizing instruction following, tool calls, and internationalization, alongside Google's collaborative approach to AI development. Watch on YouTube: https://www.youtube.com/watch?v=fXtna7UrL44 Chapters: | |||
| Google Antigravity: Hands on with our new agentic development platform | 25 Nov 2025 | 00:44:49 | |
Explore Antigravity, Google DeepMind’s innovative new AI developer coding product, with Varun Mohan on Release Notes. This episode dives into Antigravity as a powerful agent development platform, integrating a familiar IDE experience with browser verification and Gemini 3.0 capabilities. Discover how developers can orchestrate complex agentic workflows, leverage artifacts for task communication, and balance AI automation with human collaboration. Learn about the philosophy behind building next-gen agentic experiences, the platform's multimodal strengths, and its role in accelerating software development at scale. Watch on YouTube: https://www.youtube.com/watch?v=uzFOhkORVfk Chapters
| |||
| Gemini 3: Launch day reactions | 25 Nov 2025 | 00:42:16 | |
Join us for a special episode of Release Notes as we unpack Gemini 3, Google’s latest AI model with key team members. Learn how Gemini 3 empowers developers with enhanced multimodal understanding, agentic capabilities for complex tasks, and generative interfaces that transform prompts into interactive applications. We discuss real-world use cases, the iterative development process driven by user feedback, and the strategic balance between model performance and broad accessibility across various Google platforms. Watch on YouTube: https://www.youtube.com/watch?v=mci0f2dy7G0 Chapters: | |||
| How a Moonshot Led to Google DeepMind's Veo 3 | 16 Oct 2025 | 00:48:10 | |
Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence and how user feedback is shaping the future of AI-powered video creation.
| |||
| GDM’s Pushmeet Kohli on solving science's biggest challenges with AI | 15 Sep 2025 | 00:37:28 | |
Pushmeet Kohli, Head of Science and Strategic Initiatives at Google DeepMind, joins host Logan Kilpatrick to explore the intersection of AI and scientific discovery. Learn how the team's unique problem-solving framework led to innovations like AlphaFold and AlphaEvolve, and how new tools like AI Co-scientist aim to democratize these types of breakthroughs for everyone. Watch on YouTube: https://www.youtube.com/watch?v=o7mdsL6BHsk Chapters: | |||
| Behind the scenes of Google's state-of-the-art "nano-banana" image model | 27 Aug 2025 | 00:30:32 | |
Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi. | |||