Back
Explore every episode of the podcast Vanishing Gradients
Dive into the complete episode list for Vanishing Gradients. Each episode is cataloged with detailed descriptions, making it easy to find and explore specific topics. Keep track of all episodes from your favorite podcast and never miss a moment of insightful content.
| Title | Pub. Date | Duration | |
|---|---|---|---|
| Episode 1: Introducing Vanishing Gradients | 16 Feb 2022 | 00:05:29 | |
In this brief introduction, Hugo introduces the rationale behind launching a new data science podcast and gets excited about his upcoming guests: Jeremy Howard, Rachael Tatman, and Heather Nolis!
Original music, bleeps, and blops by local Sydney legend PlaneFace (https://planeface.bandcamp.com/album/fishing-from-an-asteroid)!
| |||
| Episode 39: From Models to Products: Bridging Research and Practice in Generative AI at Google Labs | 25 Nov 2024 | 01:43:28 | |
Hugo speaks with Ravin Kumar,*Senior Research Data Scientist at Google Labs. Ravinās career has taken him from building rockets at SpaceX to driving data science and technology at Sweetgreen, and now to advancing generative AI research and applications at Google Labs and DeepMind. His multidisciplinary experience gives him a rare perspective on building AI systems that combine technical rigor with practical utility.
In this episode, we dive into:
⢠Ravinās fascinating career path, including the skills and mindsets needed to work effectively with AI and machine learning models at different stages of the pipeline.
⢠How to build generative AI systems that are scalable, reliable, and aligned with user needs.
⢠Real-world applications of generative AI, such as using open weight models such as Gemma to help a bakery streamline operationsāan example of delivering tangible business value through AI.
⢠The critical role of UX in AI adoption, and how Ravin approaches designing tools like Notebook LM with the user journey in mind.
We also include a live demo where Ravin uses Notebook LM to analyze my website, extract insights, and even generate a podcast-style conversation about me. While some of the demo is visual, much can be appreciated through audio, and weāve added a link to the video in the show notes for those who want to see it in action. Weāve also included the generated segment at the end of the episode for you to enjoy.
LINKS
The livestream on YouTube (https://www.youtube.com/live/ffS6NWqoo_k)
Google Labs (https://labs.google/)
Ravin's GenAI Handbook (https://ravinkumar.com/GenAiGuidebook/book_intro.html)
Breadboard: A library for prototyping generative AI applications (https://breadboard-ai.github.io/breadboard/)
As mentioned in the episode, Hugo is teaching a four-week course, Building LLM Applications for Data Scientists and SWEs, co-led with Stefan Krawczyk (Dagworks, ex-StitchFix). The course focuses on building scalable, production-grade generative AI systems, with hands-on sessions, $1,000+ in cloud credits, live Q&As, and guest lectures from industry experts.
Listeners of Vanishing Gradients can get 25% off the course using this special link (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=VG25) or by applying the code VG25 at checkout.
| |||
| Episode 30: Lessons from a Year of Building with LLMs (Part 2) | 26 Jun 2024 | 01:15:23 | |
Hugo speaks about Lessons Learned from a Year of Building with LLMs with Eugene Yan from Amazon, Bryan Bischof from Hex, Charles Frye from Modal, Hamel Husain from Parlance Labs, and Shreya Shankar from UC Berkeley.
These five guests, along with Jason Liu who couldn't join us, have spent the past year building real-world applications with Large Language Models (LLMs). They've distilled their experiences into a report of 42 lessons across operational, strategic, and tactical dimensions (https://applied-llms.org/), and they're here to share their insights.
Weāve split this roundtable into 2 episodes and, in this second episode, we'll explore:
An inside look at building end-to-end systems with LLMs;
The experimentation mindset: Why it's the key to successful AI products;
Building trust in AI: Strategies for getting stakeholders on board;
The art of data examination: Why looking at your data is more crucial than ever;
Evaluation strategies that separate the pros from the amateurs.
Although we're focusing on LLMs, many of these insights apply broadly to data science, machine learning, and product development, more generally.
LINKS
The livestream on YouTube (https://www.youtube.com/live/c0gcsprsFig)
The Report: What Weāve Learned From A Year of Building with LLMs (https://applied-llms.org/)
About the Guests/Authors (https://applied-llms.org/about.html) <-- connect with them all on LinkedIn, follow them on Twitter, subscribe to their newsletters! (Seriously, though, the amount of collective wisdom here is š¤
Your AI product needs evals by Hamel Husain (https://hamel.dev/blog/posts/evals/)
Prompting Fundamentals and How to Apply them Effectively by Eugene Yan (https://eugeneyan.com/writing/prompting/)
Fuck You, Show Me The Prompt by Hamel Husain (https://hamel.dev/blog/posts/prompt/)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Vanishing Gradients on Twitter (https://x.com/vanishingdata)
Vanishing Gradients on Lu.ma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
| |||
| Episode 29: Lessons from a Year of Building with LLMs (Part 1) | 26 Jun 2024 | 01:30:21 | |
Hugo speaks about Lessons Learned from a Year of Building with LLMs with Eugene Yan from Amazon, Bryan Bischof from Hex, Charles Frye from Modal, Hamel Husain from Parlance Labs, and Shreya Shankar from UC Berkeley.
These five guests, along with Jason Liu who couldn't join us, have spent the past year building real-world applications with Large Language Models (LLMs). They've distilled their experiences into a report of 42 lessons across operational, strategic, and tactical dimensions (https://applied-llms.org/), and they're here to share their insights.
Weāve split this roundtable into 2 episodes and, in this first episode, we'll explore:
The critical role of evaluation and monitoring in LLM applications and why they're non-negotiable, including "evals" - short for evaluations, which are automated tests for assessing LLM performance and output quality;
Why data literacy is your secret weapon in the AI landscape;
The fine-tuning dilemma: when to do it and when to skip it;
Real-world lessons from building LLM applications that textbooks won't teach you;
The evolving role of data scientists and AI engineers in the age of AI.
Although we're focusing on LLMs, many of these insights apply broadly to data science, machine learning, and product development, more generally.
LINKS
The livestream on YouTube (https://www.youtube.com/live/c0gcsprsFig)
The Report: What Weāve Learned From A Year of Building with LLMs (https://applied-llms.org/)
About the Guests/Authors (https://applied-llms.org/about.html) <-- connect with them all on LinkedIn, follow them on Twitter, subscribe to their newsletters! (Seriously, though, the amount of collective wisdom here is š¤
Your AI product needs evals by Hamel Husain (https://hamel.dev/blog/posts/evals/)
Prompting Fundamentals and How to Apply them Effectively by Eugene Yan (https://eugeneyan.com/writing/prompting/)
Fuck You, Show Me The Prompt by Hamel Husain (https://hamel.dev/blog/posts/prompt/)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Vanishing Gradients on Twitter (https://x.com/vanishingdata)
Vanishing Gradients on Lu.ma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
| |||
| Episode 28: Beyond Supervised Learning: The Rise of In-Context Learning with LLMs | 09 Jun 2024 | 01:05:38 | |
Hugo speaks with Alan Nichol, co-founder and CTO of Rasa, where they build software to enable developers to create enterprise-grade conversational AI and chatbot systems across industries like telcos, healthcare, fintech, and government.
What's super cool is that Alan and the Rasa team have been doing this type of thing for over a decade, giving them a wealth of wisdom on how to effectively incorporate LLMs into chatbots - and how not to. For example, if you want a chatbot that takes specific and important actions like transferring money, do you want to fully entrust the conversation to one big LLM like ChatGPT, or secure what the LLMs can do inside key business logic?
In this episode, they also dive into the history of conversational AI and explore how the advent of LLMs is reshaping the field. Alan shares his perspective on how supervised learning has failed us in some ways and discusses what he sees as the most overrated and underrated aspects of LLMs.
Alan offers advice for those looking to work with LLMs and conversational AI, emphasizing the importance of not sleeping on proven techniques and looking beyond the latest hype. In a live demo, he showcases Rasa's Calm (Conversational AI with Language Models), which allows developers to define business logic declaratively and separate it from the LLM, enabling reliable execution of conversational flows.
LINKS
The livestream on YouTube (https://www.youtube.com/live/kMFBYC2pB30?si=yV5sGq1iuC47LBSi)
Alan's Rasa CALM Demo: Building Conversational AI with LLMs (https://youtu.be/4UnxaJ-GcT0?si=6uLY3GD5DkOmWiBW)
Alan on twitter.com (https://x.com/alanmnichol)
Rasa (https://rasa.com/)
CALM, an LLM-native approach to building reliable conversational AI (https://rasa.com/docs/rasa-pro/calm/)
Task-Oriented Dialogue with In-Context Learning (https://arxiv.org/abs/2402.12234)
'We donāt know how to build conversational software yet' by Alan Nicol (https://medium.com/rasa-blog/we-don-t-know-how-to-build-conversational-software-yet-a18301db0e4b)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Upcoming Livestreams
Lessons from a Year of Building with LLMs (https://lu.ma/e8huz3s6?utm_source=vgan)
VALIDATING THE VALIDATORS with Shreya Shanker (https://lu.ma/zz3qic45?utm_source=vgan)
| |||
| Episode 27: How to Build Terrible AI Systems | 31 May 2024 | 01:32:24 | |
Hugo speaks with Jason Liu, an independent consultant who uses his expertise in recommendation systems to help fast-growing startups build out their RAG applications. He was previously at Meta and Stitch Fix is also the creator of Instructor, Flight, and an ML and data science educator.
They talk about how Jason approaches consulting companies across many industries, including construction and sales, in building production LLM apps, his playbook for getting ML and AI up and running to build and maintain such apps, and the future of tooling to do so.
They take an inverted thinking approach, envisaging all the failure modes that would result in building terrible AI systems, and then figure out how to avoid such pitfalls.
LINKS
The livestream on YouTube (https://youtube.com/live/USTG6sQlB6s?feature=share)
Jason's website (https://jxnl.co/)
PyDdantic is all you need, Jason's Keynote at AI Engineer Summit, 2023 (https://youtu.be/yj-wSRJwrrc?si=JIGhN0mx0i50dUR9)
How to build a terrible RAG system by Jason (https://jxnl.co/writing/2024/01/07/inverted-thinking-rag/)
To express interest in Jason's Systematically improving RAG Applications course (https://q7gjsgfstrp.typeform.com/ragcourse?typeform-source=vg)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Upcoming Livestreams
Good Riddance to Supervised Learning with Alan Nichol (CTO and co-founder, Rasa) (https://lu.ma/gphzzyyn?utm_source=vgj)
Lessons from a Year of Building with LLMs (https://lu.ma/e8huz3s6?utm_source=vgj)
| |||
| Episode 26: Developing and Training LLMs From Scratch | 15 May 2024 | 01:51:35 | |
Hugo speaks with Sebastian Raschka, a machine learning & AI researcher, programmer, and author. As Staff Research Engineer at Lightning AI, he focuses on the intersection of AI research, software development, and large language models (LLMs).
How do you build LLMs? How can you use them, both in prototype and production settings? What are the building blocks you need to know about?
āIn this episode, weāll tell you everything you need to know about LLMs, but were too afraid to ask: from covering the entire LLM lifecycle, what type of skills you need to work with them, what type of resources and hardware, prompt engineering vs fine-tuning vs RAG, how to build an LLM from scratch, and much more.
The idea here is not that youāll need to use an LLM youāve built from scratch, but that weāll learn a lot about LLMs and how to use them in the process.
Near the end we also did some live coding to fine-tune GPT-2 in order to create a spam classifier!
LINKS
The livestream on YouTube (https://youtube.com/live/qL4JY6Y5pmA)
Sebastian's website (https://sebastianraschka.com/)
Machine Learning Q and AI: 30 Essential Questions and Answers on Machine Learning and AI by Sebastian (https://nostarch.com/machine-learning-q-and-ai)
Build a Large Language Model (From Scratch) by Sebastian (https://www.manning.com/books/build-a-large-language-model-from-scratch)
PyTorch Lightning (https://lightning.ai/docs/pytorch/stable/)
Lightning Fabric (https://lightning.ai/docs/fabric/stable/)
LitGPT (https://github.com/Lightning-AI/litgpt)
Sebastian's notebook for finetuning GPT-2 for spam classification! (https://github.com/rasbt/LLMs-from-scratch/blob/main/ch06/01_main-chapter-code/ch06.ipynb)
The end of fine-tuning: Jeremy Howard on the Latent Space Podcast (https://www.latent.space/p/fastai)
Our next livestream: How to Build Terrible AI Systems with Jason Liu (https://lu.ma/terrible-ai-systems?utm_source=vg)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
| |||
| Episode 25: Fully Reproducible ML & AI Workflows | 18 Mar 2024 | 01:20:38 | |
Hugo speaks with Omoju Miller, a machine learning guru and founder and CEO of Fimio, where she is building 21st century dev tooling. In the past, she was Technical Advisor to the CEO at GitHub, spent time co-leading non-profit investment in Computer Science Education for Google, and served as a volunteer advisor to the Obama administrationās White House Presidential Innovation Fellows.
We need open tools, open data, provenance, and the ability to build fully reproducible, transparent machine learning workflows. With the advent of closed-source, vendor-based APIs and compute becoming a form of gate-keeping, developer tools are at the risk of becoming commoditized and developers becoming consumers.
Weāll talk about how ideas for escaping these burgeoning walled gardens. Weāll dive into
What fully reproducible ML workflows would look like, including git for the workflow build process,
The need for loosely coupled and composable tools that embrace a UNIX-like philosophy,
What a much more scientific toolchain would look like,
What a future open sources commons for Generative AI could look like,
What an open compute ecosystem could look like,
How to create LLMs and tooling so everyone can use them to build production-ready apps,
And much more!
LINKS
The livestream on YouTube (https://www.youtube.com/live/n81PWNsHSMk?si=pgX2hH5xADATdJMu)
Omoju on Twitter (https://twitter.com/omojumiller)
Hugo on Twitter (https://twitter.com/hugobowne)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Lu.ma Calendar that includes details of Hugo's European Tour for Outerbounds (https://lu.ma/Outerbounds)
Blog post that includes details of Hugo's European Tour for Outerbounds (https://outerbounds.com/blog/ob-on-the-road-2024-h1/)
| |||
| Episode 24: LLM and GenAI Accessibility | 27 Feb 2024 | 01:30:03 | |
Hugo speaks with Johno Whitaker, a Data Scientist/AI Researcher doing R&D with answer.ai. His current focus is on generative AI, flitting between different modalities. He also likes teaching and making courses, having worked with both Hugging Face and fast.ai in these capacities.
Johno recently reminded Hugo how hard everything was 10 years ago: āWant to install TensorFlow? Good luck. Need data? Perhaps try ImageNet. But now you can use big models from Hugging Face with hi-res satellite data and do all of this in a Colab notebook. Or think ecology and vision models⦠or medicine and multimodal models!ā
We talk about where weāve come from regarding tooling and accessibility for foundation models, ML, and AI, where we are, and where weāre going. Weāll delve into
What the Generative AI mindset is, in terms of using atomic building blocks, and how it evolved from both the data science and ML mindsets;
How fast.ai democratized access to deep learning, what successes they had, and what was learned;
The moving parts now required to make GenAI and ML as accessible as possible;
The importance of focusing on UX and the application in the world of generative AI and foundation models;
The skillset and toolkit needed to be an LLM and AI guru;
What theyāre up to at answer.ai to democratize LLMs and foundation models.
LINKS
The livestream on YouTube (https://youtube.com/live/hxZX6fBi-W8?feature=share)
Zindi, the largest professional network for data scientists in Africa (https://zindi.africa/)
A new old kind of R&D lab: Announcing Answer.AI (http://www.answer.ai/posts/2023-12-12-launch.html)
Why and how Iām shifting focus to LLMs by Johno Whitaker (https://johnowhitaker.dev/dsc/2023-07-01-why-and-how-im-shifting-focus-to-llms.html)
Applying AI to Immune Cell Networks by Rachel Thomas (https://www.fast.ai/posts/2024-01-23-cytokines/)
Replicate -- a cool place to explore GenAI models, among other things (https://replicate.com/explore)
Hands-On Generative AI with Transformers and Diffusion Models (https://www.oreilly.com/library/view/hands-on-generative-ai/9781098149239/)
Johno on Twitter (https://twitter.com/johnowhitaker)
Hugo on Twitter (https://twitter.com/hugobowne)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
SciPy 2024 CFP (https://www.scipy2024.scipy.org/#CFP)
Escaping Generative AI Walled Gardens with Omoju Miller, a Vanishing Gradients Livestream (https://lu.ma/xonnjqe4)
| |||
| Episode 23: Statistical and Algorithmic Thinking in the AI Age | 20 Dec 2023 | 01:20:37 | |
Hugo speaks with Allen Downey, a curriculum designer at Brilliant, Professor Emeritus at Olin College, and the author of Think Python, Think Bayes, Think Stats, and other computer science and data science books. In 2019-20 he was a Visiting Professor at Harvard University. He previously taught at Wellesley College and Colby College and was a Visiting Scientist at Google. He is also the author of the upcoming book Probably Overthinking It!
They discuss Allen's new book and the key statistical and data skills we all need to navigate an increasingly data-driven and algorithmic world. The goal was to dive deep into the statistical paradoxes and fallacies that get in the way of using data to make informed decisions.
For example, when it was reported in 2021 that āin the United Kingdom, 70-plus percent of the people who die now from COVID are fully vaccinated,ā this was correct but the implication was entirely wrong. Their conversation jumps into many such concrete examples to get to the bottom of using data for more than ālies, damned lies, and statistics.ā They cover
Information and misinformation around pandemics and the base rate fallacy;
The tools we need to comprehend the small probabilities of high-risk events such as stock market crashes, earthquakes, and more;
The many definitions of algorithmic fairness, why they can't all be met at once, and what we can do about it;
Public health, the need for robust causal inference, and variations on Berksonās paradox, such as the low-birthweight paradox: an influential paper found that that the mortality rate for children of smokers is lower for low-birthweight babies;
Why none of us are normal in any sense of the word, both in physical and psychological measurements;
The Inspection paradox, which shows up in the criminal justice system and distorts our perception of prison sentences and the risk of repeat offenders.
LINKS
The livestream on YouTube (https://youtube.com/live/G8LulD72kzs?feature=share)
Allen Downey on Github (https://github.com/AllenDowney)
Allen's new book Probably Overthinking It! (https://greenteapress.com/wp/probably-overthinking-it/)
Allen on Twitter (https://twitter.com/AllenDowney)
Prediction-Based Decisions and Fairness: A Catalogue of Choices, Assumptions, and Definitions by Mitchell et al. (https://arxiv.org/abs/1811.07867)
| |||
| Episode 22: LLMs, OpenAI, and the Existential Crisis for Machine Learning Engineering | 27 Nov 2023 | 01:20:07 | |
Jeremy Howard (Fast.ai), Shreya Shankar (UC Berkeley), and Hamel Husain (Parlance Labs) join Hugo Bowne-Anderson to talk about how LLMs and OpenAI are changing the worlds of data science, machine learning, and machine learning engineering.
Jeremy Howard (https://twitter.com/jeremyphoward) is co-founder of fast.ai, an ex-Chief Scientist at Kaggle, and creator of the ULMFiT approach on which all modern language models are based. Shreya Shankar (https://twitter.com/sh_reya) is at UC Berkeley, ex Google brain, Facebook, and Viaduct. Hamel Husain (https://twitter.com/HamelHusain) has his own generative AI and LLM consultancy Parlance Labs (https://parlance-labs.com/) and was previously at Outerbounds, Github, and Airbnb.
They talk about
How LLMs shift the nature of the work we do in DS and ML,
How they change the tools we use,
The ways in which they could displace the role of traditional ML (e.g. will we stop using xgboost any time soon?),
How to navigate all the new tools and techniques,
The trade-offs between open and closed models,
Reactions to the recent Open Developer Day and the increasing existential crisis for ML.
LINKS
The panel on YouTube (https://youtube.com/live/MTJHvgJtynU?feature=share)
Hugo and Jeremy's upcoming livestream on what the hell happened recently at OpenAI, among many other things (https://lu.ma/byxyzfrr?utm_source=vg)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Vanishing Gradients on twitter (https://twitter.com/VanishingData)
| |||
| Episode 21: Deploying LLMs in Production: Lessons Learned | 14 Nov 2023 | 01:08:11 | |
Hugo speaks with Hamel Husain, a machine learning engineer who loves building machine learning infrastructure and tools š·. Hamel leads and contributes to many popular open-source machine learning projects. He also has extensive experience (20+ years) as a machine learning engineer across various industries, including large tech companies like Airbnb and GitHub. At GitHub, he led CodeSearchNet (https://github.com/github/CodeSearchNet), a large language model for semantic search that was a precursor to CoPilot. Hamel is the founder of Parlance-Labs (https://parlance-labs.com/), a research and consultancy focused on LLMs.
They talk about generative AI, large language models, the business value they can generate, and how to get started.
They delve into
Where Hamel is seeing the most business interest in LLMs (spoiler: the answer isnāt only tech);
Common misconceptions about LLMs;
The skills you need to work with LLMs and GenAI models;
Tools and techniques, such as fine-tuning, RAGs, LoRA, hardware, and more!
Vendor APIs vs OSS models.
LINKS
Our upcoming livestream LLMs, OpenAI Dev Day, and the Existential Crisis for Machine Learning Engineering with Jeremy Howard (Fast.ai), Shreya Shankar (UC Berkeley), and Hamel Husain (Parlance Labs): Sign up for free! (https://lu.ma/m81oepqe/utm_source=vghh)
Our recent livestream Data and DevOps Tools for Evaluating and Productionizing LLMs (https://youtube.com/live/B_DMMlDuJB0) with Hamel and Emil Sedgh, Lead AI engineer at Rechat -- in it, we showcase an actual industrial use case that Hamel and Emil are working on with Rechat, a real estate CRM, taking you through LLM workflows and tools.
Extended Guide: Instruction-tune Llama 2 (https://www.philschmid.de/instruction-tune-llama-2) by Philipp Schmid
The livestream recoding of this episode! (https://youtube.com/live/l7jJhL9geZQ?feature=share)
Hamel on twitter (https://twitter.com/HamelHusain)
| |||
| Episode 38: The Art of Freelance AI Consulting and Products: Data, Dollars, and Deliverables | 04 Nov 2024 | 01:23:47 | |
Hugo speaks with Jason Liu, an independent AI consultant with experience at Meta and Stitch Fix. At Stitch Fix, Jason developed impactful AI systems, like a $50 million product similarity search and the widely adopted Flight recommendation framework. Now, he helps startups and enterprises design and deploy production-level AI applications, with a focus on retrieval-augmented generation (RAG) and scalable solutions.
This episode is a bit of an experiment. Instead of our usual technical deep dives, weāre focusing on the world of AI consulting and freelancing. We explore Jasonās consulting playbook, covering how he structures contracts to maximize value, strategies for moving from hourly billing to securing larger deals, and the mindset shift needed to align incentives with clients. Weāll also discuss the challenges of moving from deterministic software to probabilistic AI systems and even do a live role-playing session where Jason coaches me on client engagement and pricing pitfalls.
LINKS
The livestream on YouTube (https://youtube.com/live/9CFs06UDbGI?feature=share)
Jason's Upcoming course: AI Consultant Accelerator: From Expert to High-Demand Business (https://maven.com/indie-consulting/ai-consultant-accelerator?utm_campaign=9532cc&utm_medium=partner&utm_source=instructor)
Hugo's upcoming course: Building LLM Applications for Data Scientists and Software Engineers (https://maven.com/s/course/d56067f338)
Jason's website (https://jxnl.co/)
Jason's indie consulting newsletter (https://indieconsulting.podia.com/)
Your AI Product Needs Evals by Hamel Husain (https://hamel.dev/blog/posts/evals/)
What Weāve Learned From A Year of Building with LLMs (https://applied-llms.org/)
Dear Future AI Consultant by Jason (https://jxnl.co/writing/#dear-future-ai-consultant)
Alex Hormozi's books (https://www.acquisition.com/books)
The Burnout Society by Byung-Chul Han (https://www.sup.org/books/theory-and-philosophy/burnout-society)
Jason on Twitter (https://x.com/jxnlco)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Vanishing Gradients' lu.ma calendar (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Vanishing Gradients on YouTube (https://www.youtube.com/@vanishinggradients)
| |||
| Episode 20: Data Science: Past, Present, and Future | 05 Oct 2023 | 01:26:39 | |
Hugo speaks with Chris Wiggins (Columbia, NYTimes) and Matthew Jones (Princeton) about their recent book How Data Happened, and the Columbia course it expands upon, data: past, present, and future.
Chris is an associate professor of applied mathematics at Columbia University and the New York Timesā chief data scientist, and Matthew is a professor of history at Princeton University and former Guggenheim Fellow.
From facial recognition to automated decision systems that inform who gets loans and who receives bail, we all now move through a world determined by data-empowered algorithms. These technologies didnāt just appear: they are part of a history that goes back centuries, from the census enshrined in the US Constitution to the birth of eugenics in Victorian Britain to the development of Google search.
DJ Patil, former U.S. Chief Data Scientist, said of the book "This is the first comprehensive look at the history of data and how power has played a critical role in shaping the history. Itās a must read for any data scientist about how we got here and what we need to do to ensure that data works for everyone."
If youāre a data scientist, machine learning engineer, or work with data in any way, itās increasingly important to know more about the history and future of the work that you do and understand how your work impacts society and the world.
Among other things, they'll delve into
* the history of human use of data;
* how data are used to reveal insight and support decisions;
* how data and data-powered algorithms shape, constrain, and manipulate our commercial, civic, and personal transactions and experiences; and
* how exploration and analysis of data have become part of our logic and rhetoric of communication and persuasion.
You can also sign up for our next livestreamed podcast recording here (https://www.eventbrite.com/e/data-science-past-present-and-future-tickets-695643357007?aff=kjvg)!
LINKS
How Data Happened, the book! (https://wwnorton.com/books/how-data-happened)
data: past, present, and future, the course (https://data-ppf.github.io/)
Race After Technology, by Ruha Benjamin (https://www.ruhabenjamin.com/race-after-technology)
The problem with metrics is a big problem for AI by Rachel Thomas (https://www.ruhabenjamin.com/race-after-technology)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
| |||
| Episode 19: Privacy and Security in Data Science and Machine Learning | 14 Aug 2023 | 01:23:19 | |
Hugo speaks with Katharine Jarmul about privacy and security in data science and machine learning. Katharine is a Principal Data Scientist at Thoughtworks Germany focusing on privacy, ethics, and security for data science workflows. Previously, she has held numerous roles at large companies and startups in the US and Germany, implementing data processing and machine learning systems with a focus on reliability, testability, privacy, and security.
In this episode, Hugo and Katharine talk about
What data privacy and security are, what they arenāt and the differences between them (hopefully dispelling common misconceptions along the way!);
Why you should care about them (hint: the answers will involve regulatory, ethical, risk, and organizational concerns);
Data governance, anonymization techniques, and privacy in data pipelines;
Privacy attacks!
The state of the art in privacy-aware machine learning and data science, including federated learning;
What you need to know about the current state of regulation, including GDPR and CCPAā¦
And much more, all the while grounding our conversation in real-world examples from data science, machine learning, business, and life!
You can also sign up for our next livestreamed podcast recording here (https://lu.ma/4b5xalpz)!
LINKS
Win a copy of Practical Data Privacy, Katharine's new book! (https://forms.gle/wkF92vyvjfZLM6qt8)
Katharine on twitter (https://twitter.com/kjam)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Probably Private, a newsletter for privacy and data science enthusiasts (https://probablyprivate.com/)
Probably Private on YouTube (https://www.youtube.com/@ProbablyPrivate)
| |||
| Episode 18: Research Data Science in Biotech | 24 May 2023 | 01:12:42 | |
Hugo speaks with Eric Ma about Research Data Science in Biotech. Eric leads the Research team in the Data Science and Artificial Intelligence group at Moderna Therapeutics. Prior to that, he was part of a special ops data science team at the Novartis Institutes for Biomedical Research's Informatics department.
In this episode, Hugo and Eric talk about
What tools and techniques they use for drug discovery (such as mRNA vaccines and medicines);
The importance of machine learning, deep learning, and Bayesian inference;
How to think more generally about such high-dimensional, multi-objective optimization problems;
The importance of open-source software and Python;
Institutional and cultural questions, including hiring and the trade-offs between being an individual contributor and a manager;
How theyāre approaching accelerating discovery science to the speed of thought using computation, data science, statistics, and ML.
And as always, much, much more!
LINKS
Eric's website (https://ericmjl.github.io/)
Eric on twitter (https://twitter.com/ericmjl)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Cell Biology by the Numbers by Ron Milo and Rob Phillips (http://book.bionumbers.org/)
Eric's JAX tutorials at PyCon (https://youtu.be/ztthQJQFe20) and SciPy (https://youtu.be/DmR36wtel4Y)
Eric's blog post on Hiring data scientists at Moderna! (https://ericmjl.github.io/blog/2021/8/26/hiring-data-scientists-at-moderna-2021/)
| |||
| Episode 17: End-to-End Data Science | 17 Feb 2023 | 01:16:04 | |
Hugo speaks with Tanya Cashorali, a data scientist and consultant that helps businesses get the most out of data, about what end-to-end data science looks like across many industries, such as retail, defense, biotech, and sports, including
scoping out projects,
figuring out the correct questions to ask,
how projects can change,
delivering on the promise,
the importance of rapid prototyping,
what it means to put models in production, and
how to measure success.
And much more, all the while grounding their conversation in real-world examples from data science, business, and life.
In a world where most organizations think they need AI and yet 10-15% of data science actually involves model building, itās time to get real about how data science and machine learning actually deliver value!
LINKS
Tanya on Twitter (https://twitter.com/tanyacash21)
Vanishing Gradients on YouTube (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Saving millions with a Shiny app | Data Science Hangout with Tanya Cashorali (https://youtu.be/qdAroyFRFCg)
Our next livestream: Research Data Science in Biotech with Eric Ma (https://www.eventbrite.com/e/research-data-science-in-biotech-tickets-550400882857?aff=fs)
| |||
| Episode 16: Data Science and Decision Making Under Uncertainty | 14 Dec 2022 | 01:23:15 | |
Hugo speaks with JD Long, agricultural economist, quant, and stochastic modeler, about decision making under uncertainty and how we can use our knowledge of risk, uncertainty, probabilistic thinking, causal inference, and more to help us use data science and machine learning to make better decisions in an uncertain world.
This is part 2 of a two part conversation in which we delve into decision making under uncertainty. Feel free to check out part 1 here (https://vanishinggradients.fireside.fm/15) but this episode should also stand alone.
Why am I speaking to JD about all of this? Because not only is he a wild conversationalist with a real knack for explaining hard to grok concepts with illustrative examples and useful stories, but he has worked for many years in re-insurance, thatās right, not insurance but re-insurance ā these are the people who insure the insurers so if anyone can actually tell us about risk and uncertainty in decision making, itās him!
In part 1, we discussed risk, uncertainty, probabilistic thinking, and simulation, all with a view towards improving decision making.
In this, part 2, we discuss the ins and outs of decision making under uncertainty, including
How data science can be more tightly coupled with the decision function in organisations;
Some common mistakes and failure modes of making decisions under uncertainty;
Heuristics for principled decision-making in data science;
The intersection of model building, storytelling, and cognitive biases to keep in mind;
As JD says, and I paraphrase, āYou may think you train your models, but your models are really training you.ā
Links
Vanishing Gradients' new YouTube channel! (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
JD on twitter (https://twitter.com/CMastication)
Executive Data Science, episode 5 of Vanishing Gradients, in which Jim Savage and Hugo talk through decision making and why you should always be integrating your loss function over your posterior (https://vanishinggradients.fireside.fm/5)
Fooled by Randomness by Nassim Taleb (https://en.wikipedia.org/wiki/Fooled_by_Randomness)
Superforecasting: The Art and Science of Prediction Philip E. Tetlock and Dan Gardner (https://en.wikipedia.org/wiki/Superforecasting:_The_Art_and_Science_of_Prediction)
Thinking in Bets by Annie Duke (https://www.penguin.com.au/books/thinking-in-bets-9780735216372)
The Signal and the Noise: Why So Many Predictions Fail by Nate Silver (https://en.wikipedia.org/wiki/The_Signal_and_the_Noise)
Thinking, Fast and Slow by Daniel Kahneman (https://en.wikipedia.org/wiki/Thinking,_Fast_and_Slow)
| |||
| Episode 15: Uncertainty, Risk, and Simulation in Data Science | 07 Dec 2022 | 00:53:30 | |
Hugo speaks with JD Long, agricultural economist, quant, and stochastic modeler, about decision making under uncertainty and how we can use our knowledge of risk, uncertainty, probabilistic thinking, causal inference, and more to help us use data science and machine learning to make better decisions in an uncertain world.
This is part 1 of a two part conversation. In this, part 1, we discuss risk, uncertainty, probabilistic thinking, and simulation, all with a view towards improving decision making and we draw on examples from our personal lives, the pandemic, our jobs, the reinsurance space, and the corporate world. In part 2, weāll get into the nitty gritty of decision making under uncertainty.
As JD says, and I paraphrase, āYou may think you train your models, but your models are really training you.ā
Links
Vanishing Gradients' new YouTube channel! (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
JD on twitter (https://twitter.com/CMastication)
Executive Data Science, episode 5 of Vanishing Gradients, in which Jim Savage and Hugo talk through decision making and why you should always be integrating your loss function over your posterior (https://vanishinggradients.fireside.fm/5)
Fooled by Randomness by Nassim Taleb (https://en.wikipedia.org/wiki/Fooled_by_Randomness)
Superforecasting: The Art and Science of Prediction Philip E. Tetlock and Dan Gardner (https://en.wikipedia.org/wiki/Superforecasting:_The_Art_and_Science_of_Prediction)
Thinking in Bets by Annie Duke (https://www.penguin.com.au/books/thinking-in-bets-9780735216372)
The Signal and the Noise: Why So Many Predictions Fail by Nate Silver (https://en.wikipedia.org/wiki/The_Signal_and_the_Noise)
Thinking, Fast and Slow by Daniel Kahneman (https://en.wikipedia.org/wiki/Thinking,_Fast_and_Slow)
| |||
| Episode 14: Decision Science, MLOps, and Machine Learning Everywhere | 20 Nov 2022 | 01:09:01 | |
Hugo Bowne-Anderson, host of Vanishing Gradients, reads 3 audio essays about decision science, MLOps, and what happens when machine learning models are everywhere.
Links
Our upcoming Vanishing Gradients live recording of Data Science and Decision Making Under Uncertainty with Hugo and JD Long! (https://www.eventbrite.com/e/data-science-and-decision-making-under-uncertainty-tickets-467379864757?aff=vg)
Decision-Making in a Time of Crisis (https://www.oreilly.com/radar/decision-making-in-a-time-of-crisis/) by Hugo Bowne-Anderson
MLOps and DevOps: Why Data Makes It Different (https://www.oreilly.com/radar/mlops-and-devops-why-data-makes-it-different/) by Ville Tuulos and Hugo Bowne-Anderson
The above essay syndicated on VentureBeat (https://venturebeat.com/business/mlops-vs-devops-why-data-makes-it-different/)
When models are everywhere (https://www.oreilly.com/radar/when-models-are-everywhere/) by Hugo Bowne-Anderson and Mike Loukides
| |||
| Episode 13: The Data Science Skills Gap, Economics, and Public Health | 11 Oct 2022 | 01:22:41 | |
Hugo speak with Norma Padron about data science education and continuous learning for people working in healthcare, broadly construed, along with how we can think about the democratization of data science skills more generally.
Norma is CEO of EmpiricaLab, where her teamās mission is to bridge work and training and empower healthcare teams to focus on what they care about the most: patient care. In a word, EmpiricaLab is a platform focused on peer learning and last-mile training for healthcare teams.
As youāll discover, Normaās background is fascinating: with a Ph.D. in health policy and management from Yale University, a master's degree in economics from Duke University (among other things), and then working with multiple early stage digital health companies to accelerate their growth and scale, this is a wide ranging conversation about how and where learning actually occurs, particularly with respect to data science; we talk about how the worlds of economics and econometrics, including causal inference, can be used to make data science and more robust and less fragile field, and why these disciplines are essential to both public and health policy. It was really invigorating to talk about the data skills gaps that exists in organizations and how Normaās team at Empiricalab is thinking about solving it in the health space using a 3 tiered solution of content creation, a social layer, and an information discovery platform.
All of this in service of a key question weāre facing in this field: how do you get the right data skills, tools, and workflows, in the hands of the people who need them, when the space is evolving so quickly?
Links
Norma's website (https://www.normapadron.com/)
EmpiricaLab (https://www.empiricalab.com/)
Norma on twitter (https://twitter.com/NormaPadron__)
| |||
| Episode 12: Data Science for Social Media: Twitter and Reddit | 30 Sep 2022 | 01:32:45 | |
Hugo speakswith Katie Bauer (https://twitter.com/imightbemary) about her time working in data science at both Twitter and Reddit. At the time of recording, Katie was a data science manager at Twitter and prior to that, a founding member of the data team at Reddit. Sheās now Head of Data Science at Gloss Genius so congrats on the new job, Katie!
In this conversation, we dive into what type of challenges social media companies face that data science is equipped to solve: in doing so, we traverse
the difference and similarities in companies such as Twitter and Reddit,
the major differences in being an early member of a data team and joining an established data function at a larger organization,
the supreme importance of robust measurement and telemetry in data science, along with
the mixed incentives for career data scientists, such as building flashy new things instead of maintaining existing infrastructure.
Iāve always found conversations with Katie to be a treasure trove of insights into data science and machine learning practice, along with key learnings about data science management.
In a word, Katie helps me to understand our space better. In this conversation, she told me that one important function data science can serve in any organization is creating a shared context for lots of different people in the org. We dive deep into what this actually means, how it can play out, traversing the world of dashboards, metric stores, feature stores, machine learning products, the need for top-down support, and much, much more.
| |||
| Episode 11: Data Science: The Great Stagnation | 16 Sep 2022 | 01:45:38 | |
Hugo speaks with Mark Saroufim, an Applied AI Engineer at Meta who works on PyTorch where his teamās main focus is making it as easy as possible for people to deploy PyTorch in production outside Meta.
Mark first came on our radar with an essay he wrote called Machine Learning: the Great Stagnation (https://marksaroufim.substack.com/p/machine-learning-the-great-stagnation), which was concerned with the stagnation in machine learning in academic research and in which he stated
Machine learning researchers can now engage in risk-free, high-income, high-prestige work. They are todayās Medieval Catholic priests.
This is just the tip of the icebergs of Markās critical and often sociological eye and one of the reasons I was excited to speak with him.
In this conversation, we talk about the importance of open source software in modern data science and machine learning and how Mark thinks about making it as easy to use as possible. We also talk about risk assessments in considering whether to adopt open source or not, the supreme importance of good documentation, and what we can learn from the world of video game development when thinking about open source.
We then dive into the rise of the machine learning cult leader persona, in the context of examples such as Hugging Face and the community theyāve built. We discuss the role of marketing in open source tooling, along with for profit data science and ML tooling, how it can impact you as an end user, and how much of data science can be considered differing forms of live action role playing and simulation.
We also talk about developer marketing and content for data professionals and how we see some of the largest names in ML researchers being those that have gigantic Twitter followers, such as Andrei Karpathy. This is part of a broader trend in society about the skills that are required to capture significant mind share these days.
If thatās not enough, we jump into how machine learning ideally allows businesses to build sustainable and defensible moats, by which we mean the ability to maintain competitive advantages over competitors to retain market share.
In between this interview and its release, PyTorch joined the Linux Foundation, which is something weāll need to get Mark back to discuss sometime.
Links
The Myth of Objective Tech Screens (https://marksaroufim.substack.com/p/the-myth-of-objective-tech-screens)
Machine Learning: The Great Stagnation (https://marksaroufim.substack.com/p/machine-learning-the-great-stagnation)
Fear the Boom and Bust: Keynes vs. Hayek - The Original Economics Rap Battle! (https://www.youtube.com/watch?v=d0nERTFo-Sk)
History and the Security of Property (https://archive.ph/dRXEK#selection-21.0-21.36) by Nick Szabo
Mark on YouTube (https://www.youtube.com/marksaroufim)
Mark's Substack (https://marksaroufim.substack.com/p/machine-learning-the-great-stagnation)
Mark's Discord (https://discord.com/invite/drmuTjWZrm)
| |||
| Episode 37: Prompt Engineering, Security in Generative AI, and the Future of AI Research Part 2 | 08 Oct 2024 | 00:50:36 | |
Hugo speaks with three leading figures from the world of AI research: Sander Schulhoff, a recent University of Maryland graduate and lead contributor to the Learn Prompting initiative; Philip Resnik, professor at the University of Maryland, known for his pioneering work in computational linguistics; and Dennis Peskoff, a researcher from Princeton specializing in prompt engineering and its applications in the social sciences.
This is Part 2 of a special two-part episode, promptedāno pun intendedāby these guys being part of a team, led by Sander, that wrote a 76-page survey analyzing prompting techniques, agents, and generative AI. The survey included contributors from OpenAI, Microsoft, the University of Maryland, Princeton, and more.
In this episode, we cover:
The Prompt Report: A comprehensive survey on prompting techniques, agents, and generative AI, including advanced evaluation methods for assessing these techniques.
Security Risks and Prompt Hacking: A detailed exploration of the security concerns surrounding prompt engineering, including Sanderās thoughts on its potential applications in cybersecurity and military contexts.
AIās Impact Across Fields: A discussion on how generative AI is reshaping various domains, including the social sciences and security.
Multimodal AI: Updates on how large language models (LLMs) are expanding to interact with images, code, and music.
Case Study - Detecting Suicide Risk: A careful examination of how prompting techniques are being used in important areas like detecting suicide risk, showcasing the critical potential of AI in addressing sensitive, real-world challenges.
The episode concludes with a reflection on the evolving landscape of LLMs and multimodal AI, and what might be on the horizon.
If you havenāt yet, make sure to check out Part 1, where we discuss the history of NLP, prompt engineering techniques, and Sanderās development of the Learn Prompting initiative.
LINKS
The livestream on YouTube (https://youtube.com/live/FreXovgG-9A?feature=share)
The Prompt Report: A Systematic Survey of Prompting Techniques (https://arxiv.org/abs/2406.06608)
Learn Prompting: Your Guide to Communicating with AI (https://learnprompting.org/)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Vanishing Gradients' lu.ma calendar (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Vanishing Gradients on YouTube (https://www.youtube.com/@vanishinggradients)
| |||
| Episode 10: Investing in Machine Learning | 18 Aug 2022 | 01:26:33 | |
Hugo speaks with Sarah Catanzaro, General Partner at Amplify Partners, about investing in data science and machine learning tooling and where we see progress happening in the space.
Sarah invests in the tools that we both wish we had earlier in our careers: tools that enable data scientists and machine learners to collect, store, manage, analyze, and model data more effectively. As youāll discover, Sarah identifies as a scientist first and an investor second and still believes that her mission is to enable companies to become data-driven and to generate ROI through machine and statistical learning. In her words, sheās still that cuckoo kid whoās ranting and raving about how data and AI will shift every tide.
In this conversation, we talk about what scientific inquiry actually is and the elements of playfulness and seriousness it necessarily involves, and how it can be used to generate business value. We talk about Sarahās unorthodox path from a data scientist working in defense to her time at Palantir and how that led her to build out a data team and function for a venture capital firm and then to becoming a VC in the data tooling space.
We then really dive into the data science and machine learning tooling space to figure out why itās so fragmented: we look to the data analytics stack and software engineering communities to find historical tethers that may be useful. We discuss the moving parts that led to the establishment of a standard, a system of record, and clearly defined roles in analytics and what we can learn from that for machine learning!
We also dive into the development of tools, workflows, and division of labour as partial exercises in pattern recognition and how this can be at odds with the variance we see in the machine learning landscape, more generally!
Two take-aways are that we need best practices and we need more standardization.
We also discussed that, with all our focus and conversations on tools, what conversation weāre missing and Sarah was adamant that we need to be focusing on questions, not solutions, and even questioning what ML is useful for and what it isnāt, diving into a bunch of thoughtful and nuanced examples.
Iām also grateful that Sarah let me take her down a slightly dangerous and self-critical path where we riffed on both our roles in potentially contributing to the tragedy of commons weāre all experiencing in the data tooling landscape, me working in tool building, developer relations, and in marketing, and Sarah in venture capital.
| |||
| 9: AutoML, Literate Programming, and Data Tooling Cargo Cults | 19 Jul 2022 | 01:41:42 | |
Hugo speaks with Hamel Husain, Head of Data Science at Outerbounds, with extensive experience in data science consulting, at DataRobot, Airbnb, and Github.
In this conversation, they talk about Hamel's early days in data science, consulting for a wide array of companies, such as Crocs, restaurants, and casinos in Las Vegas, diving into what data science even looked like in 2005 and how you could think about delivering business value using data and analytics back then.
They talk about his trajectory in moving to data science and machine learning in Silicon Valley, what his expectations were, and what he actually found there.
They then take a dive into AutoML, discussing what should be automated in Machine learning and what shouldnāt. They talk about software engineering best practices and what aspects it would be useful for data scientists to know about.
They also got to talk about the importance of literate programming, notebooks, and documentation in data science and ML. All this and more!
Links
Hamel on twitter (https://twitter.com/HamelHusain)
The Outerbounds documentation project repo (https://github.com/outerbounds/docs)
Practical Advice for R in Production (https://www.rstudio.com/blog/practical-advice-for-r-in-production-answering-your-questions/)
nbdev: Create delightful python projects using Jupyter Notebooks (https://nbdev.fast.ai/)
| |||
| Episode 8: The Open Source Cybernetic Revolution | 16 May 2022 | 01:05:57 | |
Hugo speaks with Peter Wang, CEO of Anaconda, about what the value proposition of data science actually is, data not as the new oil, but rather data as toxic, nuclear sludge, the fact that data isnāt real (and what we really have are frozen models), and the future promise of data science.
They also dive into an experimental conversation around open source software development as a model for the development of human civilization, in the context of developing systems that prize local generativity over global extractive principles. If thatās a mouthful, which it was, or an earful, which it may have been, all will be revealed in the conversation.
LInks
Peter on twitter (https://twitter.com/pwang)
Anaconda Nucleus (https://anaconda.cloud/)
Jordan Hall on the Jim Rutt Show (https://www.jimruttshow.com/jordan-greenhall-hall/): Game B
Meditations On Moloch (https://slatestarcodex.com/2014/07/30/meditations-on-moloch) -- On multipolar traps
Here Comes Everybody: The Power of Organizing Without Organizations (https://en.wikipedia.org/wiki/Here_Comes_Everybody_(book)) by Clay Shirky
Finite and Infinite Games (https://en.wikipedia.org/wiki/Finite_and_Infinite_Games) by James Carse
Governing the Commons: The Evolution of Institutions for Collective Action (https://www.cambridge.org/core/books/governing-the-commons/7AB7AE11BADA84409C34815CC288CD79) by Elinor Olstrom
Elinor Ostrom's 8 Principles for Managing A Commmons (https://www.onthecommons.org/magazine/elinor-ostroms-8-principles-managing-commmons)
Haunted by Data (https://idlewords.com/talks/haunted_by_data.htm), a beautiful and mesmerising talk by Pinboard.in founder Maciej Ceglowski
| |||
| Episode 7: The Evolution of Python for Data Science | 01 May 2022 | 01:02:31 | |
Hugo speaks with Peter Wang, CEO of Anaconda, about how Python became so big in data science, machine learning, and AI. They jump into many of the technical and sociological beginnings of Python being used for data science, a history of PyData, the conda distribution, and NUMFOCUS.
They also talk about the emergence of online collaborative environments, particularly with respect to open source, and attempt to figure out the movings parts of PyData and why it has had the impact it has, including the fact that many core developers were not computer scientists or software engineers, but rather scientists and researchers building tools that they needed on an as-needed basis
They also discuss the challenges in getting adoption for Python and the things that the PyData stack solves, those that it doesnāt and what progress is being made there.
People who have listened to Hugo podcast for some time may have recognized that he's interested in the sociology of the data science space and he really considered speaking with Peter a fascinating opportunity to delve into how the Pythonic data science space evolved, particularly with respect to tooling, not only because Peter had a front row seat for much of it, but that he was one of several key actors at various different points. On top of this, Hugo wanted to allow Peterās inner sociologist room to breathe and evolve in this conversation.
What happens then is slightly experimental ā Peter is a deep, broad, and occasionally hallucinatory thinker and Hugo wanted to explore new spaces with him so we hope you enjoy the experiments they play as they begin to discuss open-source software in the broader context of finite and infinite games and how OSS is a paradigm of humanityās ability to create generative, nourishing and anti-rivlarous systems where, by anti-rivalrous, we mean things that become more valuable for everyone the more people use them! But we need to be mindful of finite-game dynamics (for example, those driven by corporate incentives) co-opting and parasitizing the generative systems that we build.
These are all considerations they delve far deeper into in Part 2 of this interview, which will be the next episode of VG, where we also dive into the relationship between OSS, tools, and venture capital, amonh many others things.
LInks
Peter on twitter (https://twitter.com/pwang)
Anaconda Nucleus (https://anaconda.cloud/)
Calling out SciPy on diversity (even though it hurts) (https://ilovesymposia.com/2015/04/03/calling-out-scipy-on-diversity/) by Juan Nunez-Iglesias
Here Comes Everybody: The Power of Organizing Without Organizations (https://en.wikipedia.org/wiki/Here_Comes_Everybody_(book)) by Clay Shirky
Finite and Infinite Games (https://en.wikipedia.org/wiki/Finite_and_Infinite_Games) by James Carse
Governing the Commons: The Evolution of Institutions for Collective Action (https://www.cambridge.org/core/books/governing-the-commons/7AB7AE11BADA84409C34815CC288CD79) by Elinor Olstrom
Elinor Ostrom's 8 Principles for Managing A Commmons (https://www.onthecommons.org/magazine/elinor-ostroms-8-principles-managing-commmons)
| |||
| Episode 6: Bullshit Jobs in Data Science (and what to do about them) | 04 Apr 2022 | 01:27:01 | |
Hugo speaks with Jacqueline Nolis, Chief Product Officer at Saturn Cloud (formerly Head of Data Science), about all types of failure modes in data science, ML, and AI, and they delve into bullshit jobs in data science (yes, thatās a technical term, as youāll find out) āthey discuss the elements that are bullshit, the elements that arenāt, and how to increase the ratio of the latter to the former.
They also talk about her journey in moving from mainly working in prescriptive analytics building reports in PDFs and power points to deploying machine learning products in production. They delve into her motion from doing data science to designing products for data scientists and how to think about choosing career paths. Jacqueline has been an individual contributor, a team lead, and a principal data scientist so has a lot of valuable experience here. They talk about her experience of transitioning gender while working in data science and they work hard to find a bright vision for the future of this industry!
Links
Jacqueline on twitter (https://twitter.com/skyetetra)
Building a Career in Data Science (https://jnolis.com/book/) by Jacqueline and Emily Robinson
Saturn Cloud (https://saturncloud.io/)
Why are we so surprised? (http://allendowney.blogspot.com/2016/11/why-are-we-so-surprised.html), a post by Allen Downey on communicating and thinking through uncertainty
Data Mishaps Night! (https://datamishapsnight.com/)
The Trump administrationās ācubic modelā of coronavirus deaths, explained (https://www.vox.com/2020/5/8/21250641/kevin-hassett-cubic-model-smoothing) by Matthew Yglesias
Working Class Deep Learner (https://marksaroufim.substack.com/p/working-class-deep-learner?s=r) by Mark Saroufim
| |||
| Episode 5: Executive Data Science | 23 Mar 2022 | 01:48:14 | |
Hugo speaks with Jim Savage, the Director of Data Science at Schmidt Futures, about the need for data science in executive training and decision, what data scientists can learn from economists, the perils of "data for good", and why you should always be integrating your loss function over your posterior.
Jim and Hugo talk about what data science is and isnāt capable of, what can actually deliver value, and what people really enjoy doing: the intersection in this Venn diagram is where we need to focus energy and it may not be quite what you think it is!
They then dive into Jim's thoughts on what he dubs Executive Data Science. You may be aware of the slicing of the data science and machine learning spaces into descriptive analytics, predictive analytics, and prescriptive analytics but, being the thought surgeon that he is, Jim proposes a different slicing into
(1) tool building OR data science as a product,
(2) tools to automate and augment parts of us, and
(3) what Jim calls Executive Data Science.
Jim and Hugo also talk about decision theory, the woeful state of causal inference techniques in contemporary data science, and what techniques it would behoove us all to import from econometrics and economics, more generally. If thatās not enough, they talk about the importance of thinking through the data generating process and things that can go wrong if you donāt. In terms of allowing your data work to inform your decision making, thery also discuss Jimās maxim āALWAYS BE INTEGRATING YOUR LOSS FUNCTION OVER YOUR POSTERIORā
Last but definitively not least, as Jim has worked in the data for good space for much of his career, they talk about what this actually means, with particular reference to fast.ai founder & QUT professor of practice Rachel Thomasā blog post called āDoing Data Science for Social Good, Responsiblyā (https://www.fast.ai/2021/11/23/data-for-good/). Rachelās post takes as its starting point the following words of Sarah Hooker, a researcher at Google Brain:
"Data for good" is an imprecise term that says little about who we serve, the tools used, or the goals. Being more precise can help us be more accountable & have a greater positive impact.
And Jim and I discuss his work in the light of these foundational considerations.
Links
Jim on twitter (https://twitter.com/abiylfoyp/)
What Is Causal Inference?An Introduction for Data Scientists (https://www.oreilly.com/radar/what-is-causal-inference/) by Hugo Bowne-Anderson and Mike Loukides
Jim's must-watch Data Council talk on Productizing Structural Models (https://www.datacouncil.ai/talks/productizing-structural-models)
[Mastering Metrics}(https://www.masteringmetrics.com/) by Angrist and Pischke
Mostly Harmless Econometrics: An Empiricist's Companion (https://press.princeton.edu/books/paperback/9780691120355/mostly-harmless-econometrics) by Angrist and Pischke
The Book of Why (https://en.wikipedia.org/wiki/The_Book_of_Why) by Judea Pearl
Decision-Making in a Time of Crisis (https://www.oreilly.com/radar/decision-making-in-a-time-of-crisis/) by Hugo Bowne-Anderson
Doing Data Science for Social Good, Responsibly (https://www.fast.ai/2021/11/23/data-for-good/) by Rachel Thomas
| |||
| Episode 4: Machine Learning at T-Mobile | 09 Mar 2022 | 01:44:10 | |
Hugo speaks with Heather Nolis, Principal Machine Learning engineer at T-mobile, about what data science, machine learning, and AI look like at T-mobile, along with Heatherās path from a software development intern there to principal ML engineer running a team of 15.
They talk about: how to build a DS culture from scratch and what executive-level support looks like, as well as how to demonstrate machine learning value early on from a shark tank style pitch night to the initial investment through to the POC and building out the function; all the great work they do with R and the Tidyverse in production; what itās like to be a lesbian in tech, and about what it was like to discover she was autistic and how that impacted her work; how to measure and demonstrate success and ROI for the org; some massive data science fails!; how to deal with execs wanting you to use the latest GPT-X ā in a fragmented tooling landscape; how to use the simplest technology to deliver the most value.
Finally, the team just hired their first FT ethicist and they speak about how ethics can be embedded in a team and across an institution.
Links
Put R in prod (https://putrinprod.com/): Tools and guides to put R models into production
Enterprise Web Services with Neural Networks Using R and TensorFlow (https://medium.com/tmobile-tech/enterprise-web-services-with-neural-networks-using-r-and-tensorflow-a09c1b100c11)
Heather on twitter (https://twitter.com/heatherklus)
T-Mobile is hiring! (https://www.t-mobile.com/careers)
Hugo's upcoming fireside chat and AMA with Hilary Parker about how to actually produce sustainable business value using machine learning and product management for ML! (https://www.eventbrite.com/e/select-ml-project-where-value-is-not-null-tickets-284000161127?aff=hba)
| |||
| Episode 3: Language Tech For All | 01 Mar 2022 | 01:32:33 | |
Rachael Tatman is a senior developer advocate for Rasa, where sheās helping developers build and deploy ML chatbots using their open source framework.
Rachael has a PhD in Linguistics from the University of Washington where her research was on computational sociolinguistics, or how our social identity affects the way we use language in computational contexts. Previously she was a data scientist at Kaggle and sheās still a Kaggle Grandmaster.
In this conversation, Rachael and I talk about the history of NLP and conversational AI//chatbots and we dive into the fascinating tension between rule-based techniques and ML and deep learning ā we also talk about how to incorporate machine and human intelligence together by thinking through questions such as āshould a response to a human ever be automated?ā Spoiler alert: the answer is a resounding NO WAY!
In this journey, something that becomes apparent is that many of the trends, concepts, questions, and answers, although framed for NLP and chatbots, are applicable to much of data science, more generally.
We also discuss the data scientistās responsibility to end-users and stakeholders using, among other things, the lens of considering those whose data youāre working with to be data donors.
We then consider what globalized language technology looks like and can look like, what we can learn from the history of science here, particularly given that so much training data and models are in English when it accounts for so little of language spoken globally.
Links
Rachael's website (https://www.rctatman.com/)
Rasa (https://rasa.com/)
Speech and Language Processing (https://web.stanford.edu/~jurafsky/slp3/)
by Dan Jurafsky and James H. Martin
Masakhane (https://twitter.com/MasakhaneNLP), putting African languages on the #NLP map since 2019
The Distributed AI Research Institute (https://www.dair-institute.org/), a space for independent, community-rooted AI research, free from Big Techās pervasive influence
The Algorithmic Justice League (https://www.ajl.org/), unmasking AI harms and biases
Black in AI (https://blackinai.github.io/#/), increasing the presence and inclusion of Black people in the field of AI by creating space for sharing ideas, fostering collaborations, mentorship and advocacy
Hugo's blog post on his new job and why it's exciting for him to double down on helping scientists do better science (https://outerbounds.com/blog/hba-excited-to-join-metaflow-and-outerbounds/)
| |||
| Episode 2: Making Data Science Uncool Again | 20 Feb 2022 | 01:45:45 | |
Jeremy Howard is a data scientist, researcher, developer, educator, and entrepreneur. Jeremy is a founding researcher at fast.ai, a research institute dedicated to making deep learning more accessible. He is also a Distinguished Research Scientist at the University of San Francisco, the chair of WAMRI, and is Chief Scientist at platform.ai.
In this conversation, weāll be talking about the history of data science, machine learning, and AI, where weāve come from and where weāre going, how new techniques can be applied to real-world problems, whether it be deep learning to medicine or porting techniques from computer vision to NLP. Weāll also talk about whatās present and whatās missing in the ML skills revolution, what software engineering skills data scientists need to learn, how to cope in a space of such fragmented tooling, and paths for emerging out of the shadow of FAANG. If thatās not enough, weāll jump into how spreading DS skills around the globe involves serious investments in education, building software, communities, and research, along with diving into the social challenges that the information age and the AI revolution (so to speak) bring with it.
But to get to all of this, youāll need to listen to a few minutes of us chatting about chocolate biscuits in Australia!
Links
* fast.ai Ā· making neural nets uncool again
* nbdev: create delightful python projects using Jupyter Notebooks (https://github.com/fastai/nbdev)
* The fastai book, published as Jupyter Notebooks (https://github.com/fastai/fastbook)
* Deep Learning for Coders with fastai and PyTorch (https://www.oreilly.com/library/view/deep-learning-for/9781492045519/)
* The wonderful and terrifying implications of computers that can learn (https://www.youtube.com/watch?v=t4kyRyKyOpo) -- Jeremy' awesome TED talk!
* Manna (https://marshallbrain.com/manna) by Marshall Brain
* Ghost Work (https://ghostwork.info/) by Mary L. Gray and Siddharth Suri
* Uberland (https://www.ucpress.edu/book/9780520324800/uberland) by Alex Rosenblat
| |||
| Episode 36: Prompt Engineering, Security in Generative AI, and the Future of AI Research Part 1 | 30 Sep 2024 | 01:03:46 | |
Hugo speaks with three leading figures from the world of AI research: Sander Schulhoff, a recent University of Maryland graduate and lead contributor to the Learn Prompting initiative; Philip Resnik, professor at the University of Maryland, known for his pioneering work in computational linguistics; and Dennis Peskoff, a researcher from Princeton specializing in prompt engineering and its applications in the social sciences.
This is Part 1 of a special two-part episode, promptedāno pun intendedāby these guys being part of a team, led by Sander, that wrote a 76-page survey analyzing prompting techniques, agents, and generative AI. The survey included contributors from OpenAI, Microsoft, the University of Maryland, Princeton, and more.
In this first part,
* weāll explore the critical role of prompt engineering,
* & diving into adversarial techniques like prompt hacking and
* the challenges of evaluating these techniques.
* weāll examine the impact of few-shot learning and
* the groundbreaking taxonomy of prompting techniques from the Prompt Report.
Along the way,
* weāll uncover the rich history of natural language processing (NLP) and AI, showing how modern prompting techniques evolved from early rule-based systems and statistical methods.
* weāll also hear how Sanderās experimentation with GPT-3 for diplomatic tasks led him to develop Learn Prompting, and
* how Dennis highlights the accessibility of AI through prompting, which allows non-technical users to interact with AI without needing to code.
Finally, weāll explore the future of multimodal AI, where LLMs interact with images, code, and even music creation. Make sure to tune in to Part 2, where we dive deeper into security risks, prompt hacking, and more.
LINKS
The livestream on YouTube (https://youtube.com/live/FreXovgG-9A?feature=share)
The Prompt Report: A Systematic Survey of Prompting Techniques (https://arxiv.org/abs/2406.06608)
Learn Prompting: Your Guide to Communicating with AI (https://learnprompting.org/)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Vanishing Gradients' lu.ma calendar (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Vanishing Gradients on YouTube (https://www.youtube.com/@vanishinggradients)
| |||
| Episode 35: Open Science at NASA -- Measuring Impact and the Future of AI | 19 Sep 2024 | 00:58:13 | |
Hugo speaks with Dr. Chelle Gentemann, Open Science Program Scientist for NASAās Office of the Chief Science Data Officer, about NASAās ambitious efforts to integrate AI across the research lifecycle. In this episode, weāll dive deeper into how AI is transforming NASAās approach to science, making data more accessible and advancing open science practices. We explore
Measuring the Impact of Open Science: How NASA is developing new metrics to evaluate the effectiveness of open science, moving beyond traditional publication-based assessments.
The Process of Scientific Discovery: Insights into the collaborative nature of research and how breakthroughs are achieved at NASA.
** AI Applications in NASAās Science:** From rats in space to exploring the origins of the universe, we cover how AI is being applied across NASAās divisions to improve data accessibility and analysis.
Addressing Challenges in Open Science: The complexities of implementing open science within government agencies and research environments.
Reforming Incentive Systems: How NASA is reconsidering traditional metrics like publications and citations, and starting to recognize contributions such as software development and data sharing.
The Future of Open Science: How open science is shaping the future of research, fostering interdisciplinary collaboration, and increasing accessibility.
This conversation offers valuable insights for researchers, data scientists, and those interested in the practical applications of AI and open science. Join us as we discuss how NASA is working to make science more collaborative, reproducible, and impactful.
LINKS
The livestream on YouTube (https://youtube.com/live/VJDg3ZbkNOE?feature=share)
NASA's Open Science 101 course <-- do it to learn and also to get NASA Swag! (https://openscience101.org/)
Science Cast (https://sciencecast.org/)
NASA and IBM Openly Release Geospatial AI Foundation Model for NASA Earth Observation Data (https://www.earthdata.nasa.gov/news/impact-ibm-hls-foundation-model)
Jake VanderPlas' daily conundrum tweet from 2013 (https://x.com/jakevdp/status/408678764705378304)
Replit, "an AI-powered software development & deployment platform for building, sharing, and shipping software fast." (https://replit.com/)
| |||
| Episode 34: The AI Revolution Will Not Be Monopolized | 22 Aug 2024 | 01:42:51 | |
Hugo speaks with Ines Montani and Matthew Honnibal, the creators of spaCy and founders of Explosion AI. Collectively, they've had a huge impact on the fields of industrial natural language processing (NLP), ML, and AI through their widely-used open-source library spaCy and their innovative annotation tool Prodigy. These tools have become essential for many data scientists and NLP practitioners in industry and academia alike.
In this wide-ranging discussion, we dive into:
⢠The evolution of applied NLP and its role in industry
⢠The balance between large language models and smaller, specialized models
⢠Human-in-the-loop distillation for creating faster, more data-private AI systems
⢠The challenges and opportunities in NLP, including modularity, transparency, and privacy
⢠The future of AI and software development
⢠The potential impact of AI regulation on innovation and competition
We also touch on their recent transition back to a smaller, more independent-minded company structure and the lessons learned from their journey in the AI startup world.
Ines and Matt offer invaluable insights for data scientists, machine learning practitioners, and anyone interested in the practical applications of AI. They share their thoughts on how to approach NLP projects, the importance of data quality, and the role of open-source in advancing the field.
Whether you're a seasoned NLP practitioner or just getting started with AI, this episode offers a wealth of knowledge from two of the field's most respected figures. Join us for a discussion that explores the current landscape of AI development, with insights that bridge the gap between cutting-edge research and real-world applications.
LINKS
The livestream on YouTube (https://youtube.com/live/-6o5-3cP0ik?feature=share)
How S&P Global is making markets more transparent with NLP, spaCy and Prodigy (https://explosion.ai/blog/sp-global-commodities)
A practical guide to human-in-the-loop distillation (https://explosion.ai/blog/human-in-the-loop-distillation)
Laws of Tech: Commoditize Your Complement (https://gwern.net/complement)
spaCy: Industrial-Strength Natural Language Processing (https://spacy.io/)
LLMs with spaCy (https://spacy.io/usage/large-language-models)
Explosion, building developer tools for AI, Machine Learning and Natural Language Processing (https://explosion.ai/)
Back to our roots: Company update and future plans, by Matt and Ines (https://explosion.ai/blog/back-to-our-roots-company-update)
Matt's detailed blog post: back to our roots (https://honnibal.dev/blog/back-to-our-roots)
Ines on twitter (https://x.com/_inesmontani)
Matt on twitter (https://x.com/honnibal)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Check out and subcribe to our lu.ma calendar (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk) for upcoming livestreams!
| |||
| Episode 33: What We Learned Teaching LLMs to 1,000s of Data Scientists | 12 Aug 2024 | 01:25:10 | |
Hugo speaks with Dan Becker and Hamel Husain, two veterans in the world of data science, machine learning, and AI education. Collectively, theyāve worked at Google, DataRobot, Airbnb, Github (where Hamel built out the precursor to copilot and more) and they both currently work as independent LLM and Generative AI consultants.
Dan and Hamel recently taught a course on fine-tuning large language models that evolved into a full-fledged conference, attracting over 2,000 participants. This experience gave them unique insights into the current state and future of AI education and application.
In this episode, we dive into:
* The evolution of their course from fine-tuning to a comprehensive AI conference
* The unexpected challenges and insights gained from teaching LLMs to data scientists
* The current state of AI tooling and accessibility compared to a decade ago
* The role of playful experimentation in driving innovation in the field
* Thoughts on the economic impact and ROI of generative AI in various industries
* The importance of proper evaluation in machine learning projects
* Future predictions for AI education and application in the next five years
* We also touch on the challenges of using AI tools effectively, the potential for AI in physical world applications, and the need for a more nuanced understanding of AI capabilities in the workplace.
During our conversation, Dan mentions an exciting project he's been working on, which we couldn't showcase live due to technical difficulties. However, I've included a link to a video demonstration in the show notes that you won't want to miss. In this demo, Dan showcases his innovative AI-powered 3D modeling tool that allows users to create 3D printable objects simply by describing them in natural language.
LINKS
The livestream on YouTube (https://youtube.com/live/hDmnwtjktsc?feature=share)
Educational resources from Dan and Hamel's LLM course (https://parlance-labs.com/education/)
Upwork Study Finds Employee Workloads Rising Despite Increased C-Suite Investment in Artificial Intelligence (https://investors.upwork.com/news-releases/news-release-details/upwork-study-finds-employee-workloads-rising-despite-increased-c)
Episode 29: Lessons from a Year of Building with LLMs (Part 1) (https://vanishinggradients.fireside.fm/29)
Episode 30: Lessons from a Year of Building with LLMs (Part 2) (https://vanishinggradients.fireside.fm/30)
Dan's demo: Creating Physical Products with Generative AI (https://youtu.be/U5J5RUOuMkI?si=_7cYLYOU1iwweQeO)
Build Great AI, Dan's boutique consulting firm helping clients be successful with large language models (https://buildgreat.ai/)
Parlance Labs, Hamel's Practical consulting that improves your AI (https://parlance-labs.com/)
Hamel on Twitter (https://x.com/HamelHusain)
Dan on Twitter (https://x.com/dan_s_becker)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
| |||
| Episode 32: Building Reliable and Robust ML/AI Pipelines | 27 Jul 2024 | 01:15:10 | |
Hugo speaks with Shreya Shankar, a researcher at UC Berkeley focusing on data management systems with a human-centered approach. Shreya's work is at the cutting edge of human-computer interaction (HCI) and AI, particularly in the realm of large language models (LLMs). Her impressive background includes being the first ML engineer at Viaduct, doing research engineering at Google Brain, and software engineering at Facebook.
In this episode, we dive deep into the world of LLMs and the critical challenges of building reliable AI pipelines. We'll explore:
The fascinating journey from classic machine learning to the current LLM revolution
Why Shreya believes most ML problems are actually data management issues
The concept of "data flywheels" for LLM applications and how to implement them
The intriguing world of evaluating AI systems - who validates the validators?
Shreya's work on SPADE and EvalGen, innovative tools for synthesizing data quality assertions and aligning LLM evaluations with human preferences
The importance of human-in-the-loop processes in AI development
The future of low-code and no-code tools in the AI landscape
We'll also touch on the potential pitfalls of over-relying on LLMs, the concept of "Habsburg AI," and how to avoid disappearing up our own proverbial arseholes in the world of recursive AI processes.
Whether you're a seasoned AI practitioner, a curious data scientist, or someone interested in the human side of AI development, this conversation offers valuable insights into building more robust, reliable, and human-centered AI systems.
LINKS
The livestream on YouTube (https://youtube.com/live/hKV6xSJZkB0?feature=share)
Shreya's website (https://www.sh-reya.com/)
Shreya on Twitter (https://x.com/sh_reya)
Data Flywheels for LLM Applications (https://www.sh-reya.com/blog/ai-engineering-flywheel/)
SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines (https://arxiv.org/abs/2401.03038)
What Weāve Learned From A Year of Building with LLMs (https://applied-llms.org/)
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences (https://arxiv.org/abs/2404.12272)
Operationalizing Machine Learning: An Interview Study (https://arxiv.org/abs/2209.09125)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
In the podcast, Hugo also mentioned that this was the 5th time he and Shreya chatted publicly. which is wild!
If you want to dive deep into Shreya's work and related topics through their chats, you can check them all out here:
Outerbounds' Fireside Chat: Operationalizing ML -- Patterns and Pain Points from MLOps Practitioners (https://www.youtube.com/watch?v=7zB6ESFto_U)
The Past, Present, and Future of Generative AI (https://youtu.be/q0A9CdGWXqc?si=XmaUnQmZiXL2eagS)
LLMs, OpenAI Dev Day, and the Existential Crisis for Machine Learning Engineering (https://www.youtube.com/live/MTJHvgJtynU?si=Ncjqn5YuFBemvOJ0)
Lessons from a Year of Building with LLMs (https://youtube.com/live/c0gcsprsFig?feature=share)
Check out and subcribe to our lu.ma calendar (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk) for upcoming livestreams!
| |||
| Episode 31: Rethinking Data Science, Machine Learning, and AI | 09 Jul 2024 | 01:36:04 | |
Hugo speaks with Vincent Warmerdam, a senior data professional and machine learning engineer at :probabl, the exclusive brand operator of scikit-learn. Vincent is known for challenging common assumptions and exploring innovative approaches in data science and machine learning.
In this episode, they dive deep into rethinking established methods in data science, machine learning, and AI. We explore Vincent's principled approach to the field, including:
The critical importance of exposing yourself to real-world problems before applying ML solutions
Framing problems correctly and understanding the data generating process
The power of visualization and human intuition in data analysis
Questioning whether algorithms truly meet the actual problem at hand
The value of simple, interpretable models and when to consider more complex approaches
The importance of UI and user experience in data science tools
Strategies for preventing algorithmic failures by rethinking evaluation metrics and data quality
The potential and limitations of LLMs in the current data science landscape
The benefits of open-source collaboration and knowledge sharing in the community
Throughout the conversation, Vincent illustrates these principles with vivid, real-world examples from his extensive experience in the field. They also discuss Vincent's thoughts on the future of data science and his call to action for more knowledge sharing in the community through blogging and open dialogue.
LINKS
The livestream on YouTube (https://youtube.com/live/-CD66CI1pEo?feature=share)
Vincent's blog (https://koaning.io/)
CalmCode (https://calmcode.io/)
scikit-lego (https://koaning.github.io/scikit-lego/)
Vincent's book Data Science Fiction (WIP) (https://calmcode.io/book)
The Deon Checklist, an ethics checklist for data scientists (https://deon.drivendata.org/)
Of oaths and checklists, by DJ Patil, Hilary Mason and Mike Loukides (https://www.oreilly.com/radar/of-oaths-and-checklists/)
Vincent's Getting Started with NLP and spaCy Course course on Talk Python (https://training.talkpython.fm/courses/getting-started-with-spacy)
Vincent on twitter (https://x.com/fishnets88)
:probabl. on twitter (https://x.com/probabl_ai)
Vincent's PyData Amsterdam Keynote "Natural Intelligence is All You Need [tm]" (https://www.youtube.com/watch?v=C9p7suS-NGk)
Vincent's PyData Amsterdam 2019 talk: The profession of solving (the wrong problem) (https://www.youtube.com/watch?v=kYMfE9u-lMo)
Vanishing Gradients on Twitter (https://twitter.com/vanishingdata)
Hugo on Twitter (https://twitter.com/hugobowne)
Check out and subcribe to our lu.ma calendar (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk) for upcoming livestreams!
| |||
| Episode 40: What Every LLM Developer Needs to Know About GPUs | 24 Dec 2024 | 01:43:34 | |
Hugo speaks with Charles Frye, Developer Advocate at Modal and someone who really knows GPUs inside and out. If youāre a data scientist, machine learning engineer, AI researcher, or just someone trying to make sense of hardware for LLMs and AI workflows, this episode is for you.
Charles and Hugo dive into the practical side of GPUsāfrom running inference on large models, to fine-tuning and even training from scratch. They unpack the real pain points developers face, like figuring out:
- How much VRAM you actually need.
- Why memoryānot computeāends up being the bottleneck.
- How to make quick, back-of-the-envelope calculations to size up hardware for your tasks.
- And where things like fine-tuning, quantization, and retrieval-augmented generation (RAG) fit into the mix.
One thing Hugo really appreciate is that Charles and the Modal team recently put together the GPU Glossaryāa resource that breaks down GPU internals in a way thatās actually useful for developers. We reference it a few times throughout the episode, so check it out in the show notes below.
š§ Charles also does a demo during the episodeāsome of it is visual, but we talk through the key points so youāll still get value from the audio. If youād like to see the demo in action, check out the livestream linked below.
This is the "Building LLM Applications for Data Scientists and Software Engineers" course that Hugo is teaching with Stefan Krawczyk (ex-StitchFix) in January (https://maven.com/s/course/d56067f338). Charles is giving a guest lecture at on hardware for LLMs, and Modal is giving all students $1K worth of compute credits (use the code VG25 for $200 off).
LINKS
The livestream on YouTube (https://www.youtube.com/live/INryb8Hjk3c?si=0cbb0-Nxem1P987d)
The GPU Glossary (https://modal.com/gpu-glossary) by the Modal team
What Weāve Learned From A Year of Building with LLMs (https://applied-llms.org/) by Charles and friends
Charles on twitter (https://x.com/charles_irl)
Hugo on twitter (https://x.com/hugobowne)
Vanishing Gradients on twitter (https://x.com/vanishingdata)
| |||
| Episode 41: Beyond Prompt Engineering: Can AI Learn to Set Its Own Goals? | 30 Dec 2024 | 00:43:51 | |
Hugo Bowne-Anderson hosts a panel discussion from the MLOps World and Generative AI Summit in Austin, exploring the long-term growth of AI by distinguishing real problem-solving from trend-based solutions. If you're navigating the evolving landscape of generative AI, productionizing models, or questioning the hype, this episode dives into the tough questions shaping the field.
The panel features:
- Ben Taylor (Jepson) (https://www.linkedin.com/in/jepsontaylor/) ā CEO and Founder at VEOX Inc., with experience in AI exploration, genetic programming, and deep learning.
- Joe Reis (https://www.linkedin.com/in/josephreis/) ā Co-founder of Ternary Data and author of Fundamentals of Data Engineering.
- Juan Sequeda (https://www.linkedin.com/in/juansequeda/) ā Principal Scientist and Head of AI Lab at Data.World, known for his expertise in knowledge graphs and the semantic web.
The discussion unpacks essential topics such as:
- The shift from prompt engineering to goal engineeringāletting AI iterate toward well-defined objectives.
- Whether generative AI is having an electricity moment or more of a blockchain trajectory.
- The combinatorial power of AI to explore new solutions, drawing parallels to AlphaZero redefining strategy games.
- The POC-to-production gap and why AI projects stall.
- Failure modes, hallucinations, and governance risksāand how to mitigate them.
- The disconnect between executive optimism and employee workload.
Hugo also mentions his upcoming workshop on escaping Proof-of-Concept Purgatory, which has evolved into a Maven course "Building LLM Applications for Data Scientists and Software Engineers" launching in January (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?utm_campaign=8123d0&utm_medium=partner&utm_source=instructor). Vanishing Gradient listeners can get 25% off the course (use the code VG25), with $1,000 in Modal compute credits included.
A huge thanks to Dave Scharbach and the Toronto Machine Learning Society for organizing the conference and to the audience for their thoughtful questions.
As we head into the new year, this conversation offers a reality check amidst the growing AI agent hype.
LINKS
Hugo on twitter (https://x.com/hugobowne)
Hugo on LinkedIn (https://www.linkedin.com/in/hugo-bowne-anderson-045939a5/)
Vanishing Gradients on twitter (https://x.com/vanishingdata)
"Building LLM Applications for Data Scientists and Software Engineers" course (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?utm_campaign=8123d0&utm_medium=partner&utm_source=instructor).
| |||
| Episode 62: Practical AI at Work: How Execs and Developers Can Actually Use LLMs | 31 Oct 2025 | 00:59:04 | |
Many leaders are trapped between chasing ambitious, ill-defined AI projects and the paralysis of not knowing where to start. Dr. Randall Olson argues that the real opportunity isn't in moonshots, but in the "trillions of dollars of business value" available right now. As co-founder of Wyrd Studios, he bridges the gap between data science, AI engineering, and executive strategy to deliver a practical framework for execution.
In this episode, Randy and Hugo lay out how to find and solve what might be considered "boring but valuable" problems, like an EdTech company automating 20% of its support tickets with a simple retrieval bot instead of a complex AI tutor. They discuss how to move incrementally along the "agentic spectrum" and why treating AI evaluation with the same rigor as software engineering is non-negotiable for building a disciplined, high-impact AI strategy.
They talk through:
How a non-technical leader can prototype a complex insurance claim classifier using just photos and a ChatGPT subscription.
The agentic spectrum: Why you should start by automating meeting summaries before attempting to build fully autonomous agents.
The practical first step for any executive: Building a personal knowledge base with meeting transcripts and strategy docs to get tailored AI advice.
Why treating AI evaluation with the same rigor as unit testing is essential for shipping reliable products.
The organizational shift required to unlock long-term AI gains, even if it means a short-term productivity dip.
LINKS
Randy on LinkedIn (https://www.zenml.io/llmops-database)
Wyrd Studios (https://thewyrdstudios.com/)
Stop Building AI Agents (https://www.decodingai.com/p/stop-building-ai-agents)
Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Watch the podcast video on YouTube (https://youtu.be/-YQjKH3wRvc)
š Learn more:
In Hugo's course: Building AI Applications for Data Scientists and Software Engineers (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=AI20) ā https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=AI20
Next cohort starts November 3: come build with us!
| |||
| Episode 61: The AI Agent Reliability Cliff: What Happens When Tools Fail in Production | 16 Oct 2025 | 00:28:04 | |
Most AI teams find their multi-agent systems devolving into chaos, but ML Engineer Alex Strick van Linschoten argues they are ignoring the production reality. In this episode, he draws on insights from the LLM Ops Database (750+ real-world deployments then; now nearly 1,000!) to systematically measure and engineer constraint, turning unreliable prototypes into robust, enterprise-ready AI.
Drawing from his work at Zen ML, Alex details why success requires scaling down and enforcing MLOps discipline to navigate the unpredictable "Agent Reliability Cliff". He provides the essential architectural shifts, evaluation hygiene techniques, and practical steps needed to move beyond guesswork and build scalable, trustworthy AI products.
We talk through:
- Why "shoving a thousand agents" into an app is the fastest route to unmanageable chaos
- The essential MLOps hygiene (tracing and continuous evals) that most teams skip
- The optimal (and very low) limit for the number of tools an agent can reliably use
- How to use human-in-the-loop strategies to manage the risk of autonomous failure in high-sensitivity domains
- The principle of using simple Python/RegEx before resorting to costly LLM judges
LINKS
The LLMOps Database: 925 entries as of today....submit a use case to help it get to 1K! (https://www.zenml.io/llmops-database)
Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Watch the podcast video on YouTube (https://youtu.be/-YQjKH3wRvc)
š Learn more:
-This was a guest Q&A from Building LLM Applications for Data Scientists and Software Engineers (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=AI20) ā https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=AI20
Next cohort starts November 3: come build with us!
| |||
| Episode 52: Why Most LLM Products Break at Retrieval (And How to Fix Them) | 02 Jul 2025 | 00:28:38 | |
Most LLM-powered features do not break at the model. They break at the context. So how do you retrieve the right information to get useful results, even under vague or messy user queries?
In this episode, we hear from Eric Ma, who leads data science research in the Data Science and AI group at Moderna. He shares what it takes to move beyond toy demos and ship LLM features that actually help people do their jobs.
We cover:
⢠How to align retrieval with user intent and why cosine similarity is not the answer
⢠How a dumb YAML-based system outperformed so-called smart retrieval pipelines
⢠Why vague queries like āwhat is this all aboutā expose real weaknesses in most systems
⢠When vibe checks are enough and when formal evaluation is worth the effort
⢠How retrieval workflows can evolve alongside your product and user needs
If you are building LLM-powered systems and care about how they work, not just whether they work, this one is for you.
LINKS
Eric's website (https://ericmjl.github.io/)
Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Hugo's recent newsletter about upcoming events and more! (https://hugobowne.substack.com/p/stop-building-agents)
š Learn more:
Hugo's course: Building LLM Applications for Data Scientists and Software Engineers (https://maven.com/s/course/d56067f338) ā next cohort starts July 8: https://maven.com/s/course/d56067f338
šŗ Watch the video version on YouTube: YouTube link (https://youtu.be/d-FaR5Ywd5k)
| |||
| Episode 51: Why We Built an MCP Server and What Broke First | 26 Jun 2025 | 00:47:41 | |
What does it take to actually ship LLM-powered features, and what breaks when you connect them to real production data?
In this episode, we hear from Philip Carter ā then a Principal PM at Honeycomb and now a Product Management Director at Salesforce. In early 2023, he helped build one of the first LLM-powered SaaS features to ship to real users. More recently, he and his team built a production-ready MCP server.
We cover:
⢠How to evaluate LLM systems using human-aligned judges
⢠The spreadsheet-driven process behind shipping Honeycombās first LLM feature
⢠The challenges of tool usage, prompt templates, and flaky model behavior
⢠Where MCP shows promise, and where it breaks in the real world
If youāre working on LLMs in production, this oneās for you!
LINKS
So We Shipped an AI Product: Did it Work? by Philip Carter (https://www.honeycomb.io/blog/we-shipped-ai-product)
Vanishing Gradients YouTube Channel (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Hugo's recent newsletter about upcoming events and more! (https://hugobowne.substack.com/p/ai-as-a-civilizational-technology)
š Learn more:
Hugo's course: Building LLM Applications for Data Scientists and Software Engineers (https://maven.com/s/course/d56067f338) ā next cohort starts July 8: https://maven.com/s/course/d56067f338
šŗ Watch the video version on YouTube: YouTube link (https://youtu.be/JDMzdaZh9Ig)
| |||
| Episode 50: A Field Guide to Rapidly Improving AI Products -- With Hamel Husain | 17 Jun 2025 | 00:27:42 | |
If we want AI systems that actually work, we need to get much better at evaluating them, not just building more pipelines, agents, and frameworks.
In this episode, Hugo talks with Hamel Hussain (ex-Airbnb, GitHub, DataRobot) about how teams can improve AI products by focusing on error analysis, data inspection, and systematic iteration. The conversation is based on Hamelās blog post A Field Guide to Rapidly Improving AI Products, which he joined Hugoās class to discuss.
They cover:
š Why most teams struggle to measure whether their systems are actually improving
š How error analysis helps you prioritize what to fix (and when to write evals)
š§® Why evaluation isnāt just a metric ā but a full development process
ā ļø Common mistakes when debugging LLM and agent systems
š ļø How to think about the tradeoffs in adding more evals vs. fixing obvious issues
š„ Why enabling domain experts ā not just engineers ā can accelerate iteration
If youāve ever built an AI system and found yourself unsure how to make it better, this conversation is for you.
LINKS
* A Field Guide to Rapidly Improving AI Products by Hamel Husain (https://hamel.dev/blog/posts/field-guide/)
* Vanishing Gradients YouTube Channel (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
* Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
* Hugo's recent newsletter about upcoming events and more! (https://hugobowne.substack.com/p/ai-as-a-civilizational-technology)
š Learn more:
Hugo's course: Building LLM Applications for Data Scientists and Software Engineers (https://maven.com/s/course/d56067f338) ā next cohort starts July 8: https://maven.com/s/course/d56067f338
Hamel & Shreya's course: AI Evals For Engineers & PMs (https://maven.com/parlance-labs/evals?promoCode=GOHUGORGOHOME) ā use code GOHUGORGOHOME for $800 off
šŗ Watch the video version on YouTube: YouTube link (https://youtu.be/rWToRi2_SeY)
| |||
| Episode 49: Why Data and AI Still Break at Scale (and What to Do About It) | 05 Jun 2025 | 01:21:45 | |
If we want AI systems that actually work in production, we need better infrastructureānot just better models.
In this episode, Hugo talks with Akshay Agrawal (Marimo, ex-Google Brain, Netflix, Stanford) about why data and AI pipelines still break down at scale, and how we can fix the fundamentals: reproducibility, composability, and reliable execution.
They discuss:
š Why reactive execution mattersāand how current tools fall short
š ļø The design goals behind Marimo, a new kind of Python notebook
āļø The hidden costs of traditional workflows (and what breaks at scale)
š¦ What it takes to build modular, maintainable AI apps
š§Ŗ Why debugging LLM systems is so hardāand what better tooling looks like
š What we can learn from decades of tools built for and by data practitioners
Toward the end of the episode, Hugo and Akshay walk through two live demos: Hugo shares how heās been using Marimo to prototype an app that extracts structured data from world leader bios, and Akshay shows how Marimo handles agentic workflows with memory and tool useābuilt entirely in a notebook.
This episode is about tools, but itās also about culture. If youāve ever hit a wall with your current stackāor felt like your tools were working against youāthis oneās for you.
LINKS
* marimo | a next-generation Python notebook (https://marimo.io/)
* SciPy conference, 2025 (https://www.scipy2025.scipy.org/)
* Hugo's face Marimo World Leader Face Embedding demo (https://www.youtube.com/watch?v=DO21QEcLOxM)
* Vanishing Gradients YouTube Channel (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
* Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
* Hugo's recent newsletter about upcoming events and more! (https://hugobowne.substack.com/p/ai-as-a-civilizational-technology)
* Watch the podcast here on YouTube! (https://youtube.com/live/WVxAz19tgZY?feature=share)
š Want to go deeper?
Check out Hugo's course: Building LLM Applications for Data Scientists and Software Engineers.
Learn how to design, test, and deploy production-grade LLM systems ā with observability, feedback loops, and structure built in.
This isnāt about vibes or fragile agents. Itās about making LLMs reliable, testable, and actually useful.
Includes over $800 in compute credits and guest lectures from experts at DeepMind, Moderna, and more.
Cohort starts July 8 ā Use this link for a 10% discount (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=LLM10)
| |||
| Episode 48: HOW TO BENCHMARK AGI WITH GREG KAMRADT | 23 May 2025 | 01:04:25 | |
If we want to make progress toward AGI, we need a clear definition of intelligenceāand a way to measure it.
In this episode, Hugo talks with Greg Kamradt, President of the ARC Prize Foundation, about ARC-AGI: a benchmark built on Francois Cholletās definition of intelligence as āthe efficiency at which you learn new things.ā Unlike most evals that focus on memorization or task completion, ARC is designed to measure generalizationāand expose where todayās top models fall short.
They discuss:
š§ Why we still lack a shared definition of intelligence
š§Ŗ How ARC tasks force models to learn novel skills at test time
š Why GPT-4-class models still underperform on ARC
š The limits of traditional benchmarks like MMLU and Big-Bench
āļø What the OpenAI Oā results revealāand what they donāt
š” Why generalization and efficiency, not raw capability, are key to AGI
Greg also shares what heās seeing in the wild: how startups and independent researchers are using ARC as a North Star, how benchmarks shape the frontier, and why the ARC team believes weāll know weāve reached AGI when humans can no longer write tasks that models canāt solve.
This conversation is about evaluationānot hype. If you care about where AI is really headed, this oneās worth your time.
LINKS
* ARC Prize -- What is ARC-AGI? (https://arcprize.org/arc-agi)
* On the Measure of Intelligence by FranƧois Chollet (https://arxiv.org/abs/1911.01547)
* Greg Kamradt on Twitter (https://x.com/GregKamradt)
* Hugo's High Signal Podcast with Fei-Fei Li (https://high-signal.delphina.ai/episode/fei-fei-on-how-human-centered-ai-actually-gets-built)
* Vanishing Gradients YouTube Channel (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
* Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
* Hugo's recent newsletter about upcoming events and more! (https://hugobowne.substack.com/p/ai-as-a-civilizational-technology)
* Watch the podcast here on YouTube! (https://youtu.be/wU82fz4iRfo)
š Want to go deeper?
Check out Hugo's course: Building LLM Applications for Data Scientists and Software Engineers.
Learn how to design, test, and deploy production-grade LLM systems ā with observability, feedback loops, and structure built in.
This isnāt about vibes or fragile agents. Itās about making LLMs reliable, testable, and actually useful.
Includes over $800 in compute credits and guest lectures from experts at DeepMind, Moderna, and more.
Cohort starts July 8 ā Use this link for a 10% discount (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=LLM10)
| |||
| Episode 47: The Great Pacific Garbage Patch of Code Slop with Joe Reis | 07 Apr 2025 | 01:19:12 | |
What if the cost of writing code dropped to zero ā but the cost of understanding it skyrocketed?
In this episode, Hugo sits down with Joe Reis to unpack how AI tooling is reshaping the software development lifecycle ā from experimentation and prototyping to deployment, maintainability, and everything in between.
Joe is the co-author of Fundamentals of Data Engineering and a longtime voice on the systems side of modern software. Heās also one of the sharpest critics of āvibe codingā ā the emerging pattern of writing software by feel, with heavy reliance on LLMs and little regard for structure or quality.
We dive into:
⢠Why āvibe codingā is more than a meme ā and what it says about how we build today
⢠How AI tools expand the surface area of software creation ā for better and worse
⢠What happens to technical debt, testing, and security when generation outpaces understanding
⢠The changing definition of āproductionā in a world of ephemeral, internal, or just-good-enough tools
⢠How AI is flattening the learning curve ā and threatening the talent pipeline
⢠Joeās view on what real craftsmanship means in an age of disposable code
This conversation isnāt about doom, and itās not about hype. Itās about mapping the real, messy terrain of what it means to build software today ā and how to do it with care.
LINKS
* Joe's Practical Data Modeling Newsletter on Substack (https://practicaldatamodeling.substack.com/)
* Joe's Practical Data Modeling Server on Discord (https://discord.gg/HhSZVvWDBb)
* Vanishing Gradients YouTube Channel (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
* Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
š Want to go deeper?
Check out my course: Building LLM Applications for Data Scientists and Software Engineers.
Learn how to design, test, and deploy production-grade LLM systems ā with observability, feedback loops, and structure built in.
This isnāt about vibes or fragile agents. Itās about making LLMs reliable, testable, and actually useful.
Includes over $800 in compute credits and guest lectures from experts at DeepMind, Moderna, and more.
Cohort starts July 8 ā Use this link for a 10% discount (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=LLM10)
| |||
| Episode 46: Software Composition Is the New Vibe Coding | 03 Apr 2025 | 01:08:57 | |
What if building software felt more like composing than coding?
In this episode, Hugo and Greg explore how LLMs are reshaping the way we think about software developmentāfrom deterministic programming to a more flexible, prompt-driven, and collaborative style of building. Itās not just hype or griftāitās a real shift in how we express intent, reason about systems, and collaborate across roles.
Hugo speaks with Greg Ceccarelliāco-founder of SpecStory, former CPO at Pluralsight, and Director of Data Science at GitHubāabout the rise of software composition and how it changes the way individuals and teams create with LLMs.
We dive into:
- Why software composition is emerging as a serious alternative to traditional coding
- The real difference between vibe coding and production-minded prototyping
- How LLMs are expanding who gets to build softwareāand how
- What changes when you focus on intent, not just code
- What Greg is building with SpecStory to support collaborative, traceable AI-native workflows
- The challenges (and joys) of debugging and exploring with agentic tools like Cursor and Claude
Weāve removed the visual demos from the audioābut you can catch our live-coded Chrome extension and JFK document explorer on YouTube. Links below.
JFK Docs Vibe Coding Demo (YouTube) (https://youtu.be/JpXCkuV58QE)
Chrome Extension Vibe Coding Demo (YouTube) (https://youtu.be/ESVKp37jDwc)
Meditations on Tech (Gregās Substack) (https://www.meditationsontech.com/)
Simon Willison on Vibe Coding (https://simonwillison.net/2025/Mar/19/vibe-coding/)
Johnno Whitaker: On Vibe Coding (https://johnowhitaker.dev/essays/vibe_coding.html)
Tim OāReilly ā The End of Programming (https://www.oreilly.com/radar/the-end-of-programming-as-we-know-it/)
Vanishing Gradients YouTube Channel (https://www.youtube.com/channel/UC_NafIo-Ku2loOLrzm45ABA)
Upcoming Events on Luma (https://lu.ma/calendar/cal-8ImWFDQ3IEIxNWk)
Greg Ceccarelli on LinkedIn (https://www.linkedin.com/in/gregceccarelli/)
Gregās Hacker News Post on GOOD (https://news.ycombinator.com/item?id=43557698)
SpecStory: GOOD ā Git Companion for AI Workflows (https://github.com/specstoryai/getspecstory/blob/main/GOOD.md)
š Want to go deeper?
Check out my course: Building LLM Applications for Data Scientists and Software Engineers.
Learn how to design, test, and deploy production-grade LLM systems ā with observability, feedback loops, and structure built in.
This isnāt about vibes or fragile agents. Itās about making LLMs reliable, testable, and actually useful.
Includes over $2,500 in compute credits and guest lectures from experts at DeepMind, Moderna, and more.
Cohort starts April 7 ā Use this link for a 10% discount (https://maven.com/hugo-stefan/building-llm-apps-ds-and-swe-from-first-principles?promoCode=LLM10)
š Want to help shape the future of SpecStory?
Greg and the team are looking for design partners for their new SpecStory Teams productābuilt for collaborative, AI-native software development.
If you're working with LLMs in a team setting and want to influence the next wave of developer tools, you can apply here:
š specstory.com/teams (https://specstory.com/teams)
| |||
Ā© My Podcast Data