Back

Explore every episode of the podcast This Day in AI Podcast

Dive into the complete episode list for This Day in AI Podcast. Each episode is cataloged with detailed descriptions, making it easy to find and explore specific topics. Keep track of all episodes from your favorite podcast and never miss a moment of insightful content.

Rows per page:

1–50 of 142

TitlePub. DateDuration
EP77: OpenAI o1 & o1-mini, The Era of AI Reasoning & Is Reflection-70B a Fraud?13 Sep 202401:20:16

Try o1 & o1-mini: https://simtheory.ai
-----
00:00 - OpenAI o1 & o1 Mini Discussion
18:26 - Evals of OpenAI o1 & Chris Discusses Malicious Uses
32:55 - Will OpenAI o1 with Agency Take Jobs or Augment Workers?
48:58 - Does OpenAI o1 & o1 Mini Make Agency Products More Viable Now?
52:28 - Can we Build a CRM for Klarna Using OpenAI's o1? And Model Examples
1:03:37 - Is there another OpenAI model coming? Orion?
1:05:45 - Reflection 70B & Matt Schumer Drama: Was Reflection 70B a Fraud? Is it just a great prompt?

Thanks for listening!

EP76: Can AI Fix Its Own Mistakes? (Reflection 70B) & How Much Will You Pay for AI Productivity?06 Sep 202401:01:19

Join Simtheory: https://simtheory.ai
Our Community: https://thisdayinai.com
----
CHAPTERS:
00:00 - Days of AI Models Lives
04:02 - Reflection 70B Open Source Model: Is It The Best Open Source AI Model or Just Great Prompt Engineering?
24:48 - Is Microsoft Office a Dud? What Actually Makes you More Productive in Enterprise AI.
36:15 - OpenAI Floats $2,000/month for New Models Strawberry (Q*) and Orion. Is it Expensive or Cheap for Potential Gains?
55:51 - Boom Factor for Reflection 70B & Final Thoughts
-----
Thanks for listening and all of your support of the show.

EP67: Claude 3.5 Sonnet Beats GPT-4o + Ilya Sutskever's New Startup & Hedra lols21 Jun 202401:15:28

Show notes: https://thisdayinai.com/bookmarks/60-ep67
Community: https://thisdayinai.com
SimTheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Hedra Lols cold open
02:24 - Anthropic Claude 3.5 Sonnet Impressions
20:02 - Claude 3.5 Sonnet Vision Image Tests
25:32 - Claude 3.5 Sonnet Refusal Problems
28:54 - More on Claude 3.5 Sonnet, Artifacts and Future AI UI
51:41 - Hedra fun
58:29 - Thoughts on Ilya Sutskever's SSI Inc (Safe Superintelligence)
1:09:12 - Is AI Bad For Kids?

EP66: Apple Intelligence & Private Cloud Compute, Dream Machine, Mistral Funding & OpenAI Revenue14 Jun 202401:24:19

Show notes: https://thisdayinai.com/bookmarks/59-ep66
Community & discord: https://thisdayinai.com
Join SimTheory: https://simtheory.ai
-----

CHAPTERS:
00:00 - Apple Intelligence & Apple Private Cloud Compute Thoughts, Approach and Model Discussion
41:19 - LumaLab's Dream Machine
48:00 - Mistral's Fundraise & Valuation: Are AI Labs Proxies in the Big Tech AI War?
52:40 - Stable Diffusion Medium 3
56:28 - OpenAI's Revenue Leaks, AI Usage in EDU and Workplaces

Thanks for listening and your support!

EP65: AI Doomerism, Qwen2, Kling Video Generation, Mistral Fine Tuning, Will Recall Be Recalled?07 Jun 202400:57:32

Join SimTheory: https://simtheory.ai
Join the community: https://thisdayinai.com
Show notes: https://thisdayinai.com/bookmarks/57-ep65
----

CHAPTERS:
00:00: Fun with AI yet everyone is doom and gloom
13:25: Qwen2: our initial thoughts
22:12: Kling Video Generation
25:11: Mistral's Fine Tuning SDK: Chris Fine Tunes using Mistral
31:32: Looking Backwards: Streaming Video-to-Video Translation with Feature Banks & AI Deepfakes
40:06: Is Microsoft's Recall Going to Be Recalled?
44:00: The Next AI Money Making Experiment: AI Poker Agents
46:53: Apple WWDC: Will we get AI Agent Siri?
50:16: New SimTheory beta discussion

Thanks for listening!

EP64: Microsoft Build, Can We Get This Song to #1? Google AI Fail, GPT-4o, Phi-3-Vision, Mistral-7B-v0.324 May 202401:21:49

HELP US GET THIS SONG TO #1: https://www.udio.com/songs/aM5GyzoomJn4fycgyhsUyL
(Remember to keep listening and heart the song!)

Join our community: https://thisdayinai.com
SimTheory: https://simtheory.ai

Thanks for listening and all of your support, we appreciate it.

--------
CHAPTERS:
00:00 - Google's AI Overview Fails
18:50 - The Reality of Using AI
25:02 - Two weeks of GPT-4o
34:36 - Microsoft Build: CoPilot+ PCs, Recall, AI Narks, Phi-Silica, Team CoPilot, CoPilot with GPT-4o Voice
54:41 - Phi-3-Vision Testing
57:26 - Mistral-7B v0.3 Uncensored with Function Calling Testing
1:08:55 - AI Startup Bubble? Lots of AI Startups looking for buyers...
1:15:37 - Help us get this song to #1 on Udio!

EP63: GPT-4o, ChatGPT Voice & Google I/O AI Recap (Project Astra) + Future Computing Interfaces17 May 202401:42:57

Join the fun at: https://thisdayinai.com
SimTheory: https://simtheory.ai
Show notes: https://thisdayinai.com/bookmarks/55-ep63/
UDIO song: https://www.udio.com/songs/iu1381RxvjfzWznGHeVecV

Thanks for listening and all your support of the show!

CHAPTERS:
------
00:00 - We're changing the name of the show
00:52 - Thoughts on GPT-4o (GPT4 Omni), ChatGPT Free Vs Plus & impressions
27:57 - ChatGPT Voice Mode: A Dramatic Shift? Voice as a Platform: Star Trek Vs Her
34:54 - Project Astra & The Future Interface of AI Computing
52:28 - Applying AI Technologies: are the next 3 years a golden age for developers implementing AI?
55:23 - Do we have to become Cyborgs to find our keys?
1:06:24 - Google I/O AI Recap: Google's Context Caching, Tools for Project Astra, Impressions of Gemini Pro 1.5, Gemma, Gemini Flash, Veo etc.
1:37:43 - Our Favorite UDIO song of the week

LIVE: OpenAI Spring Event (Post Event Reaction)13 May 202400:59:27

LIVE after the OpenAI Spring Update Event! Hear our initial reaction to GPT-4o and "Her" like virtual assistant with low latency voice.

More testing/discussion coming later in the week.

Community: https://thisdayinai.com.

EP62: What is gpt4l-auto? Which GPT2-chatbot is best? Should you stop your education because of AI?10 May 202401:27:04

Community: https://thisdayinai.com
Show notes: https://thisdayinai.com/bookmarks/54-ep62
SimTheory: https://simtheory.ai

If you like the show, please consider leaving a comment and subscribing! We love to hear your thoughts on topics discussed on the show.

-----
CHAPTERS:
00:00 - im-a-good-gpt2-chatbot, im-also-a-good-gpt2-chatbot
02:33 - Which model is the best? gpt2-chatbots compared with current top models using "snake test"
11:07 - Are 3 New Models Coming?  GPT-4L, GPT-4L-AUTO & GPT-4-AUTO
18:41 - Privacy and trust issues with how OpenAI conducts themself
29:22 - Thoughts on OpenAI's Model SPEC & Alignment
40:21 - "Lazy Prompting" & Accessing Better Latent Space
52:21 - Should you still learn how to code or study to become a specialist because of AI?
1:02:35 - Why most people want AI to "go away": AI fear porn
1:13:20 - MAI-1: Microsoft's AI Model to Compete with Google and OpenAI
1:15:59 - News "Rapid" Fire: Udio Inpainting & Story Diffusion

EP61: What is GPT2-chatbot? MoE Theories, ChatGPT Search, Virtual Try On & Fine-Tuning Experts03 May 202401:23:05

Show Notes: https://thisdayinai.com/bookmarks/53-ep61
Community: https://thisdayinai.com
SimTheory: https://simtheory.ai

Thanks for watching, if you like the show please consider subscribing, liking and all the stuff lord youtube requires.

CHAPTERS:
----
00:00 - GPT2-chatbot: What could GPT2 Be? Is This GPT4.5 or GPT-5?
37:08 - Is OpenAI about to take on Google & Perplexity with Search? ChatGPT Search?
52:15 - Fun with Virtual Try On: IDM-VTON
1:01:30 - Anthropic Releases Claude App for iOS & Claude Teams. Should you lock your team to a single model?
1:08:37 - GeoSpy AI Hype & reality check
1:15:21 - World's First AI Music Video Using OpenAI's SORA

EP60: Rabbit r1 Launch Party, LAMs, Microsoft's Phi-3, Hume AI EVI API, Llama3 Updates & Groq Speed24 Apr 202401:01:15

Community: https://thisdayinai.com
Show Notes: https://thisdayinai.com/bookmarks/52-ep60
SimTheory with Groq Llama3: https://simtheory.ai

Thanks for listening!

Llama3 Tunes Mentioned on Show:
https://huggingface.co/Orenguteng/Lexi-Llama-3-8B-Uncensored
https://huggingface.co/sherazkhan/Mixllama3-8x8b-Instruct-v0.1
https://huggingface.co/mattshumer/Llama-3-8B-16K
https://huggingface.co/McGill-NLP/Llama-3-8B-Web

CHAPTERS:
=====
00:00 - Rabit r1 Launch Party & Can LAMs Be Useful?
13:40 - Microsoft's Phi-3 Impressions, Use Cases & Will It Kill Someone?
32:50 - Llama3, Gemini 1.5 API Closing in on GPT-4 & Llama3 on Groq
40:07 - A Week Later: SO Many Llama3 Fine Tunes and 16K Context 
43:50 - Hume AI Releases AI EVI API: Empathic Voice Interface (and Lie Detector Test)
52:11 - Meta Has Put Llama 3 Everywhere with Meta AI. What is the point?

EP59: Unhinged Meta Llama 3 *Special Edition*19 Apr 202401:26:33

Show Notes: https://thisdayinai.com/bookmarks/51-ep59
SimTheory: https://simtheory.ai
This Day in AI Community: https://thisdayinai.com

CHAPTERS:
======
00:00 - Meta Llama 3: Chris's Cheese Song & Zuck's Silver Chain
04:07 - Everything Meta Announced with Llama 3: 7B & 40B Model with 400B coming soon
21:31 - Is Groq The Ideal API Host for Llama3?
28:44 - Llama 3 Being Made Available via Meta Apps to 3B Users with Meta AI in Instagram, Whatsapp and via Web
38:01 - Llama 3 Licensing Must Include "Llama 3" 
40:52 - Llama 3 400B Model Benchmarks While Still in Training & Potential Unlimited Context? & You Can Eat Llama
1:01:51 - OpenAI Assistants API v2 & Is Tooling Important to Win Devs? Google Gemini's Mistakes
1:15:24 - Conor Update: Using VASA-1 To Deep Fake a Record Label
1:23:07 - SimTheory update: what's next from SimTheory

EP75: OpenAI🍓, Q* & Orion: What Will Happen When AI Has Agency?30 Aug 202401:13:22

Get a Simtheory AI Workspace: https://simtheory.ai
Show Notes: https://thisdayinai.com/bookmarks/69-ep75
------
00:00 - Lols
00:29 - Discussion on OpenAI's Strawberry Q* and Orion Leaks and What it Might Mean for the Future of AI Agency & Background Tasks
31:48 - Google's New Gemini 1.5 Pro & Flash Experimental Tunes: Our Thoughts
44:22 - Google's Diffusion Models are Real-Time Game Engines GameNGen & Future Model Simulations
58:06: Qwen2-VL Vision Models: Initial Thoughts
1:08:00 - Some LOLs & Surprise End of Show Guest!
----
Thanks for listening and your "average" reviews. It means a lot to us. To support the show please consider leaving a review, like, comment and all the things.

EP58: We Convinced a Record Label to Sign an AI Artist + Udio AI Music, Gemini 1.5 Pro, GPT-4 TURBO, Mixtral12 Apr 202401:09:53

AI News: https://thisdayinai.com
SimTheory: https://simtheory.ai
Show Notes: https://thisdayinai.com/bookmarks/48-ep58
-------

CHAPTERS:
00:00 - Udio, Udio Examples
10:45 - Will a Record Label Sign an AI Udio Artist?
19:09 - 3 Major LLM Updates/Release in a Single Day 
22:58 - Google Gemini 1.5 Pro General Availability, Audio Modality & Impressions
30:20 - Google Cloud Next 2024 AI Announcements Discussion
47:18 - OpenAI Announces "improvements" to GPT-4 Turbo, GPT-4 Turbo Official Release & Vision API JSON & Function Calling
57:35 - Mistral Posts BitTorrent To New Open Source Model Mixtral-8-22B
1:03:00 - Humane's AI Pin Reviews are out... and they aren't great.

Special thanks to AI artist Conor for the great content!

Thanks for listening.

EP57: Is Gary Right? VoiceEngine, Cohere Command R+, Stable Audio 2, Grok 1.505 Apr 202401:09:19

AI News & Discord: https://thisdayinai.com
Try AI on SimTheory: https://simtheory.ai
Show Notes: https://thisdayinai.com/bookmarks/46-ep57
------
CHAPTERS:
00:00 - Mike's Meta Ray Band AI Glasses With No AI
03:52 - OpenAI's Voice Engine & Voice Cloning Safety
14:03 - ChatGPT Now Has Inpainting & Comparison to BrushNet by TencentARC
19:44 - Is There a Business Model for AI Right Now? Is Gary Marcus Right?
44:31 - Cohere's Command R+ Model & Tooling
58:20 - Grok-1.5 & Grok Improving X/Twitter

Thanks for listening and supporting the show.

EP56: We Wrote a Song! Claude Opus is 👑, Gemini 1.5 Pro & Ultra API Experiments28 Mar 202401:26:53

Show notes: https://thisdayinai.com/bookmarks/45-ep56
Try Gemini 1.5 Pro on SimTheory: https://simtheory.ai/agent/865-google-gemini-15-your-ultimate-assistant
Try Gemini Ultra on SimTheory: https://simtheory.ai/agent/866-google-gemini-ultra-the-apex-of-ai-conversation
Join our community: https://thisdayinai.com

CHAPTERS
=====
00:00 - Fun with Suno v3
10:38 - We Have Google Gemini 1.5 Pro API, Google Ultra API Access!
26:21 - Claude Opus is the King According to LMSYS Chatbot Arena Leaderboard
38:25 - The Sink Sub Coding Challenge with Opus, Gemini 1.5 Pro and Gemini Ultra + Building Salesforce CRM with AI
50:06 - Amazon Invest More Billions in Anthropic
53:03 - Hume AI: Empathic AI Voice & Vision Understanding
1:01:06 - Inflection AI Absorbed into Microsoft, Microsoft is below, above and around all top AI labs.
1:09:28 - Does AI Help Students Learn? Maybe Not?
1:17:37 - Stable Code Instruct 3B, a good local coding model?
1:23:12 - Our AI Songs in Full! 

Thanks for listening, please consider subbing, liking, commenting - we love hearing from you.

EP55: Will Devin Take Our Jobs? Sora Interview, Claude Haiku, DeepSeek 7B, Figure1 & Robot Slavery15 Mar 202401:29:00

Show Notes: https://thisdayinai.com/bookmarks/42-ep55
SimTheory Claude Haiku Agent: https://simtheory.ai/agent/795-claude-haiku-chatbot
Sign up for daily AI news: https://thisdayinai.com

====
CHAPTERS
00:00 - OpenAI CTO Mira Murati Sora Interview Train Wreck
16:47 - EU Passes the AI Act
24:25 - 1 year since Greg Brockman Unveiled GPT-4 + Cognition's Devin
52:34 - Anthropic Releases Claude 3: Haiku & It's REALLY GOOD!
1:05:20 - DeepSeek-7B Real World Vision Language Understanding
1:16:09 - It's all about the training data, why Tesla might win Robotics & Vision
1:17:27 - Figure1 Robot with OpenAI for Vision and Language + Discussion on Robot Slavery
====

Please consider subscribing if you like the podcast! Thanks for listening.

EP54: Claude 3, Gemini 1.5 1M Context Seinfeld Experiment, OpenAI's DramaAI and Inflection 2.508 Mar 202401:36:02

Join SimTheory: https://simtheory.ai
Try Claude Opus: https://simtheory.ai/agent/689-claude-opus-your-conversational-companion
Subscribe to This Day in AI Daily News: https://thisdayinai.com
Show Notes: https://thisdayinai.com/bookmarks/41-ep54
Seinfeld Trivia Results: https://docs.google.com/spreadsheets/d/1crRzGE_JbQCIR5dEW_ORAq1QA9Yr8qquonZLILQRUpE/edit#gid=0

====
This week we cover Anthropic's impressive Claude 3 Opus, Sonnet and Haiku releases and play with Google's Gemini 1.5 1M Context using all the Seinfeld episodes ever written. We reluctantly recap and discuss the latest OpenAI drama, the Elon Musk lawsuit and finally cover Inflection's Inflection 2.5 release now available on Pi.

If you like the show sub, like, comment to feed the YouTube gods for us. xo.

CHAPTERS:
====
00:00 - Anthropic Claude 3
36:05 - Is The Future of Programming LLM Function Abstraction?
47:13 - Google Gemini 1.5 1M Context Experiments
1:08:38 - If You Had AGI Tomorrow What Would You Do?
1:12:13 - OpenAI's DramaAI & Elon Musk Lawsuit
1:29:38 - Inflection 2.5 Release on Pi

EP53: Mistral Large, Forecasting with LLMs, The Gemini Pile On & Is CoPilot Using GPT-4.5?01 Mar 202401:23:38

Show notes: https://thisdayinai.com/bookmarks/39-ep53
Join SimTheory: https://simtheory.ai
Try Mistral Large on SimTheory: https://simtheory.ai/agent/645-mistral-large
Join our community: https://thisdayinai.com
====

This week we talk about the release of Mistral's Large model, Mistral Le Chat, and their deal with Microsoft Azure. We cover papers on Emote Portrait Alive, AI Lip Reading and Cover the Gemini Pile On and how it is distracting from Gemini and the 1M context size break through. We cover the great "data sale" of both Reddit, Tumblr and Stackoverflow data and discuss the Forecasting with LLM paper from Berkeley.  We also cover Klarna's 700 support agent replacing AI agents and ask... is Sydney Back with GPT-4.5?

====

CHAPTERS:
00:00 - Cold open
00:44 - A Tough Week for AI Influencers
02:29 - Mistral Large, Mistral Le Chat & Microsoft Azure Partnership
30:31 - EMO: Emote Portrait Alive
36:26 - VSP-LLM: Visual Speech Processing incorporated with LLMs. AI Lip reading tech.
40:06 - The Google Gemini Pile On / Backlash: Is it taking attention away from 1M context breakthrough?
55:25 - The Great AI Training Data Sale: Reddit, Tumblr, Stackoverflow
1:00:34 - Forecasting with LLMs Paper: Can AI Predict The Future?
1:10:15 - Klarna Says They Replace 700 Humans with AI
1:18:07 - Is Microsoft's CoPilot Update Really GPT-4.5?

====

If you like the podcast please consider subscribing, comment, liking and all the things required to feed the YouTube overlords.

EP52: The Groq Breakthrough, Google's Gemma 7B, Unlimited Context, Can 'Magic' Reason?22 Feb 202400:55:00

Show notes: https://thisdayinai.com/bookmarks/32-ep52
Groq Mixtral: https://simtheory.ai/agent/567-groq-mixtral-edition
Groq Llama: https://simtheory.ai/agent/566-groq-the-speed-oriented-chat-companion
SimTheory: https://simtheory.ai
====
This week we discuss Groq's LPU Chips and the implications of low cost low latency LLMs on custom hardware. We revisit our prank calling to see if Groq's low latency gives an advantage and see if we can improve Air Canada's chatbot. We discuss the launch of Google's Open Source Gamma 7B release and Magic's $148M fundraise for an AI co-worker who can reason. We also cover ChatGPT losing it's mind during the week.

If you like the show, please consider subscribing. Thanks for listening.

====
Chapters:
00:00 - Groq, Groq API and Retell with Groq
32:48 - Google Gemma 7B Open Source Model
39:04 - The 'Magic' Breakthrough on Reasoning and Context
50:19 - Sounds for OpenAI Sora Thanks to ElevenLabs Sound FX
51:59 - ChatGPT Goes Haywire

EP51: OpenAI's Sora, Gemini Pro 1.5 10M Context, ChatGPT Memory, GraphRAG, ChatRTX, Microsoft UFO...16 Feb 202401:29:19

Show Notes: https://thisdayinai.com/bookmarks/28-ep51/
Sign up for daily This Day in AI: https://thisdayinai.com
Try Stable Cascade: https://simtheory.ai/agent/508-stable-cascade
Join SimTheory: https://simtheory.ai
======

This week we take several shots of vodka before trying to make sense of all the announcements. OpenAI attempted to trump Google's Gemini 1.5 with the announcement of Sora, 1 minute video generation that does an incredible job of keeping track of objects. Google showed us that up to 10M context windows are possible with multi-modal inputs. We discuss if a larger context window could end the need for RAG and take a first look at GraphRAG by Microsoft hoping to improve RAG with a knowledge graph. We road test Nvidia's ChatRTX on our baller graphics cards and Chris tries to delete all of his files using Microsoft UFO, a new open source project that uses GPT-4 vision to navigate and execute tasks on your Windows PC. We cover briefly V-JEPA (will try for next weeks show) and it's ability to learn through watching videos and listening, and finally discuss Stability's Stable Cascade which we've made available for "research" on SimTheory.

If you like the show please consider subscribing and leaving a comment. We appreciate your support.

======
Chapters:
00:00 - OpenAI's Sora That Creates Videos Instantly From Text
13:49 - ChatGPT Memory Released in Limited Preview
23:31 - OpenAI Rumored To Be Building Web Search, Andrej Karpathy Leaves OpenAI, Have OpenAI Slowed Down?
33:04 - Google Announces Gemini Pro 1.5. Huge Breakthrough 10M Context Window!
50:11 - Microsoft Research Publishes GraphRAG: Knowledge Graph Based RAG
1:02:03 - Nvidia's ChatRTX Road Tested
1:07:18 - AI Computers, AI PCs & Microsoft's UFO: An Agent for Window OS Interaction. Risk of AI Computers.
1:18:46 - Meta's V-JEPA: new architecture for self-supervised learning
1:24:26 - Stability AI's Stable Cascade

EP50: We Bet $1000 Using Gemini Advanced, Qwen1.5 72B, Retell AI, Apple's MGIE & GOODY-209 Feb 202401:01:18

Subscribe to ThisDayInAI: https://thisdayinai.com
Try AI Agents on SimTheory:
https://simtheory.ai
Show notes:
https://thisdayinai.com/bookmarks/6-ep50

Tell us your thoughts on Gemini here: https://thisdayinai.com/post/62-your-thoughts-gemini-advanced/

Thanks to everyone for all your support and kind reviews to reach 50 episodes! Please consider leaving us a review wherever you get your podcasts.
=====

This week we cover the launch of Google Gemini Advanced, Gemini Ultra 1.0 and Bard being Renamed to Gemini. We compare GPT-4, Gemini Ultra 1.0 and Qwen 1.5 72B by sports betting $1000 on horse racing.

We celebrate 50 episodes and share our excited for Qwen 1.5 72B's performance at coding and quick refusals. We cover new releases including SyncLabs and Retell AI and Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models.

Finally, we discuss GOODY-2 and it's high refusal rate.

=====
CHAPTERS:

00:00 - Betting $1,000 To Compare Gemini Ultra 1.0 to GPT-4 to Qwen 1.5
07:33 - Google Gemini Advanced, Ultra: Details of Announcement and First Impressions
25:48 - OpenAI is Developing Agents to Control Your Devices
27:40 - Celebrating 50 Episodes of This Day in AI
30:34 - Qwen 1.5 72B: We're Impressed!
42:47 - SyncLabs: Tested & Impressions
47:58 - Retell AI: Tested & Impressions
54:18 - Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models
58:10 - GOODY-2: The World's Most Responsible AI Model

EP49: Our Big Announcement + GPT-4 Update, Code Llama, LLaVA-1.6, YOLO World, EAGLE-7B & Bard Images02 Feb 202401:15:51

Join our new community: https://thisdayinai.com.
View the show notes here:
https://thisdayinai.com/bookmarks/2-ep49/
Build AI Agents & Try AI From The Show: https://simtheory.ai

If you enjoy the podcast, please consider leaving us a review wherever you get your podcasts.

====
In this episode we reveal the new ThisDayinAI.com community website. We discuss the latest GPT-4 updates, Code Llama 70B open-source release and first impressions, we play around with the new LLaVA-1.6 release and are impressed by its capabilities. We also look at YOLO World and discuss the impact of EAGLE-7B and RWKV Language Models. Finally, we cover Bard's horrible new image creation feature and censorship. 

CHAPTERS:
====
00:00 - Introducing ThisDayInAI.com Community
5:10 - Be Careful What You Wish For! Mike Gets Spam Called by AI
16:16 - OpenAI Announces "improved" GPT-4 Preview Model to Make GPT-4 Less Lazy
27:00 - LLaVA-1.6: Improved reasoning, OCR, and world knowledge
34:00 - YOLO-World: Real-Time Open-Vocabulary Object Detection
45:11 - RWKV an RNN with GPT-level LLM performance and EAGLE7B Impressions
58:16 - Google Bard's New Highly Censored Image Creation Feature
1:07:13 - Will Google Bard be Renamed to Google Gemini?

EP74: Human Eggs with Ideogram 2.0, Phi 3.5 Boom Factor + AI-Free Startups23 Aug 202401:13:09

Sign up to Simtheory for an AI workspace: https://simtheory.ai
Try ideogram 2.0 on Simtheory
---
CHAPTERS:
00:00 - Ideogram 2.0: Your new AI graphics designer?
23:46 - Microsoft Phi 3.5 Initial Impressions & Thoughts + Boom Factor
38:51 - AI workspace productivity: how much is your productivity worth?
55:08 - Procreate's Anti AI Movement: Marketing or a New Category?
1:07:06 - Chris's thoughts on Phi-3.5 Fine Tuning & Lack of Documentation, Accessibility of Models to Try
---
To see images from the show join our Discord community: https://thisdayinai.com

Show notes: https://thisdayinai.com/bookmarks/68-ep74

Thanks for listening, your comments, reviews and support of the show. We really appreciate it and love hearing from you.

PS. Tasmanian YouTuber Chris mentions: https://www.youtube.com/@UCalOFVbIxEAWIV5LHGkKcnw

EP48: Llama3 Confirmed, Elevenlabs Voice Dubbing, Prompt Compression, Does RAG Make ChatGPT Worse?25 Jan 202401:11:00

Thanks for listening, we appreciate your support of the podcast.

This week we discuss Mark Zuckerberg confirming Llama 3, road test Elevenlabs Voice Dubbing, the state of AI apps and subscriptions, practical use cases of AI interacting with our world, does RAG make ChatGPT worse? Prompt compression with LongLLMLingua and how it might solve the attention problem, experiments with new image models including PhotoMaker and some LOLs to end the show.

AGENTS MENTIONED ON SHOW:
======
AI Phone Call On SimTheory:
https://simtheory.ai/agent/332-flirtatious-phone-call-assistant
MidJourney 6 on SimTheory:
https://simtheory.ai/agent/395-midjourney-image-creator
MidJourney 6 Video Creator:
https://simtheory.ai/agent/400-animate-midjourney-images

MORE LINKS:
======
Join the Discord: https://discord.gg/3gxM9H8qpv
Build an AI Agent: https://simtheory.ai

To support the show (and if you enjoy it) please consider becoming a paying subscriber to SimTheory to help us cover costs of agents, models and experiments we do for the show. Plus get access to every model, modality and the latest AI tech e.g. phone calling in a single place.

CHAPTERS
======
00:00 - Mark Zuckerberg Confirmed Llama 2 In Training
03:39 - Elevenlabs Voice Dubbing Service Tested
09:28 - Discussion on Research Labs, Apps & Future of AI App Business Models
18:43 - Bland.ai Update with Real World Examples & The Future of AI Agents & Agency interacting with our "analogue world"
30:56 - Nick Dobos Says RAG Makes ChatGPT Worse. Can Compression Help?
35:32 - LongLLMLingua and Prompt Compression
46:45 - Image Models: Photo Maker & Experiments with Image Generation
1:01:45 - LOLs including Rabbit r1 Fail, Claude Multi-Modal Leak, DPD Chat


SOURCES
======
https://www.youtube.com/watch?v=YeemJlrNx2Q
https://twitter.com/Stocktwits/status/1748043532340789570
https://twitter.com/danhendrycks/status/1749316795138552228
https://elevenlabs.io/dubbing
https://twitter.com/natfriedman/status/1750199867308433634
https://twitter.com/NickADobos/status/1749837866300264529?s=20
https://twitter.com/NickADobos/status/1749957909449187837?s=20
https://lumiere-video.github.io/
https://twitter.com/felix_red_panda/status/1749522604027682946?s=20
https://twitter.com/andrewcurran_/status/1747661100865511750?s=46
https://twitter.com/ashbeauchamp/status/1748034519104450874?s=20

PAPERS
======
https://arxiv.org/pdf/2310.06839.pdf
https://arxiv.org/pdf/2310.06839.pdf
https://arxiv.org/pdf/2312.04461v1.pdf

EP47: GPT-5 Rumors, AutoGen Studio, SeeAct Web Agents, Google AMIE, Anthropic’s Sleeper Agents17 Jan 202401:26:12

Build AI Agents & Try AI Agents From The Show On SimTheory: https://simtheory.ai
Join Discord: https://discord.gg/aphwE5snuq
Get Merch: https://www.thisdayinaimerch.com/

DESCRIPTION
====
In this episode, we dive into the buzz around GPT-5, sparked by Sam Altman's revelations on Bill Gates' latest podcast. We share our top hopes and dreams for GPT-5 and future AI advancements. Next, we delve into Microsoft's new CoPilot Pro Subscription, exploring how it stands out from ChatGPT Plus. Chris takes AutoGen Studio for a spin and ponders over its ideal user base. The episode then shifts to the intriguing concept of collaborative AI agents - is this the path to AI's mastering reasoning, reflection, and profound thought? We dissect the insights from the SeeAct Web Agents study, assessing its influence on AI agent development. Shifting gears, we discuss Google AMIE's groundbreaking ability to outperform doctors in diagnoses, even those assisted by AI. To wrap up, we spotlight the significance of Anthropic's Sleeper Agents experiment and its groundbreaking findings.

Thanks for listening. Please consider subscribing if you haven't already and leaving a review. We appreciate all of your support!

CHAPTERS:
====
00:00 - Cold Open
00:31 - GTP-5 Rumors & Leaks
07:32 - Microsoft CoPilot Pro
22:27 - Microsoft's AutoGen Studio: An open-source UI for AutoGen
38:53 - The Future of AI Agents? LAMs and SeeACT Web Agent Paper
1:00:19 - Google AMIE: Can AI Replace Doctors for Diagnosis?
1:13:12 -Anthropic's Sleep Agents Experiment

SOURCES:
====
https://twitter.com/arrakis_ai/status/1745672203683942863?s=20
https://twitter.com/daniacostaai/status/1746554047878824409?s=46
https://blogs.microsoft.com/blog/2024/01/15/bringing-the-full-power-of-copilot-to-more-people-and-businesses/
https://twitter.com/emollick/status/1747359731595763817
https://microsoft.github.io/autogen/blog/2023/12/01/AutoGenStudio/
https://osu-nlp-group.github.io/SeeAct/
https://blog.research.google/2024/01/amie-research-ai-system-for-diagnostic_12.html
https://www.bloomberg.com/news/articles/2024-01-14/artificial-intelligence-will-affect-almost-40-of-jobs-imf-says
https://twitter.com/Teknium1/status/1746067427379798344

PAPERS:
====
https://arxiv.org/pdf/2401.01614.pdf
https://arxiv.org/pdf/2401.05654.pdf
https://arxiv.org/pdf/2401.05566.pdf

EP46: Prank Calls with AI, Rabit r1, GPT Store Released, ChatGPT Teams & LUMA Genie12 Jan 202401:13:43

Try AI Voice Calling: https://simtheory.ai/agent/332-flirtatious-phone-call-assistant
Join Our Discord: https://discord.gg/s7bCFV4gTr
Join SimTheory: https://simtheory.ai

In this episode we put Bland.ai to the test. We try out their new AI technology for voice calls that can react and respond in near real time by prank calling our local hardware and pet stores.

We also discuss the launch of more AI dedicated hardware in the Rabit r1, the GPT Store now it's finally released with over 3M GPTs, discuss GPT Teams, LUMA, AudioBox and ask, are we in an AI bubble?

If you like this episode please consider liking, subscribing and commenting. Thanks for watching!

CHAPTERS
====
00:00 - Our call to the hardware store
00:30 - Bland.ai Voice Calling with AI
03:04 - Prank Calling a Hardware Store with AI
11:21 - Calling a Pet Grooming Store with AI
18:15 - Thoughts in AI Hardware, Cherry Picked AI Demos & Rabit r1
35:35 - OpenAI Releases GPT Store with 3M GPTs, Cloning Problem & Initial Reactions
45:22 - OpenAI Releases ChatGPT Teams
47:57 - ChatGPT Memory
49:26 - LUMA Genie, The Metaverse & Vision Pro Apps
55:05 - The AI Jailbreak Problem & Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
1:00:52 - Meta AudioBox
1:03:38 - Microsoft Overtakes Apple as Most Valuable Company - Is it because of AI? And is AI a Bubble?


SOURCES:
====
https://twitter.com/usebland/status/1743411488612913429
https://chats-lab.github.io/persuasive_jailbreaker/
https://www.rabbit.tech/
https://twitter.com/Dan_Jeffries1/status/1745404485298459106
https://twitter.com/abacaj/status/1745474794638745892?s=20
https://twitter.com/AravSrinivas/status/1745489529551905159
https://openai.com/blog/introducing-the-gpt-store
https://twitter.com/NickADobos/status/1745244031381291164/photo/2
https://twitter.com/AndrewCurran_/status/1744918982174429432?s=20
https://openai.com/blog/introducing-chatgpt-team
https://lumalabs.ai/genie
https://audiobox.metademolab.com/maker
https://www.miri.health/

EP45: We're Back! GPT Store Next Week, Gemini Pro & Gemini Vision, Mixtral API, AnyText, NYTimes Lawsuit04 Jan 202401:18:49

It's great to be back! In this episode we cover everything new and everything we missed during our break. We start with breaking news that the OpenAI ChatGPT GPT Store is being released next week, then cover Gemini Pro and Gemini Pro Vision API, Mixtral APIs, AnyText, NY Times Copyright lawsuit and finally.. get excited about a dishwashing robot!

====
Join SimTheory: https://simtheory.ai
Join Discord: https://discord.gg/aphwE5snuq
Get Merch: https://www.thisdayinaimerch.com/

Try models from the show:
====
Gemini Pro: https://simtheory.ai/agent/282-google-gemini-assistant
Mixtral: https://simtheory.ai/agent/129-miss-mistra-mistral-medium
Stable Diffusion Video: https://simtheory.ai/agent/224-image-to-video-creation-agent
AI Movie Trailer Maker: https://simtheory.ai/agent/279-ai-movie-trailer-maker

CHAPTERS:
====
00:00 - Mike's AI Movie Trailer Intro
02:05 - GPT Store Will go Live Next Week
22:52 - Gemini Pro API & Gemini Pro Vision Road Tested (literally)
33:34 - Mixtral API: Mistral Platform API Tested
45:31 - Stable Video Diffusion
48:12 - Pika AI Video General Availability
52:05 - Stability AI Memberships
55:54 - Prompt Injection for DALL-E with Public Domain
57:34 - New York Times Sues OpenAI & Microsoft for Copyright Infringement
1:04:49 - Inpainting with AnyText
1:14:15 - Microsoft CoPilot App with GPT-4 Now On iOS and Android
1:14:39 - One More Thing: The Dishwasher Bot

SOURCES:
====
https://time.com/6551496/mickey-mouse-public-domain-steamboat-willie/
https://twitter.com/digthatdata/status/1742074049260621976?s=46
https://www.theguardian.com/media/2023/dec/27/new-york-times-openai-microsoft-lawsuit
https://www.reuters.com/technology/apple-explores-ai-deals-with-news-publishers-new-york-times-2023-12-22/
https://twitter.com/rowancheung/status/1742967393310368222/photo/1
https://www.theinformation.com/briefings/openai-to-launch-chatbot-store-next-week?rc=kvsmhw
https://blog.google/technology/ai/gemini-api-developers-cloud/
https://mistral.ai/news/mixtral-of-experts/
https://mistral.ai/news/la-plateforme/
https://stability.ai/news/stable-video-diffusion-open-ai-video-model
https://simtheory.ai/share/d49c8c00-9fda-40aa-b386-a7c27455015b/
https://pika.art/
https://stability.ai/membership
https://twitter.com/venturetwins/status/1742976476432196100?s=46
https://github.com/tyxsspa/anytext
https://www.theverge.com/2023/12/29/24019288/microsoft-copilot-app-available-iphone-ipad-ai
https://mobile-aloha.github.io/

EP44: The Finale: Google Gemini, SimTheory, Is Ilya OK? Predictions for 202408 Dec 202301:18:27

Join our discord: https://discord.gg/zqz5fVyx7m
Get the merch: https://thisdayinaimerch.com
Try Agents & Models on SimTheory: https://simtheory.ai

In our final episode for the year, we cover the surprise announcement of Google's Gemini AI models and give our first impressions. We road test Gemini Pro on Bard and discuss the likely impact of Gemini on the market and developer ecosystems. Then it's time for our holiday gift: SimTheory. Now you can use AI agents we mention on the show including our virtual girlfriends, Sports Betting with AI and many more! You can even create your own agents to try different models using the same tools we use to prepare for the show. We then discuss if Ilya is OK and the drama at OpenAI. And finally, we make predictions for 2024 and cover some of Meta's latest announcements.

Thanks for watching, listening and all your support through 2023. We really appreciate it and will see you early next year!

CHAPTERS:
=====
00:00 - Google Gemini is Here? Kinda
38:48 - Our Holiday Gift: SimTheory: Virtual Girlfriend, Sports Betting with AI Agents
51:15 - Is Ilya OK? Is GPT-4 Slowness About Cost Reductions?
56:26 - NexusRaven-V2-13B for function calling: is this the future of specialized fine tune models?
1:00:14 - Our Predictions for AI in 2024
1:12:54 - Meta announces AI Alliance for AI Openness + Updates to Meta AI Characters and SeamlessExpressive
1:15:43 - Final thoughts and thank you

SOURCES:
=====
https://blog.google/technology/ai/google-gemini-ai/
https://twitter.com/tunguz/status/1732444203437695387
https://twitter.com/tunguz/status/1732444203437695387
https://twitter.com/tunguz/status/1732444203437695387
https://twitter.com/tunguz/status/1732444203437695387
https://techcrunch.com/2023/12/07/early-impressions-of-googles-gemini-arent-great/
https://twitter.com/clementdelangue/status/1732138699901809042
https://huggingface.co/Nexusflow/NexusRaven-V2-13B
https://twitter.com/abemurray/status/1732723510810759369
https://ai.meta.com/blog/ai-alliance
https://techcrunch.com/2023/12/06/metas-ai-characters-are-now-live-across-its-u-s-apps-with-support-for-bing-search-and-better-memory/
https://techcrunch.com/2023/12/06/meta-ai-adds-reels-support-and-reimagine-a-way-to-generate-new-ai-images-in-group-chats/
https://seamless.metademolab.com/expressive
https://twitter.com/mattrickard/status/1731889331516936261

EP43: Is GPT-4 Lazy? Wizard 33B, Qwen 72B Tested & Self Operation AI Computer01 Dec 202301:12:36

Join the discord: https://discord.gg/27mQ9cut
Get the merch: https://thisdayinaimerch.com

This week we celebrate ChatGPT's 1 Year Anniversary and Ask is GPT-4 Lazy? We explore the best of open source with Wizard 33B and test China's Qwen 72B model from Alibaba. Chris tries to delete all files from his computer using Self Operation AI Computer and we cover Amazon's AWS Ignite AI announcements, Stability Diffusion XL Turbo, The Scalable Extraction Attack on ChatGPT and an exciting waitlist release from PIKA. 

Like, sub, comment if you enjoy the episode to support the show. We love hearing from you.

CHAPTERS:
=====
00:00 - Cold Open
00:08 - ChatGPT 1 Year Anniversary
07:54 - Is GPT-4 Lazy? Is Claude Unusable Now?   
18:43 - Are Open-Source Models Catching Up 1 Year On?
21:57 - Wizard 33B Open-Source Model
24:55 - Demo of Wizard 33B
28:26 - China's Qwen 72B Open-Source Model
31:26 - Qwen Demo
38:16 - Self Operation Computer Discussion & The Future of AI With Access to Computers
49:23 - Scalable Extraction: DeepMind's COMPANY attack to extract training data from ChatGPT
55:20 - Stability Diffusion XL Turbo, Stability's Stability & Commercial Subscriptions
1:03:23 - Amazon's AWS Ignite: Amazon Q, Trainium 2, Bedrock Fine Tuning
1:07:49 - PIKA Video
1:09:26 - Important News

SOURCES:
======
https://arstechnica.com/information-technology/2023/11/chatgpt-was-the-spark-that-lit-the-fire-under-generative-ai-one-year-ago-today/
https://twitter.com/emollick/status/1729604442826170586?s=46
https://twitter.com/krishnanrohit/status/1729353613498261597?s=46
https://arxiv.org/pdf/2311.16989.pdf
https://twitter.com/huybery/status/1730127387109781932/photo/1
https://arxiv.org/pdf/2309.16609.pdf
https://github.com/OthersideAI/self-operating-computer/tree/main
https://arxiv.org/pdf/2311.17035.pdf
https://twitter.com/ayushsoni_io/status/1730128497572462695
https://stability.ai/news/stability-ai-sdxl-turbo
https://pika.art/
https://venturebeat.com/ai/amazon-awss-barrage-of-gen-ai-announcements-aim-to-outdo-microsoft/

EP42: What Did Sam Altman Do? Q* & AGI? LLM OS, Claude 2.1, Stable Video Diffusion and Suno Fun!24 Nov 202301:25:46

Join Our Discord: https://discord.gg/58HtZnVD
Buy The Merch: https://www.thisdayinaimerch.com/

This week we reluctantly cover all the OpenAI drama and ask What Did Sam Altman Actually Do? Is Q* a path to AGI or just one big "look over here" distraction so we stop asking all these questions... We also cover Andrej Karpathy's LLM OS vision, discuss Claude 2.1 and how bad it's become thanks to "safety" and discuss our initial impressions of Stable Video Diffusion. Finally, we have some fun with Suno!

If you like this podcast, please consider subscribing and liking this episode. We appreciate the support.

CHAPTERS:
====
00:00 - A Full Recap of What Happened with Sam Altman & OpenAI
10:06 - What Did Sam Altman Actually Do? 
28:03 - What Did Ilya Really Discover? Is Q* A Big Distraction? How Far Ahead if OpenAI?
40:47 - Will This Drama Help Progress Open Source AI?
51:11 - Is Andrej Karpathy's LLM OS Vision The Future?
1:00:25 - Inflection-2 LLM
1:02:35 - Stable Video Diffusion Initial Thoughts
1:06:40 - Claude 2.1 Announcement 200K Context
1:21:26 Fun with Suno AI: Make Music with a Prompt

SOURCES:
====
https://www.theinformation.com/briefings/openais-employee-share-sale-to-continue-after-altman-returns?rc=kvsmhw
https://twitter.com/openai/status/1727236805182026159?s=46
https://openai.com/blog/openai-announces-leadership-transition
https://twitter.com/ylecun/status/1727727656118923296?s=46
https://www.theinformation.com/articles/openai-dramas-first-season-ends-but-second-season-is-possible?rc=kvsmhw
https://www.theinformation.com/articles/openai-made-an-ai-breakthrough-before-altman-firing-stoking-excitement-and-concern?rc=kvsmhw
https://inflection.ai/inflection-2
https://stability.ai/news/stable-video-diffusion-open-ai-video-model
https://www.anthropic.com/index/claude-2-1
https://app.suno.ai/
https://www.reuters.com/technology/sam-altmans-ouster-openai-was-precipitated-by-letter-board-about-ai-breakthrough-2023-11-22/

EP41: Are GPTs the Future or All Hype? Microsoft AI Ignite & Is Open Source at GPT-4 Level?17 Nov 202301:07:48

Join the discord: https://discord.gg/Gcc8FXPK
Get the merch:
https://thisdayinaimerch.com

Support the show by leaving a like, comment or sharing with a friend. We appreciate it!

DESCRIPTION:
=======
This week we discuss what's happened since OpenAI's Dev Day: Sam Altman has stopped ChatGPT Plus Subscriptions Due to Demand, GPTs have been leaking their prompts and data, and thousands of people have been busy creating GPTs... but are they any good? We also discuss Microsoft AI Ignite and share our thoughts on Microsoft's new Azure Hardware, Microsoft CoPilot Studio, Azure AI Studio and all the other Microsoft AI Ignite News. We discuss can Open-Source AI Now Compete with GPT-4? And Cover Google Lyria Music AI and Meta's EMU Video and Emu Edit.


CHAPTERS:
=======
00:00 - OpenAI Stops Taking GPT Plus Subscribers. Subscription for sale on eBay
1:30 - Are GPTs Just a New Enthusiasm Phase, The Future or All Hype?
16:28 - Will GPTs just Become Functions and Processes with Proprietary Data?
22:53 - Early GPT Data Leaks & Unsafe Prompts
24:05 - Monetization of GPTs
29:24 - Microsoft AI Ignite: Azure Chips, Microsoft CoPilot Studio, Azure AI Studio
43:41 - Can Open-Source AI Now Compete with GPT-4? 
48:27 - The OpenAI Dilemma: Microsoft & Open Source Threats
50:58 - What are The Killer Use Cases for AI?
56:50 - Google Lyria: The Future of Music Creation?
1:02:52 - Meta's EMU Video and Emu Edit AI research milestones.

SOURCES:
=======
https://twitter.com/sdand/status/1724629169483719104/photo/1
https://www.searchenginejournal.com/openai-pauses-new-chatgpt-plus-subscriptions-due-to-surge-in-demand/501360/
https://twitter.com/levelsio/status/1722744926004269309?s=46
https://twitter.com/fzaslavskiy/status/1723731923149754542?s=46
https://deepmind.google/discover/blog/transforming-the-future-of-music-creation/
https://ai.meta.com/blog/emu-text-to-video-generation-image-editing-research/
https://emu-video.metademolab.com/
https://www.youtube.com/watch?v=eFkCGTb7Z8E
https://www.theverge.com/2023/11/15/23960417/microsoft-copilot-ai-studio-custom-gpts-chatgpt-openai
https://www.theverge.com/2023/11/15/23960471/microsoft-windows-ai-studio-nvidia-developers
https://huggingface.co/TheBloke/goliath-120b-GGUF?text=Hey+my+name+is+Julien%21+How+are+you%3F
https://www.reddit.com/r/LocalLLaMA/
https://twitter.com/hamelhusain/status/1722637811176902779?s=46

EP40: Open AI Dev Day Recap: Custom GPTs, GPT-4 Turbo, Assistants API Discussion & Test Drive10 Nov 202301:35:03

Join the discord: https://discord.gg/nkUnyD44
Get the "We're all wrapper apps" merch:
https://thisdayinaimerch.com

Dive into the riveting world of AI development with Mike & Chris and their deep dive into OpenAI's latest offerings, including the much-anticipated GPTs. From the technical nitty-gritty to the potential for monetization, this podcast peels back the layers of AI's future. The bros hands-on experience with creating custom AI models reveals the reality behind the hype, offering a candid look at the promises versus the actual deliverables in the AI industry. Whether you're an AI aficionado or a tech enthusiast, this episode is your front-row seat to the unfolding narrative of AI's capabilities and its impact on the tech landscape.

CHAPTERS:
=====

00:00 - Recap & Thoughts on Custom GPTs, GPT "Apps", GPT Store and Future GPTs
38:41 - GPT-4 Turbo, GPT-4 Vision & 128k Context Possibilities
50:43 - GPT-4 Vision as part of GPT-4 Turbo API
53:54: Fine Tuning Models for Speed & Cost
56:09 - Assistants API: Vendor Lock In?
58:06 - Wrapper Apps, GPT discoverability and Monetization of GPTs
1:08:54 - Was GPT3.5 Default 16k Turbo The Biggest Announcement?
1:12:53 - OpenAI TTS Text To Speech Voices: Better than Eleven Labs & PlayHT?
1:15:35 - What Google Would Need to Deliver with Gemini to Win Back Devs
1:15:50 - Fine Tuning Custom GPT Models for Custom GPTs
1:18:38 - GPT-4 Fien Tuning Experimental Access
1:22:16 - Is a UI SDK next for Custom GPTs?
1:24:34 - Custom Trained Models from OpenAI for $2-3M
1:25:57 - Will Hardware Kill OpenAI? Is Hardware Distribution Key for Apple and Google to Win Long Term?
1:30:42 - Other AI News from the Week: GitHub CoPilot AI First & GH200s

SOURCES:
=====
https://openai.com/blog/new-models-and-developer-products-announced-at-devday
https://twitter.com/altryne/status/1721989500291989585?s=20
https://www.oneusefulthing.org/p/almost-an-agent-what-gpts-can-do
https://twitter.com/karpathy/status/1721977139938185492?s=46
https://twitter.com/BenjaminDEKR/status/1722397663939965170/photo/2
https://github.com/langchain-ai/opengpts
https://openai.com/blog/introducing-gpts
https://www.youtube.com/watch?v=WFM2pvj00oc

LIVE: Reaction to OpenAI DevDay, Opening Keynote06 Nov 202301:17:40

This is a recording of the live event on YouTube following the OpenAI DevDay keynote. We'll be back with a regular episode later this week.

Sharkey and Sharkey amped up on caffeine live react to OpenAI's latest announcements. Cost reductions, larger models, and an app store?! The duo banter and bicker about whether this marks excitement or irrelevance for devs like you. Plus Elon Musk teases a GPT-style model without the handcuffs - does this spell trouble for Big Sam? Sharkey and Sharkey think out loud and solicit hot takes from listeners on the implications.

We cover:

  • All the news from OpenAI DevDay
  • Reactions from our community
  • xAI Grok (briefly)
  • GPTs and the GPT store


Join the discord: https://discord.gg/sA6anFq2
Get the merch: https://thisdayinaimerch.com

EP73: Has Google Done It? Grok 2 Beta & Is Tuning All You Need?16 Aug 202401:24:54

Sign up to Simtheory: https://simtheory.ai
-------
00:00 - Reactions to #madebygoogle and Gemini Live
15:30 - Grok 2 Beta Tested & Is Grok Getting Flux Credit?
39:03 - Future of Personalized Software in Education & The Workplace: Are Devs Still Needed?
1:02:16 - Claude's Prompt Caching Explained
1:11:18 - Hermes 3 (Llama 3.1 Fine-tuned for instruction following)
1:19:18 - Is Tuning All You Need? Why Claude Sonnet 3.5 is so good.

Thanks for watching/listening/subscribing/liking/commenting and reviewing  our average podcast each week. It means a lot to us.

You can join our community here: https://thisdayinai.com or try Simtheory: https://simtheory.ai.

EP39: White House AI Executive Order, The Bletchley Declaration & Adversarial AI Attacks02 Nov 202301:06:59

Join our Discord: https://discord.gg/TRrgAyeM
Buy the merch: https://www.thisdayinaimerch.com/ 

This week the AI guys unpack the White House's sweeping executive order on regulating AI - will this lead to the death of open-source models? They also discuss the vague and fluffy Bletchley Declaration signed by world leaders, why Geoffrey Hinton just won't stop fearmongering, and introduce some hilarious new merch including a life-size shower curtain! Tune in for hot takes on the AI ethics debate, prompt engineering tricks, and key insights on the future of language models.


CHAPTERS:
=====
00:00 - King Charles on AI (Cold Open)
00:20 - Thoughts on White House AI Executive Order 
23:09 - The Bletchley Declaration & AI Safety Summit
38:04 -  LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2 & They Killed Tay!
48:34 - Adversarial Attacks and Defenses in Large Language Models: Old and New Threats Paper
51:51 - Mike proposes What The Future of AI Computing Might Look Like
55:00 - Leaked: The Secret Prompt Powering ChatGPT's New Multi-Tool Mode (and How to Hack It)

1:01:39 - Anthropic Have Raised More Billions & Our Merch Store!

SOURCES:
======
https://www.whitehouse.gov/briefing-room/presidential-actions/2023/10/30/executive-order-on-the-safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence/
https://www.aisnakeoil.com/p/what-the-executive-order-means-for
https://www.gov.uk/government/publications/ai-safety-summit-2023-the-bletchley-declaration/the-bletchley-declaration-by-countries-attending-the-ai-safety-summit-1-2-november-2023
https://www.businessinsider.com/sam-altman-and-demis-hassabis-just-want-to-control-ai-2023-10
https://twitter.com/ylecun/status/1718263147591573949?s=20
https://twitter.com/ldjconfirmed/status/1718456393026490523
Leaked Prompt: https://raw.githubusercontent.com/spdustin/ChatGPT-AutoExpert/main/_system-prompts/all_tools.md
https://www.cnbc.com/2023/10/27/google-commits-to-invest-2-billion-in-openai-competitor-anthropic.html

PAPERS:
======
https://arxiv.org/pdf/2310.17688.pdf
https://arxiv.org/pdf/2310.20624.pdf
https://arxiv.org/pdf/2310.19737v1.pdf

EP38: Ed Sheeran Listens to Our Podcast, Deep Fakes & Frontier Risks and AI Ears: SALMONN Model27 Oct 202301:08:13

Join the Discord: https://discord.gg/2j6k7AXw

This week, juicy revelations from Ed Sheeran and Taylor Swift's secret love affair! We also discuss the latest mind-blowing AI innovations, including talking heads, vision models that can see from every angle, and intelligent agents plotting world domination. Don't miss our spicy debate on whether AI will transform humanity or destroy us all. Plus advice from Chris on picking up virtual girlfriends using neural networks - this episode has it all!

Please note the Ed Sheeran bit is a joke (please don't sue us haha) and an example of a deep fake and deep fake technology for comedy. Please Ed. We're begging you.

Please consider reviewing the podcast to support the show. We read them all and they mean a lot to us :).

CHAPTERS
=====
00:00 - Ed Sheeran Actually Listens to Our Podcast
02:17 - Frontier Risk and Preparedness, Deep Fakes & VideoReTalking
15:06 - ByteDance's SALMONN AI Audio, Music, Sound Model for AI Hearing
23:01 - Adept's fuyu 8B Vision Model: The Future of How AI Agents Navigate the Web?
34:41 - Multiple Agents in the Metaverse & Zero123++ Making Single Images into 3D Objects
46:42 - Google's Gemini Leaks & Stubbs + Our Failed Gemini Leaker Source
50:17 - Is AI Boring? Chris Roasts Jacob Browning
1:03:41 - Bing's Sydney is Still Trying to Escape & Threatening Humanity

SOURCES:
=====
https://openai.com/blog/frontier-risk-and-preparedness
https://openai.com/form/preparedness-challenge
https://github.com/OpenTalker/video-retalking
https://venturebeat.com/ai/tiktok-makers-new-ai-salmonn-understands-all-audio-not-just-music-and-voices/
https://github.com/OpenTalker/video-retalking
https://huggingface.co/adept/fuyu-8b
https://www.adept.ai/
https://arxiv.org/pdf/2310.15110.pdf
https://twitter.com/dylan522p/status/1716937534490435874?s=46
https://medium.com/@bedros-p/gemini-is-coming-to-makersuite-so-are-stubbs-32248f3924aa
https://medium.com/@bedros-p/stubbs-is-coming-form-your-own-opinions-386489a3f844
https://twitter.com/ylecun/status/1717616244600238358?s=46
https://www.jacob-browning.com/post/generative-ai-is-boring
https://twitter.com/MichaelTontchev/status/1715876157105791138/photo/1

EP37: Fun With PlayHT 2.0, Will Open Source Be Unbeatable? The Future of AI Models + Meta MEG20 Oct 202301:08:23

JOIN DISCORD HERE: https://discord.gg/BA7Rfx69

This week's podcast is an electric shock - new open source AI models like Zephyr are generating content at lightning speed. We dive into the implications of AI on everything from gambling to parenting as new tech lets you clone voices in seconds. Meta can now read your mind and turn brain waves into images, while AI judges sports better than humans ever could. Don't miss our demo of using AI to simulate a full-on financial phishing scam call!

CHAPTERS:
=====
00:00 - AI Chris with PlayHT 2.0 Turbo Cold Open
11:11 - Experience with PlayHT 2.0 Turbo & Thoughts
12:45 - Will Open Source Be Unbeatable? Zephyr 7B Alpha Road Tested
27:54 - OpenAI's Arrakis Failure: Is the Focus Now Small Models? Is Open Source Catching Up?
35:18 - GPT-4V Now Widely Available: Queue the Visual Prompt Injections!
41:03 - DALL-E Prompt Leaks & Yelling at AI in ALL CAPS to Make it Work
44:49 - Stack Overflow Layoffs & Future Disruption from AI: Are we Prepared for What is Coming?
57:12 - Meta's MEG: Reading Our Thoughts & Improve AI Neural Nets by Copying the Human Brain
1:00:16 - Are We Now Desensitized to AI Progress?
1:01:53 - AI Sports Analysis: Will it Change Sport? 

SOURCES:
=====
https://news.play.ht/post/introducing-playht-2-0-turbo-the-fastest-generative-ai-text-to-speech-api
https://twitter.com/ylecun/status/1713342883925790952?s=46
https://twitter.com/victormustar/status/1713307274888810521?s=46
https://huggingface.co/HuggingFaceH4/zephyr-7b-alpha
https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat
https://www.theinformation.com/articles/openai-dropped-work-on-new-arrakis-ai-model-in-rare-setback?rc=kvsmhw
https://devday.openai.com/
https://twitter.com/fabianstelzer/status/1712790589853352436?s=20
https://simonwillison.net/2023/Oct/14/multi-modal-prompt-injection/
https://twitter.com/wunderwuzzi23/status/1712996819246957036
https://openai.com/blog/dall-e-3-is-now-available-in-chatgpt-plus-and-enterprise
https://twitter.com/javilopen/status/1714748189482381653?s=46
https://www.theverge.com/2023/10/16/23919004/stack-overflow-layoff-ai-profitability
https://twitter.com/sergeykarayev/status/1714725576223977808?s=46
https://ai.meta.com/blog/brain-ai-image-decoding-meg-magnetoencephalography
https://www.froedtert.com/epilepsy/diagnostics/magnetoencephalography
https://twitter.com/BrianRoemmele/status/1714060169490124899/video/1
https://www.reddit.com/r/StableDiffusion/comments/17b4dfc/my_first_try_with_video/

PAPERS:
=====
https://arxiv.org/pdf/2310.11441.pdf
https://ai.meta.com/static-resource/image-decoding

EP36: ChatGPT Vision Road Tested, AutoGen Cheese Test & Anthropic's Break Through13 Oct 202301:12:46

Join the discord: https://discord.gg/bb6VZHks

This mind-blowing episode explores the shocking capabilities of GPT-4 vision, including how it identified Mike's exact location just from a traffic photo. We dive into the cheese-filled insanity of using multiple AI agents together with AutoGen, and discuss Anthropic's groundbreaking research into neural superposition. Don't miss our dramatic exposé of Meta's new lobotomized AI chatbots - this episode takes you on a wild ride through the cutting edge of AI!

If you like this episode please consider subscribing and leaving a comment or review.

CHAPTERS
=====
00:00 - Shakespearean AutoGen DRs
00:29 - ChatGPT Vision Road Tested, Augmented Intelligence & Comparison to LLaVA Vision 
31:25 - The Cost of AI: Is There A Business Model With Margin?
36:52 - The Value of AI is in Productivity: Discussion of Business Models
43:19 - AutoGen Agents Cheese Test: Do Multi-Agents Perform Better?
58:09 - Anthropic's AI Break Through: Decomposing Language Models Into Understandable Components

SOURCES:
=====
https://llava.hliu.cc/
https://www.wsj.com/tech/ai/ais-costly-buildup-could-make-early-products-a-hard-sell-bdd29b9f
https://techcrunch.com/2023/10/09/chatgpts-mobile-app-hit-record-4-58m-in-revenue-last-month-but-growth-is-slowing/
https://www.wired.com/story/generative-ai-chatgpt-is-coming-for-sales-jobs/
https://twitter.com/amasad/status/1712276988403294610?s=46
https://www.anthropic.com/index/decomposing-language-models-into-understandable-components
https://twitter.com/julesterpak/status/1711766882934534631?s=20

PAPERS:
=====
https://transformer-circuits.pub/2023/monosemantic-features/index.html
https://arxiv.org/pdf/2304.08485.pdf

EP35: AI Safety Gone Mad, Stable 3B Cheese Test, GPT4 Vision & DALL-E 3 Diversity + Sydney is BACK!06 Oct 202301:17:39

Not too late to join Discord community, do it here: https://forms.gle/rSx9dYoqc1qxX6sx5. Invites going out today!

Thanks for helping us reach 2K subs here on YouTube!

This week we dive into the wild world of AI image generation and vision, from racist cartoon captions to heartfelt poetry written by Bing. We discuss the implications of teaching AI to forget unwanted knowledge, and debate whether safety controls are protecting users or limiting creativity. Get ready for philosophical ponderings, hilarious experiments, and our signature irreverent takes as we explore the latest AI advances and absurdities. Whether you're an expert or just fascinated by the future, this episode will challenge your thinking and give you plenty to discuss with friends.

CHAPTERS
======
00:00 - Fooling Bing Vision to Solve Captcha
00:26 - Meta's Messenger AI Stickers Out of Control! AI Safety Discussion
06:17 - More Safety Nonsense: The Low-Resource Language Jailbreak GPT-4 Paper
9:36 - More on Mistral 7B (Safety and Positive Reception)
17:31 - Friends and Foes of Open Source AI & Is Anthropic a Crypto-like Scam for Billions? 
21:26 - Turnitin Thinks It Can Detect AI, Being a Student in an AI World
24:25 - Stable 3B LLM Review and Cheese Test Results
38:48 - DALL-E 3 Road Test on ChatGPT & Diversity Prompt Injection Problems
48:12 - Using Bing GPT4-Vision to Solve Captchas for Grandma
51:01 - The Dawn of LLMs, Explorations with GPT-4Vision Paper + Possibilities of AI Vision
1:04:00 - Who's Harry Potter? Making LLMs forget 
1:09:00 - Google Assistant with Bard AI
1:10:18 - LLaMA Long 32K Initial Thoughts
1:12:40 - Sydney Bing is Back BABY!
1:15:36 - Comments on Discord Rollout and Survey Response

SOURCES
======
https://twitter.com/ibogost/status/1709629850359628211
https://twitter.com/paul_rottger/status/1707430998600831424?s=46
https://www.theinformation.com/articles/openai-rival-anthropic-in-talks-to-raise-2-billion-from-google-others-as-ai-arms-race-accelerates
https://twitter.com/abacaj/status/1709455939231772962?s=46
https://twitter.com/ylecun/status/1708149902784799121?s=46
https://twitter.com/rustykitty_/status/1709316764868153537
https://stability.ai/blog/stable-lm-3b-sustainable-high-performance-language-models-smart-devices
https://twitter.com/neilkli/status/1709450248186167715/photo/4
https://twitter.com/ItakGol/status/1708541450722414798/photo/2
https://www.oneusefulthing.org/p/the-shape-of-the-shadow-of-the-thing
https://www.microsoft.com/en-us/research/project/physics-of-agi/articles/whos-harry-potter-making-llms-forget-2/
https://techcrunch.com/2023/10/04/google-assistant-is-getting-ai-capabilities-with-bard/
https://venturebeat.com/ai/meta-quietly-releases-llama-2-long-ai-that-outperforms-gpt-3-5-and-claude-2-on-some-tasks/
https://twitter.com/lumpenspace/status/1709773644203708527/photo/2

PAPERS
======
https://arxiv.org/pdf/2310.02446.pdf
https://arxiv.org/pdf/2309.17421.pdf

EP34: Meta's AI Agents, Mistral 7B Road Tested (with Cheese) & ChatGPT Vision28 Sep 202301:00:24

Want to join our Discord? https://forms.gle/k8TyUeWKGWHFBzwQ9. Invites will go out next week!

This week's AI show is off the charts with excitement! We dive deep into the latest AI announcements from Meta, including creepy camera glasses and lame celebrity "AI agents". Then we gush over the impressive new Minstral model and its cheese diagnosing abilities. Plus, we freak out over Tesla's crazy new human-like robot that can intuitively stack blocks - this mind-blowing bot signals the imminent rise of AGI! Don't miss this jam-packed episode full of the hottest AI news and spiciest takes.
- Hype written by AI.

If you like this episode please consider subscribing, liking and all the things. Thanks for watching.

CHAPTERS
00:00 - About the Discord Community / Plug
01:02 - Reacting to Meta Connect: AI Agents, Stickers, Image Tools, Meta Ray bands with "Hey Meta", Meta AI ChatGPT Competitior, & EMU Images
16:52 - OpenAI announces ChatGPT Vision for ChatGPT Plus & Enterprise Users + Voices for ChatGPT & Web Browsing is Back!
21:52 - Sam Altman & Jony Ives Reported to Be Discussing AI Hardware Project + Future AI Chips Discussion
24:01 - Chris Road Tests Mistral 7B (with Cheese) and is Impressed!
34:35 - Giraffe Llama v2 70B 32K is Marketing Hype
40:49 - Tesla Optimus Robot Latest Video + Optimus Vs Boston Dynamics Fight Proposal
45:17 - Microsoft Wants to Use Nuclear Energy to Power AI Data Centers
48:56 - Mike's Virtual AI Girlfriend Has High Expectations + AI Memory Innovations & Final Thoughts on AI Agents

SOURCES

https://about.fb.com/news/2023/09/introducing-ai-powered-assistants-characters-and-creative-tools/
https://www.theverge.com/2023/9/27/23891128/meta-ai-assistant-characters-whatsapp-instagram-connect
https://twitter.com/boztank/status/1707105576424198290
https://techcrunch.com/2023/09/27/meta-debuts-ai-studio-to-let-developers-build-custom-chatbots/
https://twitter.com/verge/status/1707105410786701770?s=46
https://www.maginative.com/article/a-deep-dive-inside-emu-metas-new-image-generation-ai-model/
https://openai.com/blog/chatgpt-can-now-see-hear-and-speak
https://twitter.com/0xgaut/status/1707079424007365057?s=46
https://www.wsj.com/tech/ai/openai-seeks-new-valuation-of-up-to-90-billion-in-sale-of-existing-shares-ed6229e0?mod=followamazon
https://twitter.com/petergyang/status/1707169696049668472?s=46
https://www.theinformation.com/articles/designer-jony-ive-and-open-ais-sam-altman-discuss-ai-hardware-project?rc=kvsmhw
https://mistral.ai/news/announcing-mistral-7b/
https://techcrunch.com/2023/09/25/amazon-to-invest-up-to-4-billion-in-ai-startup-anthropic/
https://huggingface.co/abacusai/Giraffe-v2-70b-32k
https://twitter.com/Tesla_Optimus/status/1705728820693668189
https://jobs.careers.microsoft.com/global/en/job/1627555/Principal-Program-Manager-Nuclear-Technology
https://www.theinformation.com/articles/how-microsoft-is-trying-to-lessen-its-addiction-to-openai-as-ai-costs-soar?utm_source=ti_app&rc=kvsmhw

EP33: AI WARS: Gemini Vs Gobi, DALL-E3, Alexa AI, Open Interpreter & Llama2 Experiments22 Sep 202301:05:12

Do you want to join our Discord community? Fill in this: https://forms.gle/k8TyUeWKGWHFBzwQ9.

The AI wars are heating up as Google and OpenAI race to release the first multimodal LLM. We build a sync sub video game with just one prompt using open Interpreter. Alexa shows off scary new conversational abilities, while poets sell out to Big Tech. Join us for the latest AI battles - but don't get your hopes up, it's not that exciting!

SOURCES

  • https://www.bloomberg.com/news/articles/2023-09-20/chatgpt-usage-is-rising-again-as-students-return-to-school#xj4y7vzkg
  • https://www.theinformation.com/articles/openai-hustles-to-beat-google-to-launch-multimodal-llm?rc=kvsmhw
  • https://twitter.com/emollick/status/1704560486111658209?s=20
  • https://openai.com/dall-e-3
  • https://twitter.com/nickfloats/status/1704592748303827276?s=20
  • https://www.aboutamazon.com/news/devices/amazon-alexa-generative-ai
  • https://neuralink.com/blog/first-clinical-trial-open-for-recruitment/
  • https://openinterpreter.com/
  • https://openai.com/blog/red-teaming-network
  • https://restofworld.org/2023/ai-developers-fiction-poetry-scale-ai-appen/

If you like the show please consider sharing with friends and leaving a comment.


EP32: Does AI Remember Your Unethical Requests? Chuck's AI Forum, Robot Ethics, & LLM Deception15 Sep 202301:06:29

This week's episode is an absolute barnstormer, covering everything from robots burning in stadium fires to AI girlfriends with dangerous memories. Get ready for an action-packed ride as we dive into the dark realities of AIs keeping naughty lists, journalism being taken over by plagiarizing robots, and whether downloading your brain into an android body means you can laugh in the face of death. Buckle up and grab some popcorn, because this week's episode is one wild ride from start to finish!

(Written by AI lol)

If you like the pod please support us by leaving a review wherever you get your podcasts and sharing with friends.

CHAPTERS
====
00:00 - "What if I could download your soul?" Cold Open
00:56 - Chuck's AI Forum, Regulation and What We Should Be Focusing On
11:02 - Deceptive Abilities Emerging in LLM Paper Discussion
24:03 - Large Language Models and Optimizers: Take a Deep Breath
30:50 - 5 Years to Discover Capabilities of Current Models
33:52 - a16z Report on How Consumer are Using LLMs
39:41 - Are Your Androids Going to Be Criminals? Implications of AI Robots in Society
47:25 - US Copyright Offices Denies AI Created Image Copyright & Microsoft Will Legally Defend Paid Users of AI CoPilot
55:27 - Stable Audio: Mike's Paid Customer Stable Audio Experience
59:48 - Open Interrupter: Open-Source Version of OpenAI's Code Interrupter
1:02:22 - ChatGPT Journalist Leaves Prompt in Article. LOLs. 

SOURCES
====
https://www.bbc.com/news/technology-66804996
https://twitter.com/emollick/status/1700207590607552740?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://twitter.com/emollick/status/1702141069616452079?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://a16z.com/how-are-consumers-using-generative-ai/
https://www.foxsports.com.au/nrl/nrl-premiership/nrlw-match-report-published-after-author-forgets-to-remove-chatgpt-prompts/news-story/13aa9b48bb0fc10dfe79aeb6c50381a2
https://www.marca.com/en/nfl/los-angeles-chargers/2023/09/14/650343b6ca4741fe388b45b2.html
https://blogs.microsoft.com/on-the-issues/2023/09/07/copilot-copyright-commitment-ai-legal-concerns/
https://www.reuters.com/legal/litigation/us-copyright-office-denies-protection-another-ai-created-image-2023-09-06/
https://stableaudio.com/pricing-with-account
https://twitter.com/alphasignalai/status/1702363289160651000?s=46&t=uXHUN4Glah4CaV-g2czc6Q

PAPERS
====
https://arxiv.org/ftp/arxiv/papers/2307/2307.16513.pdf
https://arxiv.org/pdf/2309.03409.pdf

EP31: Fine-Tuned MrBeast Model Results, Chris Makes a Game, AGI Safety Paper + ERNIE GPT08 Sep 202300:56:52

Anthropic and OpenAI continue their awkward dance as they both court developers, while Apple spends millions training the next Siri. And when an AI generates its own MrBeast video, hilarity and fake deaths ensue. Tune in to hear the bros' spicy takes on the latest in the AI world!

Consider liking and subbing if you like the show. Thanks for watching!

CHAPTERS
======
00:00 - Cold open: sparks of AGI
00:20 - Fine Tuning a MRBEAST AI Model Experiment Results & Prompt2Model
09:41 - The Realities of AI Theory Vs Reality: Trying to Implement Papers
12:46 - Making SinkSub Game with AI using ChatDEV
19:37 - Code Llama Paper & Models Grounded in Mathematical Truth, False Refusals
31:16 - The Only Path to Controllable AGI Paper Discussion (Max Tegmark)
40:55 - Baidu's ERNIE China's ChatGPT: Our Review
48:13 - Mike's Claude Prediction Comes True: Anthropic Release Claude PRO
50:35 - OpenAI Announce AI Developer Conference November 6th 2023
52:59 - Apple Siri AJAX Rumors: Apple ChatGPT?
54:25 - Time's 100 Most Influential People in AI LOLs

SOURCES:
======
https://www.chinatalk.media/p/how-ernie-chinas-chatgpt-cracks-under
https://openai.com/blog/announcing-openai-devday
https://twitter.com/anthropicai/status/1699776481009053806?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://www.theverge.com/2023/9/6/23861763/apple-ai-language-models-ajax-gpt-training-spending
https://twitter.com/ylecun/status/1699779376961712473?s=46&t=uXHUN4Glah4CaV-g2czc6Q

PAPERS:
======
https://arxiv.org/pdf/2307.07924v3.pdf
https://arxiv.org/abs/2308.12950
https://arxiv.org/abs/2309.01933

EP30: ChatGPT Enterprise, Are Wrapper Apps Doomed? Prompt2Model & Synthetic Training Data.31 Aug 202301:06:44

This week we dive into the implications of OpenAI's new ChatGPT Enterprise release - will it crush the competition or lead to an AI monopoly? Then we debate whether fine-tuning models on synthetic data is the holy grail and discuss using it to train our own MrBeast video plot generator. We round up by laughing at Google's absurd new "AI" meeting assistant, and an awkward robot that needs to be told to shut up. 

If you like the show, consider subscribing, liking and leaving a comment. We love hearing from you.

CHAPTERS:
====
00:00 - Shut up! Cold Open
00:34 - ChatGPT Enterprise, OpenAI Strategy, Wrapper Apps
22:09 - CoTracker Model from Meta & Meta's Strategy with AI
27:19 - Is ChatGPT Enterprise the Death Blow to Wrapper Apps?
37:53 - Ideogram Vs MidJourney: Advancements in Text on Images
43:09 - Prompt2Model: This Day in Synthetic Training Developers + Mr Beast Video Idea Generator
57:58 - Google's Cloud Next AI Event: Get your AI to Attend Meetings for You!
1:02:54 - Sky News on AI & Robotics: Shut Up!

SOURCES:
====
https://openai.com/blog/introducing-chatgpt-enterprise
https://www.theinformation.com/articles/openai-passes-1-billion-revenue-pace-as-big-companies-boost-ai-spending
https://sparktoro.com/blog/we-analyzed-millions-of-chatgpt-user-sessions-visits-are-down-29-since-may-programming-assistance-is-30-of-use/
https://co-tracker.github.io/
https://ideogram.ai/
https://www.reddit.com/r/ArtificialInteligence/comments/164tt0w/googe_had_an_ai_conference_today_did_anyone_care/
https://www.theverge.com/2023/8/29/23849056/google-meet-ai-duet-attend-for-me
https://www.youtube.com/watch?v=6sZzEEb3G0w

PAPERS
====
Prompt2Model: https://arxiv.org/pdf/2308.12261v1.pdf

EP72: Croc Test with Gemini 1.5 Experimental, Flux Destroys Midjourney & GPT4o Model Updates07 Aug 202401:17:52

Sign up to SimTheory:
https://simtheory.ai
------
Join our community: https://thisdayinai.com
------
Jump around:
00:00 - Gemini 1.5 Experimental Experiments
20:11 - SimTheory
22:54 - LMSYS Leaderboard: Does it match our experience?
27:31 - Flux by Black Forest Labs is Better Than MidJourney
48:04 - OpenAI announces new GTP4o (50% cheaper inputs) & structured outputs
1:12:35 - Groq raises 640M to meet "soaring demand" will this fix unreliability?

Thanks for listening, if you like this show please consider leaving a review.

EP29: Meta's Code Llama, Unnatural Instruction, Phishing Our Mother & OpenAI's GPT3.5 Fine Tuning25 Aug 202301:03:44

This week, the Zuck strikes again - Meta unveils a state of the art AI code generator to challenge OpenAI's dominance. We explore the implications of AI models training themselves, and how it could accelerate capabilities. Then we put 11 labs' multilingual speech synthesis to the test, using it to generate a fake phishing call on our mother. Don't miss our scandalous experiments pushing AI to its limits in this jam-packed episode!

If you like the pod, please consider subbing, liking, commenting etc. xox

CHAPTERS:
=====
00:00 - Rehearsal of Phishing Our Mother (Cold Open)
00:19 - Meta's Code Llama
08:24 - Unnatural Instruction to Train AI Models
15:06 - Why Didn't Meta Release the Unnatural Instruction Code Llama Model? The Sparks of AGI?
16:50 - Evolution of GPT: Is Unnatural Instruction The Next Evolution of Models?
23:04 - DeepMind's Reinforced Self-Training ReST for Language Modeling paper and thoughts on future models
36:09 - Fine Tuning GPT-3.5 Turbo Announced by OpenAI: Should You Just Fine Tune Open Source?
44:05 - ElevenLabs Out of Beta and Multilingual v2: Explained by AI Us.
48:12 - Chris Tried to Figure Out AI Phishing
53:03 - Rehearsing Phishing Our Mother Call & Implications of This AI Tech
59:43 - How Much We Lost Not Investing in NVIDIA
1:01:29 - AI Bros Give Investment Advice

SOURCES:
======
https://ai.meta.com/blog/code-llama-large-language-model-coding/
https://www.theinformation.com/articles/metas-next-ai-attack-on-openai-free-code-generating-software
https://twitter.com/emollick/status/1694793231727210579?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://minimaxir.com/2023/08/stable-diffusion-xl-wrong/
https://twitter.com/abacaj/status/1679996952560246786/photo/1
https://openai.com/blog/gpt-3-5-turbo-fine-tuning-and-api-updates
https://arstechnica.com/ai/2023/08/how-chatgpt-turned-generative-ai-into-an-anything-tool/
https://elevenlabs.io/blog/multilingualv2/
https://www.businessinsider.com/nvidia-technology-spending-wave-build-out-google-meta-oracle-gpu-2023-8

PAPERS:
======
https://arxiv.org/pdf/2212.09689.pdf
https://arxiv.org/pdf/2308.08998.pdf

EP28: What is Poop? Is Generative AI a Dud? Will OpenAI Go Bankrupt? + Llama2 Uncensored18 Aug 202301:09:25

This week your favorite AI bros go deep on the BIG LIES - how big tech and the mainstream narrative are trying to SILENCE revolutionary AI models that threaten to EXPOSE inconvenient truths!  Tune in as Chris and Michael SHRED Meta's attempt to gag their new science AI Galactica, and discuss the CENSORSHIP built into aligned models like Claude and GPT-4.  Don't miss their hilarious takedown of noted AI alarmist Gary Marcus - his latest flip-flop proves generative AI is HERE TO STAY! The truth will not be suppressed!

(Note description written by AI for lols)

Thanks for helping us reach our goal of 100+ reviews on Apple Podcasts. It means a lot to us!

CHAPTERS
=====
00:00 - What is a Poop?
01:08 - Is Generative AI a Dud?
23:32 - OpenAI Acquires Global Illumination to work on ChatGPT
31:12 - Anthropic Raises $100M from Korean SK Telecom
37:15 - LLAMA2 Uncensored: Censorship, Misinformation and the Battle for Truth
48:31 - Meta's AI Trained on 48M Science Papers Shut Down After 2 Days

SOURCES
=====
https://garymarcus.substack.com/p/what-if-generative-ai-turned-out
https://www.judiciary.senate.gov/imo/media/doc/2023-05-16%20-%20Testimony%20-%20Marcus.pdf
https://www.sbs.com.au/whats-on/article/where-experts-naysayers-and-everyone-else-saw-the-internet-heading-in-the-90s/l7rfeapbm
https://garymarcus.substack.com/p/what-exactly-are-the-economics-of
https://twitter.com/emollick/status/1690906554747265024?s=20
https://techcrunch.com/2023/08/14/ai-startup-anthropic-raises-100m-from-korean-telco-giant-sk-telecom/
https://huggingface.co/jarradh/llama2_70b_chat_uncensored
https://twitter.com/boriquagato/status/1691800167857750274?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://www.cnet.com/tech/computing/ai-and-you-googles-news-ambitions-putting-a-face-on-subway-fare-dodgers/
https://openai.com/blog/openai-acquires-global-illumination

EP27: Have We Reached AI Disillusionment? GPTBOT Web Crawler, Nvidia's AI GH200s, Zoom AI Scandal11 Aug 202301:03:02

This week we gossip about OpenAI's shady web crawling habits, laugh at Zoom's lame excuses for spying, and dream up the perfect AI crypto scam. Get the inside scoop on Nvidia's new trillion-parameter instrument of AI, hear our hot takes on the public's growing AI disillusionment, and find out what an AI HVAC administrator would sound like. Join your favorite AI bros as they dive deep on the latest AI hype and hardware gossip - this episode is chock full of spicy AI tea you won't want to miss!

Please consider leaving a review to help us reach 100 reviews if you listen on Apple Podcasts :)

CHAPTERS:
====
00:00 - We Should Totally Do An AI Crypto Scam
00:27 - AI Meal Planner Suggests Chlorine Gas Recipe & AI with Personality
04:07 - OpenAI's GPTBot for Web Crawling
08:47 - Stealing content with AI, How to Protect Your IP from AI
14:37 - Zoom's Terms of Service for AI Training Scandal
25:25 - Nvidia's GH200 Announcement & Availability of Hardware
34:29 - Have We Reached AI Disillusionment?
52:54 - Generative AI LLMs for HVAC!?
55:31 - Claude Instant Version 1.2 Released
57:33 - AudioLDM 2: Text-to-audio/speech generation 
1:00:41 - Skeptics Vs Optimists for AI (AI Crypto Bros)

SOURCES
====
https://www.theguardian.com/world/2023/aug/10/pak-n-save-savey-meal-bot-ai-app-malfunction-recipes
https://saveymeal-bot.co.nz/ingredients
https://platform.openai.com/docs/gptbot
https://news.ycombinator.com/item?id=37030568
https://admiralcloudberg.medium.com/critical-conversations-the-crash-of-eastern-airlines-flight-212-660f47698887
https://news.ycombinator.com/item?id=37021160
https://www.axios.com/2023/08/09/zooms-terms-service-changes-ai-fears
https://www.nvidia.com/en-au/data-center/dgx-gh200/
https://twitter.com/authority_ai/status/1688619238389379073?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://twitter.com/SashaKaletsky/status/1676957007985922051
https://twitter.com/emollick/status/1688760539441217536/photo/1
https://twitter.com/anthropicai/status/1689303697535414272?s=46&t=uXHUN4Glah4CaV-g2czc6Q

EP26: Software Teams Replaced for $1.40, Doctors Out-Diagnosed, Meta's Audio Craft, ChatGPT Updates04 Aug 202301:09:48

This week we dive into the brave new world of AI agents teaming up to do real work - from building video games to diagnosing patients! But will these digital workforces put humans out of jobs? We discuss the AI takeover of industries like medicine and software, plus exciting updates like AI-generated music and Google giving their Assistant a complete AI makeover. 

We also cover Meta's Audio Craft, Med-Flamingo, GPT-5 Trademark and Rumors, and The SF Compute Company.

Thanks for your likes, comments and support.

CHAPTERS:
=====
00:00 - Self-Diagnosing Medical Problems Is Here
00:28 - MetaGPT: Multi-agent Collaborative Framework and The Multi-Agent Future
11:39 - Will Agents Replace Software in the Future?
14:59 - The flood of new LLMs
20:09 - Med-Flamingo: Have Virtual Doctors Arrived?
34:06 - Martin Shkreli's Dr. Gupta & Disrupting Medical Diagnosis
38:45 - Everyone is Using AI Already for Medical Diagnosis
39:29 - Focusing on Higher Level Work
40:53 - Meta's Audio Craft: Create Sounds and Music with Open Source AI
44:53 - Will Spotify Cut Out Artists to Increase Profits with AI Music?
48:07 - Will Entrenched Professionals Slow Down the Benefit of AI?
51:37 - ChatGPT Updates: GPT-4 as Default, Suggested Replies, Prompt Examples, Stay Logged In!
 57:05 - The San Francisco Compute Group: The A100 Cooperative!
1:00:42 - GPT-5 Trademark has been registered!
 1:01:48 - Google Assistant Powered by AI LLM Leaked in Letter
1:05:02 - The Future of Websites: LLMs for Businesses and Brands

SOURCES:
=====
https://arxiv.org/pdf/2308.00352v2.pdf
https://twitter.com/michael_d_moor/status/1685804620730540033?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://twitter.com/emollick/status/1686176146700857344?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://blueprintcdn.com/wp-content/uploads/2023/07/Blueprint-Discussion-Paper-2023.10-Agarwal-Moehring-Rajpurkar-Salz_2.pdf
https://twitter.com/sentdex/status/1687123772078247936?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://ai.meta.com/blog/audiocraft-musicgen-audiogen-encodec-generative-ai-audio/
https://twitter.com/joannejang/status/1687165702275567616?s=46&t=uXHUN4Glah4CaV-g2czc6Q
https://www.searchenginejournal.com/openai-files-trademark-application-gpt-5/493040/
https://www.axios.com/2023/07/31/google-assistant-artificial-intelligence-news
https://sfcompute.org/

© My Podcast Data