Good podcast

Top 100 most popular podcasts

Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship ? everything you need to crush it with data science.

Subscribe

iTunes / Overcast / RSS

Website

superdatascience.com/podcast

Episodes

832: The Anthropic CEO?s Techno-Utopia

Host Jon Krohn unpacks Dario Amodei?s vision of a techno-utopia in his essay Machines of Loving Grace, where ?Powerful AI? takes center stage. Amodei, CEO of Anthropic, imagines a future where AI doesn?t just assist but actively shapes fields like healthcare, economics, and governance with unmatched intelligence and autonomy. Jon explores the possibilities and challenges of this AI-driven future, asking how close we are to seeing these revolutionary shifts and what they mean for society. Additional materials: www.superdatascience.com/832 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-11-01
Link to episode

831: PyTorch Lightning, Lit-Serve and Lightning Studios, with Dr. Luca Antiga

PyTorch Lightning is revolutionizing the AI landscape, and Dr. Luca Antiga, CTO of Lightning AI, joins host Jon Krohn to explain how. In this episode, they explore the tools pushing AI development forward, from Lightning Studios to Lit-Serve, and discuss the game-changing rise of small language models that challenge industry giants with precision and speed. Luca also shares his vision for developers in an AI-enhanced world, where coding meets creativity and collaboration with intelligent tools. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: How Lightning AI's open-source tools make AI development faster [11:30] The rise of small language models and how they'll rival LLMs [37:47] Luca's journey from biomedical imaging to deep learning pioneer [52:03] How AI will transform software developer tasks [1:03:05] Additional materials: www.superdatascience.com/831
2024-10-29
Link to episode

830: The ?A.I.? Nobel Prizes (in Physics and Chemistry??)

Geoffrey Hinton and Sir Demis Hassabis: The Nobel Prize committee is an achievement of the highest order, awarding physicists, chemists, physiologists, medical practitioners, writers, pacifists and economists perhaps the greatest honor in their respective fields. In this week?s Five-Minute Friday, Jon Krohn discusses how two AI pioneers came to win prizes in chemistry and physics. Additional materials: www.superdatascience.com/830 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-10-25
Link to episode

829: Neuroscience Fueled by ML, with Prof. Bradley Voytek

Neuroscientist Bradley Voytek outlines to Jon Krohn the incredible use of data science and machine learning in his research and how recent discoveries in action potentials and neurons have completely skyrocketed the field to a new understanding of the brain and its functions. You?ll also hear what Bradley thinks is most important when hiring data scientists and his contributions to Uber?s algorithm when it was still a startup.  This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: Breakthroughs in brain region communication [04:08] The future of brain research and MedTech [35:24] The libraries and software used at the Halicioglu Data Science Institute [45:11] Brain rhythm as a diagnostic tool [1:02:58] Bradley?s curriculum structure at UC San Diego [1:12:21] How Uber applies data science [1:20:07] Additional materials: www.superdatascience.com/829
2024-10-22
Link to episode

828: Are ?Citizen Data Scientists? A Myth? With Keith McCormick

The citizen data scientist: Fact or fiction? Jon Krohn holds a conversation across episodes in this Five-Minute Friday, with today?s guest Keith McCormick, in part responding to Nick Elprin?s interview in episode 811: Scaling Data Teams Effectively. Additional materials: www.superdatascience.com/828 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-10-18
Link to episode

827: Polars: Past, Present and Future, with Polars Creator Ritchie Vink

Ritchie Vink, CEO and Co-Founder of Polars, Inc., speaks to Jon Krohn about the new achievements of Polars, an open-source library for data manipulation. This is the episode for any data scientist on the fence about using Polars, as it explains how Polars managed to make such improvements, the APIs and integration libraries that make it so versatile, and what?s next for this efficient library. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: Why Polars is so efficient [05:20] Polars? easy integration with other data-processing tools [21:23] Eager vs lazy executive in Polars [32:15] Polars? data processing of large- and small-scale datasets [38:28] Ritchie?s plans to scale his company [46:14] Upcoming features in Polars [58:06] Additional materials: www.superdatascience.com/827
2024-10-15
Link to episode

826: In Case You Missed It in September 2024

Next-gen IDEs, efficiency-boosting open-source Python libraries, and changes in hiring for data scientists: This episode of In Case You Missed It gives you our best clips of September?s interviews, hosted by Jon Krohn. Additional materials: www.superdatascience.com/826 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-10-11
Link to episode

825: Data Contracts: The Key to Data Quality, with Chad Sanderson

Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they?re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: What data contracts are and how they define expectations for data quality [03:16] What data contracts look like [09:09] The common misconceptions about data quality when implementing AI [12:55] Chad?s Chief Operator role at Data Quality Camp [19:46] How ?shifting left? improves data reliability by addressing issues early [24:17] Why data professionals still struggle with data quality [30:31] How data debt forms and why it leads to complex, inefficient architectures [35:53] How will the role of human oversight evolve in ensuring data quality? [47:12] How can data teams leverage storytelling? [52:33] Additional materials: www.superdatascience.com/825
2024-10-08
Link to episode

824: Llama 3.2: Open-Source Edge and Multimodal LLMs

Llama 3.2 brings a new era of AI innovation with lightweight models tailored for on-device applications and powerful vision models for handling complex image inputs. Host Jon Krohn explores how this release pushes the boundaries of open-source AI, making it more accessible and versatile for developers. He also covers the Llama Stack toolkit, designed to streamline deployment, and Llama Guard 3, Meta?s latest content moderation solution. With extensive support from major cloud and hardware partners, Llama 3.2 is set to unlock groundbreaking possibilities for AI across mobile and beyond. Tune in to hear more. Additional materials: www.superdatascience.com/824 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-10-04
Link to episode

823: Virtual Humans and AI Clones, with Natalie Monbiot

Virtual humans are rewriting the rules of digital communication and reshaping entire industries. This week, Jon Krohn welcomes Natalie Monbiot, Head of Strategy at Hour One, to shed light on how AI avatars are revolutionizing L&D and e-commerce by turning traditional training and product listings into captivating, presenter-led content. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? How do you create a virtual being? [10:55] ? Reid Hoffman's avatar [13:40] ? The virtual human economy [31:07] ? Virtual human societies [51:24] ? Virtual humans and creative expression [56:35] ? Challenges in maintaining transparency [01:00:22] Additional materials: www.superdatascience.com/823
2024-10-01
Link to episode

822: NotebookLM: Jaw-Dropping Podcast Episodes Generated About Your Documents

NotebookLM, Google?s latest AI tool, takes content creation to a new level. This week, Jon Krohn shares how the platform transformed his 200-page dissertation into a fascinating 11-minute podcast. Discover how AI can turn vast amounts of information into engaging and digestible content, opening up new possibilities for content creation. Additional materials: www.superdatascience.com/822 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-09-27
Link to episode

821: The Skills You Need to Be an Effective Data Scientist, with Marck Vaisman

Marck Vaisman speaks to Jon Krohn about his paradigm for understanding core data practitioner types. Hear Marck detail the four data practitioner personas that he has identified in his research, why he believes the roadmaps that influencers like to promote as surefire ways to a data science career don?t work in practice, and why the term ?data scientist? is still so elusive and hard to recruit for. This episode is brought to you by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? How Marck started his work in defining data science roles [08:06] ? The relationship between the four data practitioner personas [15:26] ? About Marck?s ?menu? for effective data science [40:43] ? How recruiters can hire the best data scientist for the job [59:31] Additional materials: www.superdatascience.com/821
2024-09-24
Link to episode

820: OpenAI's o1 "Strawberry" Models

Jon Krohn takes OpenAI?s new models (o1-preview and o1-mini) for a spin in this Five-Minute Friday, learning their key strengths and limitations, and how the o1 series may represent yet another landmark for generative AI. Additional materials: www.superdatascience.com/820 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-09-20
Link to episode

819: PyTorch: From Zero to Hero, with Luka Anicin

SuperDataScience veteran and Udemy teacher Luka Anicin is on the podcast to talk about his brand-new course, ?PyTorch: From Zero to Hero?, available exclusively on superdatascience.com. Host Jon Krohn asks Luka why he feels that every data scientist should consider PyTorch as their default Python library, and why ?keeping it simple? can secure the success of a machine learning project. This episode is brought to you by AWS Inferentia and AWS Trainium, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? About the PyTorch library [03:29] ? Why PyTorch became so popular [25:24] ? How to increase accuracy and efficiency in PyTorch [31:49] ? How to utilize transfer learning [35:44] ? Why real-world projects are essential to data scientists [41:10] ? About Datablooz [46:49] Additional materials: www.superdatascience.com/819
2024-09-17
Link to episode

818: In Case You Missed It in August 2024

Experts from AI and data science discuss the impact and benefits of decentralization, the importance of structuring AI systems in business, and why knowing the basics will always matter for data engineers. Listen to Shingai Manjengwa (episode 809), Daniel Hulme (episode 807), Jerry Yurchisin (episode 813) and Nick Elprin (episode 811) explore a future world of work that rewards continuing learners, sets tasks for the people best suited to complete them rather than those whose job titles reflect the spec, and applies a fleet of ?AI agents? to solve complex business tasks. Additional materials: www.superdatascience.com/818 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-09-13
Link to episode

817: The Positron IDE, Tidy NLP and MLOps with Dr. Julia Silge

Dr. Julia Silge, Engineering Manager at Posit, introduces the brand-new Positron IDE, perfect for exploratory data analysis and visualization. She also lays out her top picks for LLMs that boost coding efficiency and discusses when traditional NLP methods might be the smarter choice over LLMs. Plus, Julia highlights some must-know open-source libraries that make managing MLOps easier than ever. Tune in for insights that every data scientist, ML engineer, and developer will find useful. This episode is brought to you by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? Overview of Posit and Positron IDE [05:20] ? How the needs of a data scientist differ from those of a software developer [10:54] ? How to contribute to the open-source Positron [19:50] ? MLOps and Vetiver: Tools for deploying and maintaining ML models [37:01] ? Natural Language Processing (NLP) and the Tidyverse approach [50:34] ? The role of AI and LLMs in data science education [1:24:18] Additional materials: www.superdatascience.com/817
2024-09-10
Link to episode

816: Explaining AGI to a 94-Year-Old

Jon Krohn takes on a listener's challenge to explain his work in data science to his 94-year-old grandmother, Annie. This heartwarming conversation covers what data is, the role of a data scientist, and breaks down artificial intelligence (AI) and artificial general intelligence (AGI) in simple terms. The episode provides a fresh take on how to communicate complex topics to a lay audience, offering both clarity and insight. Additional materials: www.superdatascience.com/816 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-09-06
Link to episode

815: Polars: Faster DataFrame Ops, with Marco Gorelli

Polars, Python, Narwhals, Rust, and Pandas: Marco Gorelli talks to Jon Krohn about the many ways to use the newest data libraries available, the joys of open-source development, and the best method to win prizes in forecasting competitions. This episode is brought to you by AWS Inferentia and AWS Trainium, by Babbel, the science-backed language-learning platform, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? When to use Polars vs Pandas [08:26] ? How Polars optimizes string operations and data processing [20:08] ? Where Narwhals outstrips Polars and Pandas [48:37] ? The benefits of using Altair [55:21] ? Addressing the lack of women in data science [1:09:58] ? How to win a forecasting competition [1:16:58] Additional materials: www.superdatascience.com/815
2024-09-03
Link to episode

814: Summer Reflections

As summer winds down, this episode shifts focus from the usual tech discussions to something more personal: reflecting on the importance of balancing work with life?s simple pleasures. While the world of data science and AI continues to evolve rapidly, it's essential to remember that true success isn't just about professional milestones. It?s also about cherishing the moments that make life meaningful. Tune in for a brief but impactful reflection on how to redefine success to include not just achievements, but also the everyday joys that often go unnoticed. Additional materials: www.superdatascience.com/814 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-08-30
Link to episode

813: Solving Business Problems Optimally with Data, with Jerry Yurchisin

Jerry Yurchisin from Gurobi joins Jon Krohn to break down mathematical optimization, showing why it often outshines machine learning for real-world challenges. Find out how innovations like NVIDIA?s latest CPUs are speeding up solutions to problems like the Traveling Salesman in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? The Burrito Optimization Game and mathematical optimization use cases [03:36] ? Key differences between machine learning and mathematical optimization [05:45] ? How mathematical optimization is ideal for real-world constraints [13:50] ? Gurobi?s APIs and the ease of integrating them [21:33] ? How LLMs like GPT-4 can help with optimization problems [39:39] ? Why integer variables are so complex to model [01:02:37] ? NP-hard problems [01:11:01] ? The history of optimization and its early applications [01:26:23] Additional materials: www.superdatascience.com/813
2024-08-27
Link to episode

812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery

In this episode of Five-Minute Friday, Jon Krohn investigates published findings from the startup Sakana AI and its paper?s co-authors from the University of Oxford, the University of British Columbia and the Vector Institute in Toronto. These authors explore the potential of The AI Scientist, a framework that could change the way we conduct scientific research forever. Additional materials: www.superdatascience.com/812 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-08-23
Link to episode

811: Scaling Data Science Teams Effectively, with Nick Elprin

Nick Elprin talks to Jon Krohn about how and when to scale a data science team and its workflows to secure a company?s commercial viability. You?ll also hear how to launch your own data science startup and why it?s so important to understand that AI tools are not one-size-fits-all. This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? How Nick served enterprises with his AI startup, Domino Data Lab [05:36] ? About the Navy?s own mine detection models [17:43] ? The hype surrounding GenAI [30:35] ? How AI platforms integrate with business strategies [39:49] ? When it?s time to integrate an AI tool into your business [51:12] ? Why Nick started Domino Data Lab [1:03:53] Additional materials: www.superdatascience.com/811
2024-08-20
Link to episode

810: The Five Levels of Self-Driving Cars

Self-driving cars are here, and Jon Krohn is breaking down the five levels of automation that could change driving forever. From full human control at Level 0 to cars that drive themselves in any condition at Level 5, get the real story on what these levels mean. With firsthand insights from a recent autonomous vehicle experience, this episode cuts through the buzz and tells you what?s coming next. Additional materials: www.superdatascience.com/810 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-08-16
Link to episode

809: Agentic AI, with Shingai Manjengwa

Agentic AI is revolutionizing the tech landscape, and Shingai Manjengwa from ChainML is here to tell us why. Discover how AI agents are becoming an integral part of our lives, automating tasks like travel bookings and daily inspiration. Shingai explains the power of multi-agent systems, where AI agents collaborate to solve complex challenges, and highlights how blockchain technology is enhancing AI transparency and trust. Plus, get an inside look at ChainML?s innovative Theoriq protocol and the groundbreaking Council Analytics tool. This episode is brought to you by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? What A.I. agents are [10:51] ? How blockchain technology helps humans trust A.I. agents [18:27] ? The Theoriq protocol developed by ChainML [34:05] ? How Council Analytics lets you ?speak? to their dataset with natural language [39:00] ? A future of multi-agent systems [50:42] ? Challenges and risks associated with agentic AI [1:04:17] Additional materials: www.superdatascience.com/809
2024-08-13
Link to episode

808: In Case You Missed It in July 2024

Advice for emerging data scientists, the latest in model merging, and how GenAI can supercharge your creativity: Host Jon Krohn gives us his highlights from a month of interviews, packed with tips from some of the leading names in data science and beyond. Guests include Daliana Liu, Charles Duhigg, Charles Goddard, Rosanne Liu and Andrey Kurenkov. Additional materials: www.superdatascience.com/808 ? Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-08-09
Link to episode

807: Superintelligence and the Six Singularities, with Dr. Daniel Hulme

The singularity could soon be upon us. The PESTLE framework, developed by this episode?s guest Daniel Hulme, expresses not one but six types of singularity that could occur: political, environmental, social, technological, legal and economic. Jon Krohn and Daniel Hulme discuss how each of these singularities could bring good to the world, aligning with human interests and pushing forward progress. They also talk about neuromorphic computing, machine consciousness, and applying AI at work. This episode is brought to you by AWS Inferentia and AWS Trainium, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? About the six singularities [03:43] ? How the singularity could improve life on earth [09:01] ? The credibility of AI experts [32:51] ? How the decentralization of technology could benefit earth [43:14] ? How AI might enhance creativity [1:04:33] Additional materials: www.superdatascience.com/807
2024-08-06
Link to episode

806: Llama 3.1 405B: The First Open-Source Frontier LLM

Llama 3.1 is here, and it?s a game-changer. Meta?s latest AI model, especially the massive 405B variant, finally brings an open-source option to compete with giants like OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet. While Meta didn?t fully open-source everything, the availability of "open weights" is a strategic move to shake up the AI landscape. The model boasts an impressive 128,000-token context window and multilingual support in eight languages. Meta is also focusing on responsible AI development with tools like Llama Guard 3 for content moderation. This release is more than just a tech upgrade?it's about democratizing AI and sparking innovation across industries. How will you leverage Llama 3.1 to make a real impact? Tune into this week?s FMF episode and let?s explore the future with this latest AI development together. Additional materials: www.superdatascience.com/806 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-08-02
Link to episode

805: How to Be a Supercommunicator, with Charles Duhigg

Become a Supercommunicator! New York Times bestselling author Charles Duhigg, known for The Power of Habit and Smarter Faster Better, gets real about mastering communication in this episode. Discover insights from his latest book, Supercommunicator, where he reveals how to align conversation styles for deeper connections, handle conflicts effectively, and why AI can't replicate the emotional depth of human interactions. This episode is brought to you by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? The inspirations behind Supercommunicator [03:41] ? The three types of conversations: Practical, emotional, and social conversations [05:22] ? The matching principle: Align communication styles for better connection [10:36] ? What is neural entrainment: Achieve a mind meld through synchronized brain activity [13:22] ? The series of steps/principles to connect with someone [24:39] ? How to avoid or de-escalate conflict conversations [31:07] ? The impact of GenAI on conversations: How AI mimics dialogue but lacks emotional depth [45:24] Additional materials: www.superdatascience.com/805
2024-07-30
Link to episode

804: AI x Solar Power = Abundant Energy

Solar power now provides 6% of the world's electricity, thanks to rapid growth. Host Jon Krohn discusses the factors driving this rise, the challenges ahead, and how AI and data science are optimizing solar technologies. Tune in for insights on the future of solar power, and don't forget to like, share, and subscribe! Additional materials: www.superdatascience.com/804 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-07-26
Link to episode

803: How to Thrive in Your (Data Science) Career, with Daliana Liu

Daliana Liu is a big name in data science teaching, and she has always been generous in sharing everything she knows about getting a job in data science. In this episode, she continues to extend her generosity, helping listeners define their approach to achieving a fulfilling career in data science and tech. This episode is brought to you by AWS Inferentia and AWS Trainium, by Babbel, the science-backed language-learning platform, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? Common career challenges for data scientists [34:57] ? Advice for people who don?t know where to go in their career [48:05] ? How to build resilience and protect against Imposter Syndrome [1:06:23] ? Skills that data scientists should develop today [1:39:17] ? The future of the data science and AI job market [1:46:55] Additional materials: www.superdatascience.com/803
2024-07-23
Link to episode

802: In Case You Missed It in June 2024

How to grab investor interest with your AI startup idea, revisiting algorithms, and helping practitioners ensure AI safety with regulatory frameworks and beyond: This month, you missed a whole bunch of great interviews. But don?t worry, Jon Krohn is here to recap all the best bits for you! Additional materials: www.superdatascience.com/802 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-07-19
Link to episode

801: Merged LLMs Are Smaller And More Capable, with Arcee AI's Mark McQuade and Charles Goddard

Merged LLMs are the future, and we?re exploring how with Mark McQuade and Charles Goddard from Arcee AI on this episode with Jon Krohn. Learn how to combine multiple LLMs without adding bulk, train more efficiently, and dive into different expert approaches. Discover how smaller models can outperform larger ones and leverage open-source projects for big enterprise wins. This episode is packed with must-know insights for data scientists and ML engineers. Don?t miss out! Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? Explanation of Charles' job title: Chief of Frontier Research [03:31] ? Model Merging Technology combining multiple LLMs without increasing size [04:43] ? Using MergeKit for model merging [14:49] ? Evolutionary Model Merging using evolutionary algorithms [22:55] ? Commercial applications and success stories [28:10] ? Comparison of Mixture of Experts (MoE) vs. Mixture of Agents [37:57] ? Spectrum Project for efficient training by targeting specific modules [54:28] ? Future of Small Language Models (SLMs) and their advantages [01:01:22] Additional materials: www.superdatascience.com/801
2024-07-16
Link to episode

800: A Transformative Century of Technological Progress, with Annie P.

The SuperDataScience Podcast is celebrating its 800th episode! Host Jon Krohn speaks to his grandmother, Annie, about growing up at a time when so many technologies we take for granted today were yet to be developed. Listen in to hear Annie?s experience of the changes in technology across 94 years and how she and her family fared in 1940s Ukraine with no electricity or running water. Additional materials: www.superdatascience.com/800
2024-07-12
Link to episode

799: AGI Could Be Near: Dystopian and Utopian Implications, with Dr. Andrey Kurenkov

No-code games with GenAI, the creative possibilities of LLMs, and our proximity to AGI: In this episode, Jon Krohn talks to Andrey Kurenkov about what turned him from an AGI skeptic to a positivist. You?ll also hear about his wildly popular podcast ?Last Week in AI? and how the NVIDIA-backed startup Astrocade is helping videogame enthusiasts to create their own games through generative AI. A must-listen! This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? All about The Gradient and Last Week in AI [10:42] ? All about Astrocade and Andrey?s role at the startup [24:35] ? Balancing UX and creative control at Astrocade [42:00] ? The creative possibilities of LLMs [1:04:15] ? The rapid emergence of AGI [1:10:31] Additional materials: www.superdatascience.com/799
2024-07-09
Link to episode

798: Claude 3.5 Sonnet: Frontier Capabilities & Slick New "Artifacts" UI

Claude 3.5 Sonnet, Anthropic?s newest model, is making waves in the AI community. This mid-size model outshines the larger Claude 3 Opus in tasks like code generation, content creation, and document summarization, and it?s twice as fast. In this episode of The Super Data Science Podcast, Jon Krohn discusses its top-notch performance across benchmarks like MMLU, GPQA, and HumanEval, along with its improved machine vision capabilities. Plus, learn about the new Artifacts UI feature, which makes managing generated content easier by displaying outputs side-by-side with inputs. Tune in to find out why Claude 3.5 Sonnet is setting new standards in AI. Additional materials: www.superdatascience.com/798 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-07-05
Link to episode

797: Deep Learning Classics and Trends, with Dr. Rosanne Liu

Dr. Rosanne Liu, Research Scientist at Google DeepMind and co-founder of the ML Collective, shares her journey and the mission to democratize AI research. She explains her pioneering work on intrinsic dimensions in deep learning and the advantages of curiosity-driven research. Jon and Dr. Liu also explore the complexities of understanding powerful AI models, the specifics of character-aware text encoding, and the significant impact of diversity, equity, and inclusion in the ML community. With publications in NeurIPS, ICLR, ICML, and Science, Dr. Liu offers her expertise and vision for the future of machine learning. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? How the ML Collective came about [03:31] ? The concept of a failure CV [16:12] ? ML Collective research topics [19:03] ? How Dr. Liu's work on the ?intrinsic dimension? of deep learning models inspired the now-standard LoRA approach to fine-tuning LLMs [21:28] ? The pros and cons of curiosity-driven vs. goal-driven ML research [29:08] ? Discussion on Dr. Liu's research and papers [33:17] ? Character-aware vs. character-blind text encoding [54:59] ? The positive impacts of diversity, equity, and inclusion in the ML community [57:51] Additional materials: www.superdatascience.com/797
2024-07-02
Link to episode

796: Earth's Coming Population Collapse and How AI Can Help, with Simon Kuestenmacher

Want to feel optimistic about your day? In this Friday episode, Simon Kuestenmacher talks to Jon Krohn about demography: What it is, why it?s so important, and why its forecasts should give us reason to hope for a better future. In an increasingly globalized world, and with an aging population in countries with the biggest GDPs, demography is more valuable than ever. Additional materials: www.superdatascience.com/796 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-06-28
Link to episode

795: Fast-Evolving Data and AI Regulatory Frameworks, with Dr. Gina Guillaume-Joseph

Gina Guillaume-Joseph talks to Jon Krohn about the data and regulatory frameworks set to transform the AI industry and why that?s important to anyone working with data. This episode offers a solid path to understanding AI regulation?s past, present and future. Gina walks listeners through the AI Bill of Rights, the NIST AI Risk Framework and the MITRE ATLAS threat model. This episode is brought to you by AWS Inferentia and AWS Trainium, by Crawlbase, the ultimate data crawling platform, and by Babbel, the science-backed language-learning platform. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? What ?responsible AI? means [08:14] ? Why the federal government should be behind AI regulation [12:22] ? The US vs EU on AI regulation [18:46] ? About the AI Bill of Rights [26:14] ? About MITRE and the MITRE Atlas [37:19] ? What a systems engineer does [54:11] Additional materials: www.superdatascience.com/795
2024-06-25
Link to episode

794: Exciting (and Frightening!) Trends in Open-Source AI

Trends in open-source AI: Join Jon Krohn and a panel of data science icons as they discuss the most exciting and concerning developments in open-source AI. Hear insights from Drew Conway, Jared Lander, Emily Zabor, and JD Long on the transformative potential of AI and its future impact. Additional materials: www.superdatascience.com/794 Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.
2024-06-21
Link to episode

793: Bayesian Methods and Applications, with Alexandre Andorra

Bayesian methods take the spotlight in this episode with Alex Andorra, co-founder of PyMC Labs, and Jon Krohn. Learn how Bayesian techniques handle tough problems, make the most of prior knowledge, and work wonders with limited data. Alex and Jon break down essentials like PyMC, PyStan, and NumPyro libraries, show how to boost model efficiency with PyTensor, and talk about using ArviZ for top-notch diagnostics and visualizations. Plus, get into advanced modeling with Gaussian Processes. This episode is brought to you by Crawlbase, the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: ? Practical introduction to Bayesian statistics [04:54] ? Definition and significance of epistemology [17:52] ? Explanation of PyMC and Monte Carlo methods [27:57] ? How to get started with Bayesian modeling and PyMC [34:26] ? PyMC Labs and its consulting services [50:50] ? ArviZ for post-modeling diagnostics and visualization [01:02:23] ? Gaussian processes and their applications [01:09:02] Additional materials: www.superdatascience.com/793
2024-06-18
Link to episode

792: In Case You Missed It in May 2024

Jon Krohn shares his favorite clips from May. Hear how Navdeep Martin is spearheading a company to tackle the climate crisis, why Sol Rashidi and Demetrios Brinkmann find nailing job titles so necessary in the fast-paced industries of tech and AI, and get the latest on embeddings with Luis Serrano. Additional materials: www.superdatascience.com/792 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
2024-06-14
Link to episode

791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to Jon Krohn about the technique?s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education. This episode is brought to you by AWS Inferentia (go.aws/3zWS0au) and AWS Trainium (go.aws/3ycV6K0), and Crawlbase (crawlbase.com), the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: ? Why it is important that AI is open [03:13] ? The efficacy and scalability of direct preference optimization [07:32] ? Robotics and LLMs [14:32] ? The challenges to aligning reward models with human preferences [23:00] ? How to make sure AI?s decision making on preferences reflect desirable behavior [28:52] ? Why Nathan believes AI is closer to alchemy than science [37:38] Additional materials: www.superdatascience.com/791
2024-06-11
Link to episode

790: Open-Source Libraries for Data Science at the New York R Conference

The experts reveal their top open-source R libraries with us live from the New York R Conference! This Super Data Science Podcast episode features an exclusive panel with data science trailblazers Drew Conway, Jared Lander, Emily Zabor, and JD Long. They share their favorite R libraries and valuable insights to enhance your data science practice. Additional materials: www.superdatascience.com/790 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
2024-06-07
Link to episode

789: ML for Wind-Powered Energy Generation, with Dr. Jason Yosinski

Machine Learning for Wind Energy is front and center in this episode as Jon Krohn is joined by Dr. Jason Yosinski, CEO of Windscape AI. Dr. Yosinski brings to light the latest ML advancements sparking significant changes in renewable energy. Tune in for a comprehensive review of these cutting-edge technologies and their expansive impact on the industry and the environment's well-being. This episode is brought to you by Crawlbase, the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: ? Enhancing predictability in wind energy with ML [04:52] ? Data utilization from wind turbines by energy providers [11:41] ? Jason's journey into wind energy [17:55] ? Landing the right startup idea [22:47] ? Visualizing neural networks with the Deep Vis Toolbox [31:29] ? Extreme event forecasting at Uber vs. nowcasting at Windscape AI [45:13] ? Discoveries from Loss Change Allocation research [47:48] ? Engaging with Jason's ML Collective [59:46] ? Traits of successful AI entrepreneurs [1:10:26] Additional materials: www.superdatascience.com/789
2024-06-04
Link to episode

788: Multi-Agent Systems: How Teams of LLMs Excel at Complex Tasks

Multi-agent systems could mark a significant turning point in generative AI. From mastering increasingly complex tasks to getting LLMs to collaborate, in this Five-Minute Friday, Jon Krohn discusses the systems that are working to bridge the remaining gaps left by the latest large language models (LLMs). Additional materials: www.superdatascience.com/788 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
2024-05-31
Link to episode

787: MLOps: The Job and The Key Tools, with Demetrios Brinkmann

MLOps, how to build an online community, and tools for scaling LLMs: In this episode, Demetrios Brinkmann speaks to Jon Krohn about the similarities and differences between LLMOps, MLOps and DevOps, and why this should matter to companies looking to hire such engineers. You will also hear how to get involved in the MLOps community wherever you are in the world, and how you can start developing great products with the available tools. This episode is brought to you by AWS Inferentia (go.aws/3zWS0au) and AWS Trainium (go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: ? What MLOps is [03:51] ? About LLMOps [12:06] ? About LlamaIndex and Ollama [18:29] ? Insights from Demetrios? MLOps survey [20:49] ? Guidance for using third-party APIs [40:18] ? Recommendations for building an online community in tech and AI [47:07] Additional materials: www.superdatascience.com/787
2024-05-28
Link to episode

786: The Six Keys to Data Scientists' Success, with Kirill Eremenko

Learn about the six keys to data science success as host Jon Krohn welcomes back Kirill Eremenko, the mastermind behind SuperDataScience. Kirill shares his top insights on data science careers, from building strong portfolios to leveraging mentors and hands-on labs. With over 2.7 million students, his advice is a must-hear for aspiring and experienced data scientists alike.Additional materials: www.superdatascience.com/786Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
2024-05-24
Link to episode

785: Math, Quantum ML and Language Embeddings, with Dr. Luis Serrano

Dr. Luis Serrano from the Serrano Academy reveals how to make Math and Quantum ML accessible, tackles the challenges of teaching A.I. to beginners, and explores the power of embeddings in enterprise applications. Explore the future of Quantum Machine Learning and the latest trends in AI, including multimodality and autonomous systems.This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:? How math and AI can be made easy to understand [05:21]? The three major categories of learners [16:21]? Why embeddings are the most important component of LLMs [26:19]? How semantic search differs from a traditional keyword search [29:57]? The most exciting emerging application areas for AI [42:41]? The promising application areas for Quantum Machine Learning [49:18]Additional materials: www.superdatascience.com/785
2024-05-21
Link to episode

784: Aligning Large Language Models, with Sinan Ozdemir

Aligning LLMs: How can we teach pre-trained LLMs to hold a conversation and learn new information from each other? This was where Sinan Ozdemir began his investigation into aligning LLMs. In this episode, he talks to Jon Krohn about the limitations of definitions for LLMs, training LLMs, and whether it is possible to train an LLM without alignment.Additional materials: www.superdatascience.com/784Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
2024-05-17
Link to episode

783: Generative A.I. for Solar Power Installation, with Navdeep Martin

Recent advances in GenAI, how to tackle the climate crisis with advanced technology, and addressing the knowledge gap in understanding AI: Jon Krohn speaks to Flypower co-founder and CEO Navdeep Martin about the advances made in GenAI, from products to applications, and how we might use AI to tackle climate change.This episode is brought to you by AWS Inferentia and AWS Trainium. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:? How the Washington Post?s recommendation systems work [03:29]? Why product leaders make great CEOs [10:36]? How Flypower uses GenAI to tackle climate change [22:13]? How Flypower identifies its customers? most pertinent questions [30:03]? How AI might come to tackle climate change [36:52]? How to mitigate hallucination in AI models [41:04]Additional materials: www.superdatascience.com/783
2024-05-14
Link to episode
A tiny webapp by I'm With Friends.
Updated daily with data from the Apple Podcasts.