• 828: Are “Citizen Data Scientists” A Myth? With Keith McCormick
    Oct 18 2024
    The citizen data scientist: Fact or fiction? Jon Krohn holds a conversation across episodes in this Five-Minute Friday, with today’s guest Keith McCormick, in part responding to Nick Elprin’s interview in episode 811: Scaling Data Teams Effectively. Additional materials: www.superdatascience.com/828 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    20 mins
  • 827: Polars: Past, Present and Future, with Polars Creator Ritchie Vink
    Oct 15 2024
    Ritchie Vink, CEO and Co-Founder of Polars, Inc., speaks to Jon Krohn about the new achievements of Polars, an open-source library for data manipulation. This is the episode for any data scientist on the fence about using Polars, as it explains how Polars managed to make such improvements, the APIs and integration libraries that make it so versatile, and what’s next for this efficient library. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: • Why Polars is so efficient [05:20] • Polars’ easy integration with other data-processing tools [21:23] • Eager vs lazy execution in Polars [32:15] • Polars’ data processing of large- and small-scale datasets [38:28] • Ritchie’s plans to scale his company [46:14] • Upcoming features in Polars [58:06] Additional materials: www.superdatascience.com/827
    1 hr and 14 mins
  • 826: In Case You Missed It in September 2024
    Oct 11 2024
    Next-gen IDEs, efficiency-boosting open-source Python libraries, and changes in hiring for data scientists: This episode of In Case You Missed It gives you our best clips of September’s interviews, hosted by Jon Krohn. Additional materials: www.superdatascience.com/826 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    42 mins
  • 825: Data Contracts: The Key to Data Quality, with Chad Sanderson
    Oct 8 2024
    Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they’re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: • What data contracts are and how they define expectations for data quality [03:16] • What data contracts look like [09:09] • The common misconceptions about data quality when implementing AI [12:55] • Chad’s Chief Operator role at Data Quality Camp [19:46] • How “shifting left” improves data reliability by addressing issues early [24:17] • Why data professionals still struggle with data quality [30:31] • How data debt forms and why it leads to complex, inefficient architectures [35:53] • How will the role of human oversight evolve in ensuring data quality? [47:12] • How can data teams leverage storytelling? [52:33] Additional materials: www.superdatascience.com/825
    1 hr and 2 mins
  • 824: Llama 3.2: Open-Source Edge and Multimodal LLMs
    Oct 4 2024
    Llama 3.2 brings a new era of AI innovation with lightweight models tailored for on-device applications and powerful vision models for handling complex image inputs. Host Jon Krohn explores how this release pushes the boundaries of open-source AI, making it more accessible and versatile for developers. He also covers the Llama Stack toolkit, designed to streamline deployment, and Llama Guard 3, Meta’s latest content moderation solution. With extensive support from major cloud and hardware partners, Llama 3.2 is set to unlock groundbreaking possibilities for AI across mobile and beyond. Tune in to hear more. Additional materials: www.superdatascience.com/824 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    14 mins
  • 823: Virtual Humans and AI Clones, with Natalie Monbiot
    Oct 1 2024
    Virtual humans are rewriting the rules of digital communication and reshaping entire industries. This week, Jon Krohn welcomes Natalie Monbiot, Head of Strategy at Hour One, to shed light on how AI avatars are revolutionizing L&D and e-commerce by turning traditional training and product listings into captivating, presenter-led content. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: • How do you create a virtual being? [10:55] • Reid Hoffman's avatar [13:40] • The virtual human economy [31:07] • Virtual human societies [51:24] • Virtual humans and creative expression [56:35] • Challenges in maintaining transparency [01:00:22] Additional materials: www.superdatascience.com/823
    1 hr and 21 mins
  • 822: NotebookLM: Jaw-Dropping Podcast Episodes Generated About Your Documents
    Sep 27 2024
    NotebookLM, Google’s latest AI tool, takes content creation to a new level. This week, Jon Krohn shares how the platform transformed his 200-page dissertation into a fascinating 11-minute podcast. Discover how AI can turn vast amounts of information into engaging and digestible content, opening up new possibilities for content creation. Additional materials: www.superdatascience.com/822 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    19 mins
  • 821: The Skills You Need to Be an Effective Data Scientist, with Marck Vaisman
    Sep 24 2024
    Marck Vaisman speaks to Jon Krohn about his paradigm for understanding core data practitioner types. Hear Marck detail the four data practitioner personas that he has identified in his research, why he believes the roadmaps that influencers like to promote as surefire ways to a data science career don’t work in practice, and why the term “data scientist” is still so elusive and hard to recruit for. This episode is brought to you by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: • How Marck started his work in defining data science roles [08:06] • The relationship between the four data practitioner personas [15:26] • About Marck’s “menu” for effective data science [40:43] • How recruiters can hire the best data scientist for the job [59:31] Additional materials: www.superdatascience.com/821
    1 hr and 13 mins