Data Brew by Databricks

By: Databricks
  • Summary

  • Welcome to Data Brew by Databricks with Denny and Brooke! In this series, we explore various topics in the data and AI community and interview subject matter experts in data engineering/data science. So join us with your morning brew in hand and get ready to dive deep into data + AI! For this first season, we will be focusing on lakehouses – combining the key features of data warehouses, such as ACID transactions, with the scalability of data lakes, directly against low-cost object stores.
    © 2025 Data Brew by Databricks
Episodes
  • Retrieval, rerankers, and RAG tips and tricks | Data Brew | Episode 39
    Feb 20 2025

    In this episode, Andrew Drozdov, Research Scientist at Databricks, explores how Retrieval Augmented Generation (RAG) enhances AI models by integrating retrieval capabilities for improved response accuracy and relevance.

    Highlights include:
    - Addressing LLM limitations by injecting relevant external information.
    - Optimizing document chunking, embedding, and query generation for RAG.
    - Improving retrieval systems with embeddings and fine-tuning techniques.
    - Enhancing search results using re-rankers and retrieval diagnostics.
    - Applying RAG strategies in enterprise AI for domain-specific improvements.

    45 mins
  • The Power of Synthetic Data | Data Brew | Episode 38
    Feb 4 2025

    In this episode, Yev Meyer, Chief Scientist at Gretel AI, explores how synthetic data transforms AI and ML by improving data access, quality, privacy, and model training.

    Highlights include:
    - Leveraging synthetic data to overcome AI data limitations.
    - Enhancing model training while mitigating ethical and privacy risks.
    - Exploring the intersection of computational neuroscience and AI workflows.
    - Addressing licensing and legal considerations in synthetic data usage.
    - Unlocking private datasets for broader and safer AI applications.

    42 mins
  • Secret to Production AI: Tools & Infrastructure | Data Brew | Episode 37
    Jan 22 2025

    In this episode, Julia Neagu, CEO & co-founder of Quotient AI, explores the challenges of deploying Generative AI and LLMs, focusing on model evaluation, human-in-the-loop systems, and iterative development.

    Highlights include:
    - Merging reinforcement learning and unsupervised learning for real-time AI optimization.
    - Reducing bias in machine learning with fairness and ethical considerations.
    - Lessons from large-scale AI deployments on scalability and feedback loops.
    - Automating workflows with AI through successful business examples.
    - Best practices for managing AI pipelines, from data collection to validation.

    37 mins

