Data Brew by Databricks

Episodes

Retrieval, rerankers, and RAG tips and tricks | Data Brew | Episode 39

Feb 20 2025

In this episode, Andrew Drozdov, Research Scientist at Databricks, explores how Retrieval Augmented Generation (RAG) enhances AI models by integrating retrieval capabilities for improved response accuracy and relevance.

Highlights include:
- Addressing LLM limitations by injecting relevant external information.
- Optimizing document chunking, embedding, and query generation for RAG.
- Improving retrieval systems with embeddings and fine-tuning techniques.
- Enhancing search results using re-rankers and retrieval diagnostics.
- Applying RAG strategies in enterprise AI for domain-specific improvements.

Show More Show Less

45 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
The Power of Synthetic Data | Data Brew | Episode 38

Feb 4 2025

In this episode, Yev Meyer, Chief Scientist at Gretel AI, explores how synthetic data transforms AI and ML by improving data access, quality, privacy, and model training.

Highlights include:
- Leveraging synthetic data to overcome AI data limitations.
- Enhancing model training while mitigating ethical and privacy risks.
- Exploring the intersection of computational neuroscience and AI workflows.
- Addressing licensing and legal considerations in synthetic data usage.
- Unlocking private datasets for broader and safer AI applications.

Show More Show Less

42 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
Secret to Production AI: Tools & Infrastructure | Data Brew | Episode 37

Jan 22 2025

In this episode, Julia Neagu, CEO & co-founder of Quotient AI, explores the challenges of deploying Generative AI and LLMs, focusing on model evaluation, human-in-the-loop systems, and iterative development.

Highlights include:
- Merging reinforcement learning and unsupervised learning for real-time AI optimization.
- Reducing bias in machine learning with fairness and ethical considerations.
- Lessons from large-scale AI deployments on scalability and feedback loops.
- Automating workflows with AI through successful business examples.
- Best practices for managing AI pipelines, from data collection to validation.

Show More Show Less

37 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
Mixture of Memory Experts (MoME) | Data Brew | Episode 36

Jan 10 2025

In this episode, Sharon Zhou, Co-Founder and CEO of Lamini AI, shares her expertise in the world of AI, focusing on fine-tuning models for improved performance and reliability.

Highlights include:
- The integration of determinism and probabilism for handling unstructured data and user queries effectively.
- Proprietary techniques like memory tuning and robust evaluation frameworks to mitigate model inaccuracies and hallucinations.
- Lessons learned from deploying AI applications, including insights from GitHub Copilot’s rollout.

Connect with Sharon Zhou and Lamini:
https://www.linkedin.com/in/zhousharon/
https://x.com/realsharonzhou
https://www.lamini.ai/

Show More Show Less

41 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
Mixed Attention & LLM Context | Data Brew | Episode 35

Nov 21 2024

In this episode, Shashank Rajput, Research Scientist at Mosaic and Databricks, explores innovative approaches in large language models (LLMs), with a focus on Retrieval Augmented Generation (RAG) and its impact on improving efficiency and reducing operational costs.

Highlights include:
- How RAG enhances LLM accuracy by incorporating relevant external documents.
- The evolution of attention mechanisms, including mixed attention strategies.
- Practical applications of Mamba architectures and their trade-offs with traditional transformers.

Show More Show Less

39 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
Kumo AI & Relational Deep Learning | Data Brew | Episode 34

Oct 14 2024

In this episode, Jure Leskovec, Co-founder of Kumo AI and Professor of Computer Science at Stanford University, discusses Relational Deep Learning (RDL) and its role in automating feature engineering.

Highlights include:
- How RDL enhances predictive modeling.
- Applications in fraud detection and recommendation systems.
- The use of graph neural networks to simplify complex data structures.

Show More Show Less

43 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
LLMs: Internals, Hallucinations, and Applications | Data Brew | Episode 33

Jul 21 2023

Our fifth season dives into large language models (LLMs), from understanding the internals to the risks of using them and everything in between. While we're at it, we'll be enjoying our morning brew.

In this session, we interviewed Chengyin Eng (Senior Data Scientist, Databricks), Sam Raymond (Senior Data Scientist, Databricks), and Joseph Bradley (Lead Production Specialist - ML, Databricks) on the best practices around LLM use cases, prompt engineering, and how to adapt MLOps for LLMs (i.e., LLMOps).

Show More Show Less

39 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
Demonstrate–Search–Predict Framework | Data Brew | Episode 32

Jun 29 2023

We will dive into LLMs for our fifth season, from understanding the internals to the risks of using them and everything in between. While we’re at it, we’ll be enjoying our morning brew.
In this session, we interviewed Omar Khattab - Computer Science Ph.D. Student at Stanford, creator of DSP (Demonstrate–Search–Predict Framework), to discuss DSP, common applications, and the future of NLP.

Show More Show Less

33 mins

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to basket failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from Wish List failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free

Audiobook Categories

More to Explore

GETTING STARTED

Episodes

Retrieval, rerankers, and RAG tips and tricks | Data Brew | Episode 39

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

The Power of Synthetic Data | Data Brew | Episode 38

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

Secret to Production AI: Tools & Infrastructure | Data Brew | Episode 37

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

Mixture of Memory Experts (MoME) | Data Brew | Episode 36

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

Mixed Attention & LLM Context | Data Brew | Episode 35

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

Kumo AI & Relational Deep Learning | Data Brew | Episode 34

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

LLMs: Internals, Hallucinations, and Applications | Data Brew | Episode 33

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed

Demonstrate–Search–Predict Framework | Data Brew | Episode 32

Failed to add items

Add to basket failed.

Add to Wish List failed.

Remove from Wish List failed.

Follow podcast failed

Unfollow podcast failed