PodcastIntel
Sign in Get Started Free
Latent Space: The AI Engineer Podcast
Latent Space: The AI Engineer Podcast

From RLHF to RLHB: The Case for Learning from Human Behavior - with Jeffrey Wang and Joe Reeve of Amplitude

Jun 8, 2023 · 49m
AI Summary
  • Jeffrey Wang and Joe Reeve propose RLHB (Reinforcement Learning from Human Behavior) as alternative to RLHF, using implicit behavioral signals instead of explicit feedback
  • Highlights difficulty of collecting high-quality explicit human feedback (15,000 items is resource-intensive) and low engagement with explicit feedback UI
  • Explores using behavioral data and implicit signals to scale reward modeling without requiring millions of explicit annotation items

More from Latent Space: The AI Engineer Podcast

View all episodes →

Get AI Summaries for Every New Episode

Subscribe to Latent Space: The AI Engineer Podcast and get AI summaries, guest tracking, and email digests delivered automatically.

Sign Up Free →