PodcastIntel
Sign in Get Started Free
Neural intel Pod
Neural intel Pod

Reinforcement Learning Under Unmeasured Confounding

Jun 28, 2025 · 01:04:20
AI Summary
  • Offline RL framework for unmeasured confounding.
  • Estimates policy value with reward-inducing proxies.
  • Proposes minimax estimator and policy-gradient algorithm.

More from Neural intel Pod

View all episodes →

Get AI Summaries for Every New Episode

Subscribe to Neural intel Pod and get AI summaries, guest tracking, and email digests delivered automatically.

Sign Up Free →