PodcastIntel
Sign in Get Started Free
Latent Space: The AI Engineer Podcast
Latent Space: The AI Engineer Podcast

RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious

Aug 30, 2023 · 1h 12m
AI Summary
  • Eugene Cheah discusses RWKV (Receptance Weighted Key Value) models as significant Transformer alternative, reviving RNNs for GPT-class LLMs with better scaling properties
  • RWKV models scale efficiently in both training and inference compared to Transformer-based open models while remaining competitive on reasoning benchmarks
  • Architectural innovation challenges Transformer dominance and addresses practical deployment concerns around sequence length and computational efficiency for open models

More from Latent Space: The AI Engineer Podcast

View all episodes →

Get AI Summaries for Every New Episode

Subscribe to Latent Space: The AI Engineer Podcast and get AI summaries, guest tracking, and email digests delivered automatically.

Sign Up Free →