Latent Space: The AI Engineer Podcast

Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)

Aug 22, 2024 · 1h 5m
AI Summary
  • Cosine Genie achieved #1 ranking on SWE-Bench Full, Lite, and Verified using GPT-4o fine-tuning at scale on billions of tokens of synthetic data, beating all other agents including Cognition's Devin
  • Fine-tuning GPT-4o proved worthwhile even though competitors offer long context windows and prompt caching, demonstrating practical value beyond the assumption that 'in-context learning is all you need'
  • The breakthrough combined OpenAI's new fine-tuning capabilities with massive synthetic data generation, establishing a new state of the art (SOTA) for coding agents

Guests on This Episode

Alistair Pullen
