✨
AI Summary
- Databricks' DBRX and Imbue's 70B model outperform GPT-4o zero-shot on reasoning/coding benchmarks while using 7x less data than Llama 3 70B
- Imbue releasing 11 cleaned NLP benchmarks, new code reasoning benchmark, 450k human ambiguity judgments dataset, and infrastructure scripts for bare-metal cluster training
- Focus on cost-aware hyperparameter optimization and practical tools for training large models efficiently on 10,000 H100 clusters