Episodes (Page 10)
✨
Introduced CityAVOS, a benchmark dataset for UAV visual object search in urban areas.
✨
Introduced BAT, a benchmark for auto-bidding algorithms in online advertising.
✨
Highlighted Amazon's AI, ML, and robotics research.
Human Feedback Improvements
✨
Introduced T2I-R1 for text-to-image generation using RL and bi-level CoT.
✨
Proposes pretraining for estimating heterogeneous treatment effects (HTE).
✨
Introduces formula predicting LLM shifts to harmful content.
✨
Yo’Chameleon personalizes Large Multimodal Models (LMMs).
✨
AI integration in finance: benefits and risks for stability.
✨
Addresses reward hacking in LLM reasoning tasks with PRMs.
✨
Uses language models for automated patient record linkage.
✨
Surveys Parameter-Efficient Continual Fine-Tuning (PECFT).
✨
Building a code-editing agent requires LLM, loop, and tokens.
✨
Novel two-stage framework for lung tumor segmentation.
✨
Introduces Complex-Edit benchmark for image editing models.
✨
Pre-computes LLM inferences offline for faster query responses.
✨
Miras framework designs sequence models like associative memory.
✨
RUKA is an affordable, open-source humanoid robotic hand.
✨
GenEAva generates expressive cartoon avatars via diffusion.
✨
VCR-Bench evaluates video Chain-of-Thought reasoning for LVLMs.
✨
HalluMeasure automates hallucination detection in LLM outputs.