Evaluating and improving Replit Agent at scale

May 19, 202610:30 am - 10:45 am
Tech Stage

Description

Building AI agents that work reliably at scale is one of the hardest challenges in applied AI. In this talk, Michele explores how Replit approaches evaluation and continuous improvement of Replit Agent: the AI-powered coding assistant used by millions of builders worldwide. From designing meaningful evals to closing the feedback loop between real-world usage and model improvements, Michele shares the frameworks, lessons learned, and open questions from operating an AI agent at production scale.

REPLIT

Replit is an agentic software creation platform that lets anyone build applications using natural language. With millions of users and over 500,000 business users, it democratizes software...