Beyond Deployment: how to evaluate and manage the reliability of LLMs over time

May 19, 202601:30 pm - 02:00 pm
Maxi Stage 2

Description

The large-scale adoption of generative models requires new forms of governance and control. We will explore how organizations can continuously evaluate the quality of responses, the consistency of behaviors, and the robustness of LLM outputs over time, through practical approaches and real-world examples. We will also discuss several emerging methods for measuring and managing model reliability, highlighting their strategic implications for business.
42

SAS

SAS is among the world leaders in artificial intelligence and data. Thanks to SAS software and industry-specific solutions, every company can quickly transform data into reliable decisions. With...