August 11-12, 2026
Seoul, South Korea
Note: The schedule is subject to change.

Wednesday August 12, 2026 17:15 - 17:45 KST
Building AI agents is easy, but ensuring their reliability is hard. Because agents operate in open-ended, non-deterministic environments, traditional Test-Driven Development (TDD) falls short. The solution is Eval-Driven Development (EDD), a paradigm that embeds continuous evaluation throughout the agent lifecycle.

This talk explores how to operationalize EDD. First, we examine the critical role of observability. Evaluating only the final output is insufficient for multi-step workflows. We show how OpenTelemetry (OTel)-compatible tracing exposes internal reasoning and tool usage for granular debugging.
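
As a rough illustration of why step-level traces matter, the sketch below scores an agent run by its intermediate spans rather than by its final answer alone. It is a minimal sketch, not the speakers' method: the `Span` class, the `evaluate_trace` function, and the tool names are hypothetical stand-ins, and no OpenTelemetry or MLflow API is used.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Span:
    """Hypothetical, simplified stand-in for one recorded step of an agent run."""
    name: str
    kind: str  # "llm" or "tool" (labels assumed for this sketch)
    error: Optional[str] = None


def evaluate_trace(spans, expected_tools):
    """Return step-level findings instead of a single pass/fail on the final output."""
    called = [s.name for s in spans if s.kind == "tool"]
    return {
        "missing_tools": [t for t in expected_tools if t not in called],
        "unexpected_tools": [t for t in called if t not in expected_tools],
        "failed_steps": [s.name for s in spans if s.error is not None],
    }


# A made-up run: the final answer step succeeds, but the steps before it went wrong.
trace = [
    Span("plan", "llm"),
    Span("search_flights", "tool"),
    Span("book_hotel", "tool", error="timeout"),
    Span("final_answer", "llm"),
]
report = evaluate_trace(trace, expected_tools=["search_flights", "book_flight"])
print(report)
```

A check on the final answer alone would not reveal the skipped tool or the failed step; the span-level report does.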

Next, we tackle scaling evaluation via LLM-as-a-Judge. Since uncalibrated judges often miss domain-specific nuances, we demonstrate aligning custom judges with human experts. Using open-source MLflow, attendees will learn to capture traces, collect expert feedback, and use alignment optimizers to create expert-aligned evaluators.
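
One simple way to check whether a judge is calibrated against experts is to measure chance-corrected agreement between the judge's labels and human labels on the same examples. The sketch below computes Cohen's kappa in plain Python. It is an assumption-laden illustration of measuring alignment, not MLflow's alignment optimizers, and the labels are invented for the example.

```python
from collections import Counter


def cohens_kappa(judge_labels, human_labels):
    """Chance-corrected agreement between two label sequences of equal length."""
    if len(judge_labels) != len(human_labels) or not judge_labels:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(judge_labels)
    # Fraction of examples where judge and human give the same label.
    observed = sum(j == h for j, h in zip(judge_labels, human_labels)) / n
    judge_counts = Counter(judge_labels)
    human_counts = Counter(human_labels)
    labels = set(judge_labels) | set(human_labels)
    # Agreement expected if both raters labeled independently at their own rates.
    expected = sum(judge_counts[l] * human_counts[l] for l in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Invented labels for six agent responses (illustrative only).
judge = ["pass", "pass", "fail", "pass", "fail", "fail"]
human = ["pass", "fail", "fail", "pass", "fail", "pass"]
kappa = cohens_kappa(judge, human)
print(round(kappa, 3))
```

A kappa near 1 suggests the judge tracks the experts closely; a value near 0 suggests agreement no better than chance, which is a signal that the judge needs further alignment.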
Speakers
Vincent Caldeira

CTO APAC, Red Hat
Vincent Caldeira, Red Hat APAC CTO and Industry Visiting Scholar at Columbia University, drives tech strategy and emerging engineering. A Top 10 APAC CTO (2023) with 20+ years in finance IT, he is an authority on open source, cloud-native technologies and AI. Vincent holds leadership...
Sharon Dashet

Senior Principal Software Architect, Red Hat
Sharon Dashet is a seasoned Data and AI Architect with over 25 years of experience designing large-scale analytics, ML, and GenAI platforms. At Red Hat, they drive initiatives around Agentic AI and focus on emerging partnerships within the Red Hat AI ecosystem. Previously, they led...
Rose