August 11-12, 2026
Seoul, South Korea
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit Korea 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is displayed in Korea Standard Time (KST), UTC+9.
Tuesday August 11, 2026 14:55 - 15:25 KST
Tech blogs usually cover huge Ceph clusters with thousands of disks, but many of us run smaller on-prem setups. I will share my experience debugging a small Ceph cluster (10 nodes, 10 NVMe drives, 15 non-NVMe SSDs) and how tracing a slow application led me to write an upstream C++ patch.

It started when our Valkey (Redis-compatible) pods suffered from severe write latency. CephFS metadata and Istio network metrics looked fine, so we dug into the storage layer with ceph osd perf and saw very high latency on one specific node. Looking at the kernel logs (dmesg -k), we found a failing NVMe disk. I will explain the "slow OSD" problem: how one failing disk can stall a cluster that uses 3x replication.
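As a rough illustration of why one slow disk hurts so much: in a replicated Ceph pool, the primary OSD acknowledges a client write only after every replica in the acting set has persisted it. The sketch below models that as "the write is as slow as its slowest replica". The function name and the latency figures in the note are hypothetical, and the model deliberately ignores network and queueing delays.

```cpp
#include <algorithm>
#include <vector>

// Simplified model (an assumption for illustration, not Ceph code):
// a replicated write is acknowledged once the slowest replica has
// committed, so its latency is the maximum of the per-replica latencies.
double write_ack_latency_ms(const std::vector<double>& replica_commit_ms) {
    if (replica_commit_ms.empty()) {
        return 0.0;  // no replicas: nothing to wait for in this toy model
    }
    return *std::max_element(replica_commit_ms.begin(),
                             replica_commit_ms.end());
}
```

With made-up commit latencies of 1 ms, 2 ms, and 900 ms for three replicas, the function returns 900 ms: two healthy disks cannot hide the failing one. Every placement group that includes the slow OSD is affected, and in a small cluster that can be a large share of all writes.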

During this outage, reading the ceph osd perf output was frustrating because the OSD IDs were listed in no particular order, which made debugging harder. I decided to fix it: I looked into Ceph's C++ code, changed the unordered hash map to a vector sorted with std::sort, and opened PR #67915 (https://github.com/ceph/ceph/pull/67915). I will share my experience discussing the fix with Ceph maintainers and explain why I believe engineers should fix the open-source tools they use.
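The general technique can be sketched as follows. This is not the code from the PR: OsdPerf and sorted_by_osd_id are hypothetical names chosen for illustration, and Ceph's real types differ. The idea is simply to copy the entries of an unordered map into a vector and sort them by OSD ID before printing.

```cpp
#include <algorithm>
#include <cstdint>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for per-OSD latency figures (not Ceph's real type).
struct OsdPerf {
    std::uint32_t commit_latency_ms;
    std::uint32_t apply_latency_ms;
};

// Copy the map entries into a vector and sort them by OSD ID, so that
// output built from the result is deterministic and in ascending order.
std::vector<std::pair<int, OsdPerf>>
sorted_by_osd_id(const std::unordered_map<int, OsdPerf>& stats) {
    std::vector<std::pair<int, OsdPerf>> rows(stats.begin(), stats.end());
    std::sort(rows.begin(), rows.end(),
              [](const auto& a, const auto& b) { return a.first < b.first; });
    return rows;
}
```

Iterating over the returned vector yields OSD 0, 1, 2, and so on, regardless of how the hash map happened to lay out its buckets. Sorting a copy at display time leaves the map's constant-time lookups untouched for the rest of the code.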
Speakers

Sangyun Lee

AI Platform Engineer @ CJ Olivenetworks AI Research Lab., CJ Olivenetworks
Sangyun Lee is a first-year AI Platform Engineer at CJ Olivenetworks AI Research Lab, building on-prem K8s AI platforms using Ceph, Istio, ArgoCD, and Airflow. As a Kubestronaut (aiming for Golden status by August), he explores the depths of vLLM inference and low-level infrastructure...
Orchid 2
