Loading…
11-12, August 2026
Seoul, South Korea
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit Korea 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Korea Standard Time (KST), UTC +9. To see the schedule in your preferred timezone, please select from the drop-down menu to the right.
arrow_back View All Dates
Tuesday, August 11
 

07:30 KST

Registration + Badge Pick-up
Tuesday August 11, 2026 07:30 - 18:45 KST

Tuesday August 11, 2026 07:30 - 18:45 KST
Grand Ballroom Foyer

07:30 KST

Sponsor Showcase
Tuesday August 11, 2026 07:30 - 18:45 KST

Tuesday August 11, 2026 07:30 - 18:45 KST
Sponsor Showcase - Grand Ballroom Foyer

09:00 KST

Keynote: Welcome + Opening Remarks - Jim Zemlin, CEO, The Linux Foundation
Tuesday August 11, 2026 09:00 - 09:25 KST

Speakers
avatar for Jim Zemlin

Jim Zemlin

CEO, The Linux Foundation
Jim Zemlin’s career spans three of the largest technology trends to rise over the last decade: mobile computing, cloud computing, and open source software. Today, as executive director of The Linux Foundation, he uses this experience to accelerate innovation in technology through... Read More →
Tuesday August 11, 2026 09:00 - 09:25 KST
Grand Ballroom 2-3

09:30 KST

Keynote Sessions To Be Announced
Tuesday August 11, 2026 09:30 - 10:15 KST

Tuesday August 11, 2026 09:30 - 10:15 KST
Grand Ballroom 2-3

10:15 KST

Keynote: Priya Nagpurkar, Vice President, Hybrid Cloud and AI Platforms, IBM Research
Tuesday August 11, 2026 10:15 - 10:30 KST

Speakers
avatar for Priya Nagpurkar

Priya Nagpurkar

Vice President, Hybrid Cloud and AI Platform, IBM Research

Tuesday August 11, 2026 10:15 - 10:30 KST
Grand Ballroom 2-3

10:30 KST

Morning Break
Tuesday August 11, 2026 10:30 - 11:00 KST

Tuesday August 11, 2026 10:30 - 11:00 KST
Grand Ballroom Foyer

11:00 KST

Building Event-Driven WebAssembly on Kubernetes: Runtime, Observability, and Security - Brandon Kang, Akamai Technologies & Nam Hai, Hylatek JSC
Tuesday August 11, 2026 11:00 - 11:30 KST
As cloud native systems evolve, WebAssembly(WASM) is emerging as a runtime that complements and sometimes challenges container based approaches. With fast startup, strong isolation, and portability, WASM enables efficient and secure event driven workloads.

In this session, we explore how to build and run the systems using WebAssembly on Kubernetes, focusing on SpinKube, an open source project that brings WASM workloads into Kubernetes.

Through a live demonstration, we will deploy and run WASM workloads using SpinKube, highlighting real workflows. We will also compare cold start and execution performance, showing millisecond startup and lower latency than traditional containers. Plus to demonstrate observability using eBPF based telemetry, we enable visibility into runtime behavior without overhead. Attendees will learn how to trace execution, monitor performance, and troubleshoot distributed WASM systems.

Finally, we will talk how WebAssembly improves security and operational efficiency by reducing the attack surface and enabling consistent execution across environments, providing a practical guide for production adoption.
Speakers
avatar for Kai Nguyen

Kai Nguyen

Engineer, Hylatek JSC
Nam Hai is a senior backend engineer with over 12 years of experience in cloud infrastructure, serverless architecture, and blockchain technology. He has led projects across domains such as IoT, e-commerce, and decentralized finance. Currently, he is focused on building AI-powered... Read More →
avatar for Brandon Kang

Brandon Kang

Principal Technical Solutions Architect, Akamai Technologies
Brandon Kang is a principal solutions architect at Akamai, driving cloud-native and AI initiatives.
With experience at Samsung, Microsoft, and Akamai, he brings deep expertise in large scale cloud native architecture and AI.
He is the author of 12 IT books on S/W engineering, Sec... Read More →
Tuesday August 11, 2026 11:00 - 11:30 KST
Orchid 2

11:00 KST

Upstream Kernel Hardening: Recent Progress and Challenges - Gustavo A. R. Silva, Linux Kernel Self-Protection Project
Tuesday August 11, 2026 11:00 - 11:30 KST
With the release of Linux 7.0, Rust is no longer considered experimental. New and safer components are expected to be written in Rust in the near future. However, the Linux kernel still contains more than 35 million lines of code written in C. This code will remain critical for years to come and, for the benefit of everyone, must continue to be maintained, improved, and hardened where possible.

In the Linux Kernel Self-Protection Project, we care deeply about this, and we've been advancing kernel hardening upstream for several years. In this presentation, we'll review recent hardening efforts and discuss the challenges we continue to face as we work toward our ultimate goal of eliminating classes of bugs and methods of exploitation in the upstream Linux kernel.
Speakers
avatar for Gustavo A. R. Silva

Gustavo A. R. Silva

Upstream Linux Kernel Engineer, Linux Kernel Self-Protection Project
Gustavo A. R. Silva works full-time as an Upstream Linux Kernel Engineer focused on hardening and proactive security. He's spent the past several years fixing all sorts of bugs & hardening the Linux kernel. His work is supported by The Linux Foundation and the Alpha-Omega project... Read More →
Tuesday August 11, 2026 11:00 - 11:30 KST
Orchid 1

11:00 KST

Squeezing Every Millisecond: A Practical Guide To Optimizing Time To First Token With OSS Muscle - Hrittik Roy, Platform Advocate
Tuesday August 11, 2026 11:00 - 11:30 KST
Large language models are getting faster GPUs every year, yet users still notice the pause before the first word appears. That pause has a name: Time To First Token (TTFT). And in production LLM systems, shaving even a few hundred milliseconds from it can dramatically change how responsive an application feels.

This talk tells the story of where those milliseconds go.

We will walk through the lifecycle of a request in modern LLM serving systems and explore the practical techniques engineers use to reduce TTFT in real deployments. Using examples from open source stacks like vLLM, TensorRT-LLM, and Hugging Face TGI, we will examine four powerful optimization levers: KV cache strategies, speculative decoding, model quantization, and batching policies.

Instead of focusing only on theory, the session highlights the tradeoffs practitioners face. When does speculative decoding actually help? When does batching hurt latency? When does quantization reduce memory pressure enough to speed up the first token?

Attendees will leave with a practical playbook for diagnosing TTFT bottlenecks and choosing the right optimization strategy for their model, infrastructure, and workload.
Speakers
avatar for Hrittik Roy

Hrittik Roy

vCluster, Platform Advocate
Hrittik is a Platform Advocate at Loft Labs and a CNCF Ambassador, with expertise in cloud native technologies and open source communities. He has contributed extensively to developer advocacy, technical writing, and community engagement. Hrittik has been a featured speaker at events... Read More →
Tuesday August 11, 2026 11:00 - 11:30 KST
Grand Ballroom 2-3

11:00 KST

SBOMs Aren't Enough. Secure Your Software Supply Chain End-To-End - Yongjae Chung, New York University Secure Systems Lab & Justin Cappos, New York University
Tuesday August 11, 2026 11:00 - 11:30 KST
You probably heard that SBOMs are helpful, but did you know that an SBOM only addresses a fraction of what can go wrong in your software supply chain? The SLSA (Supply Chain Levels for Software Artifacts) specification identifies 9 distinct threat areas, spanning from source code, all the way to package distribution. Most development teams address one or two of these and call it a day, leaving gaps that real-world attacks like SolarWinds and Log4J have already exploited. We understand that it is difficult to cover all aspects when it comes to the software supply chain.

How about we make this much easier? In this talk, we will present an overview of the modern software supply chain threat model, and show how you can provide integrity throughout the whole process of your software development life cycle. We will introduce an easy-to-setup, end-to-end open source stack, built from frameworks and tools within the CNCF/OpenSSF ecosystem.
Speakers
avatar for Justin Cappos

Justin Cappos

Professor, New York University
I am a professor at NYU who has been working on software supply chain security for more than 20 years. I am a maintainer / creator of the TUF, Uptane, and in-toto projects, which are all under the LF.
avatar for Yongjae Chung

Yongjae Chung

Master's Student, New York University Secure Systems Lab
Yongjae is a Master's student at New York University. He is a contributor to gittuf, an incubating project at Open Source Security Foundation.
Tuesday August 11, 2026 11:00 - 11:30 KST
Chrysanthemum

11:40 KST

Self-Healing Rollouts: Automating Production Fixes With Agentic AI - Kevin Dubois, IBM & Carlos Sanchez, Adobe
Tuesday August 11, 2026 11:40 - 12:10 KST
Your software rollouts to production are probably always flawless, right? For the rest of us, once in a while we do run into issues when releasing code to production. Even with robust CI/CD, production rollouts can hit unexpected snags. While in Kubernetes, ArgoCD and Argo Rollouts excels at Progressive Delivery and automated rollbacks to mitigate deployment issues, what if we could go a step further?

This session explores how to elevate your release process by integrating Agentic AI and asynchronous coding agents, with Argo Rollouts canary deployments. We'll demonstrate how an intelligent agent can automatically analyze a rollout failure, pinpointing the root cause. Beyond diagnosis, these agents can take proactive steps on your behalf, suggesting and even implementing code fixes as new pull requests, which can be redeployed automatically after PR review. This approach moves us closer to truly self-healing deployments.

Join us to learn how to combine the power of cloud native projects like Kubernetes, ArgoCD and Argo Rollouts with the autonomous capabilities of Agentic AI, achieving a release experience that is not only seamless but also resilient.
Speakers
avatar for Kevin Dubois

Kevin Dubois

Sr Principal Developer Advocate, IBM
Kevin Dubois is often featured as a (keynote) speaker at conferences around the world, where he shares his passion and knowledge about developer experience, open source, cloud native development and Java. He is also an author, java Champion, and an accomplished software architect... Read More →
avatar for Carlos Sanchez

Carlos Sanchez

Principal Scientist, Adobe
Carlos Sanchez is a Principal Scientist at Adobe Experience Manager, specializing in software automation, from build tools to Continuous Delivery and Progressive Delivery. Involved in Open Source for over 20 years, he is the author of the Jenkins Kubernetes plugin and a member of... Read More →
Tuesday August 11, 2026 11:40 - 12:10 KST
Orchid 2

11:40 KST

Getting Started With Kernel Programming: The Linux Kernel Mentorship Program - Jori Koolstra, Independent
Tuesday August 11, 2026 11:40 - 12:10 KST
The Linux Kernel is one of the most influential and widely-used open source software projects today. However, getting started -- understanding the code and contributing to it -- can be quite daunting. Kernel code often carries history and conventions that are not always documented, but are rather shared by the community as tribal knowledge. Moreover, understanding operating-system-level source code comes with its own unique challenges: there is no libc, you need to know about low-level primitives like memory barriers and various types of locking mechanisms, and memory handling errors can bring the whole system down (no garbage collection either, of course!).

The Linux Kernel Mentorship Program (LKMP) addresses some of these difficulties and aims to mentor prospective contributors to overcome the initial hurdle -- because everything becomes easier after your first contribution. The program is led by experienced kernel maintainers with the assistance of regular contributors and is open to anyone.

In this talk I will detail my experience as an LKMP mentee and explain how I have progressed since. I will also explain how you can get started with a small patch!
Speakers
avatar for Jori Koolstra

Jori Koolstra

Software Engineer

Tuesday August 11, 2026 11:40 - 12:10 KST
Orchid 1

11:40 KST

Stop Trusting a Black Box: The Economic Case for Open, Sovereign AI - Vincent Caldeira, Red Hat
Tuesday August 11, 2026 11:40 - 12:10 KST
As AI matures from a novelty into a strategic asset, 79% of organisations now prioritise sovereignty to mitigate vendor lock-in and secure critical data. Despite this, the market remains paralysed by a paradox: while open models have achieved performance parity with proprietary systems at a fraction of the cost, they remain massively under-utilised due to perceived friction and trust gaps.

This session dissects this market inefficiency, with insights from LF Research revealing how closed-source dominance is often driven by inertia rather than superior capability. We will explore how to operationalise true independence by rejecting "open-washing" in favour of rigorous frameworks that demand full model completeness: verifying everything from training data to weights. Join us to learn how to transition from renting black-box APIs to architecting a transparent, reproducible, and economically superior AI stack.
Speakers
avatar for Vincent Caldeira

Vincent Caldeira

CTO APAC, Red Hat
Vincent Caldeira, Red Hat APAC CTO and Industry Visiting Scholar at Columbia University, drives tech strategy and emerging engineering. A Top 10 APAC CTO (2023) with 20+ years in finance IT, he is an authority on open source, cloud-native technologies and AI. Vincent holds leadership... Read More →
Tuesday August 11, 2026 11:40 - 12:10 KST
Grand Ballroom 2-3

11:40 KST

One Binary, Every Package Manager: Shipping a Rust CLI To PyPI, Npm, Homebrew, Winget, and Beyond - Ajit Kumar, Independent
Tuesday August 11, 2026 11:40 - 12:10 KST
Most dev tools die in obscurity because installation friction kills adoption before the first command is run. This talk provides a battle-tested playbook for solving that problem using evnx—a Rust CLI for validating and secret-scanning `.env "files"—as a real-world case study.

Launched in early 2026, evnx achieved thousands of cross-ecosystem downloads within weeks by treating distribution as a first-class engineering concern. The session breaks down a complete distribution matrix:

Registry Packaging: Using crates.io as a source of truth, Maturin for native Python wheels, and npm wrappers for platform-specific binaries.
OS Package Managers: Automating Homebrew formulas via cargo-dist, plus Scoop and Winget submissions.
Developer Integration: GitHub Actions with SARIF output, pre-commit hooks, and lightweight Docker CI images.
Supply-Chain Security: Leveraging PyPI Trusted Publishing (OIDC), provenance attestations, and cosign for signed containers.

Attendees will receive reusable GitHub Actions workflows and manifest templates from the public evnx repo to ship any Rust CLI across ecosystems without sacrificing security or maintainer sanity.
Speakers
avatar for Ajit Kumar

Ajit Kumar

Researcher and Software Developer

Tuesday August 11, 2026 11:40 - 12:10 KST
Chrysanthemum

12:10 KST

Lunch
Tuesday August 11, 2026 12:10 - 13:35 KST

Tuesday August 11, 2026 12:10 - 13:35 KST
Grand Ballroom Foyer

13:35 KST

Beyond Round-Robin: GPU-Aware Load Balancing for LLM Inference in Kubernetes - Seokhwan Kong, NETLOX
Tuesday August 11, 2026 13:35 - 14:05 KST
Standard load balancers route LLM requests without awareness of KV-cache state or GPU queue depth — causing inflated Time-To-First-Token and wasted accelerator capacity.

loxilb closes this gap with an eBPF-native AI gateway. The L4 layer uses XDP/TC and kernel sockmap for zero-copy forwarding. The L7 layer adds API-key validation, token-quota enforcement, and accelerator-aware routing via Consistent Hashing with Bounded Loads (CHWBL).

This talk focuses on KV-exact routing for P/D disaggregated deployments. When requests share a prompt prefix, prefill GPUs recompute identical KV tensors repeatedly — wasting up to 80% of GPU cycles. Tier 1.5 eliminates this: loxilb tokenizes the prompt in-process (HuggingFace Rust tokenizer via CGO), computes block hashes matching vLLM's internal format, and routes to the exact GPU already holding those KV blocks — via a live inventory fed by vLLM's native ZMQ event stream.

Unlike serving-layer schedulers (llm-d, Dynamo), this runs in the eBPF data-plane hot path with no Kubernetes dependency — works on bare metal, VMs, and BlueField DPUs.

We'll trace a live request through the P/D testbed and share lessons from building GPU-state-aware routing
Speakers
avatar for Seokhwan Kong

Seokhwan Kong

CO-CEO & CTO, NETLOX
SeokHwan Kong is CTO and Co-Founder of NetLOX and creator of LoxiLB, an open-source eBPF-powered cloud-native load balancer. I holds a Ph.D. from Yonsei University with 15+ years in networking, SDN, Kubernetes, and Telco/5G. He has published at IEEE Future Networks World Forum (2024... Read More →
Tuesday August 11, 2026 13:35 - 14:05 KST
Orchid 2

13:35 KST

KNOD: In-Kernel Network Offload Device for GPU-Accelerated Packet Processing - Taehee Yoo, Rebellions & Hoyeon Lee, SUSE
Tuesday August 11, 2026 13:35 - 14:05 KST
GPU compute resources remain inaccessible to the Linux kernel without userspace runtimes (ROCm, CUDA), forcing network packet processing to stay CPU-bound regardless of available GPU capacity.
We present KNOD (in-Kernel Network Offload Device), a Linux kernel framework that enables GPU-accelerated network packet processing entirely within the kernel. KNOD compiles and launches GPU kernels natively, dispatching packet processing workloads directly to the GPU — no ROCm, no userspace involvement, using only publicly documented hardware interfaces.
XDP offload is KNOD's first use case. Building on our LPC 2025 talk, this talk covers: in-kernel GPU kernel compilation targeting AMD GFX9/Vega ISA with post-processing optimizations (latency: ~118µs → ~46µs); a lock-free GPU-side queue between NIC RX path and GPU compute; and branch divergence mitigation to maximize SIMT utilization across heterogeneous packet flows.
At equivalent throughput (~32 Mpps), KNOD reduces system-wide CPU utilization from 40% (CPU-based XDP) to 17-20%. We discuss lessons from MACsec/WireGuard offload attempts and the path toward KNOD as a general kernel framework for GPU-accelerated network functions.
Speakers
avatar for Taehee Yoo

Taehee Yoo

System Software Engineer, Rebellions
Taehee Yoo is a contributor to the Linux Networking Stack and a maintainer of the AMT module in the networking stack. And he is working as a Kernel engineer at Rebellions. He is currently focusing on KNOD project.
avatar for Hoyeon Lee

Hoyeon Lee

BPF System Engineer, SUSE
Hoyeon Lee is a BPF System Engineer at SUSE, focusing on BPF and system internals in the Linux kernel. He enjoys sharing knowledge through writing and speaking, and has been active in open source communities and technical translation.
Tuesday August 11, 2026 13:35 - 14:05 KST
Orchid 1

13:35 KST

Optimizing ML Inference Across Heterogeneous Accelerators - Sanjiban Sengupta, CERN, University of Manchester
Tuesday August 11, 2026 13:35 - 14:05 KST
Machine learning is central to high-energy physics,.These environments impose strict latency, throughput, and memory constraints, requiring data processing at unprecedented rates. Addressing these demands requires efficient, hardware-aware inference across heterogeneous architectures.

The ML4EP team at CERN is developing an integrated ecosystem for this purpose. We present recent advances in efficient ML deployment. First, aie4ml ports trained models to next-generation AMD FPGAs for ultra-low-latency inference. PQuantML complements this with hardware-aware pruning and quantization, reducing model size and compute cost while preserving performance.

For CPU and GPU inference, SOFIE translates trained models into optimized C++ for heterogeneous systems. Using alpaka, it enables backend-agnostic execution while minimizing data movement. It integrates with PQuantML to support quantized models and is deployable in both online systems and offline workflows.
Speakers
avatar for Sanjiban Sengupta

Sanjiban Sengupta

Doctoral Student at CERN, University of Manchester, CERN, University of Manchester
Sanjiban is a Doctoral Student at CERN, affiliated with the University of Manchester, researching ML inference optimization for the LHC. He contributed to SOFIE, focusing on Keras/PyTorch parsing, ONNX-based operators, and GNN support. He was a CERN Summer Student (2022) and a GSoC... Read More →
Tuesday August 11, 2026 13:35 - 14:05 KST
Grand Ballroom 2-3

13:35 KST

Panel: Realizing Sovereign AI: Strategies for Korea’s Tech Sovereignty and AI Independence Via Open Source - Yongkook Kim, IBM; Hong-Seok Kim, Rebellions; Rosa (Hyun Kyong) Lee, Korea Information Society Development Institute & Carlos Costa, IBM Research
Tuesday August 11, 2026 13:35 - 14:05 KST
When global AI development being centralized around proprietary "black-box" models, the demand for Sovereign AI has become a national priority for many countries, including South Korea. True sovereignty requires more than just local data or local LLMs; it demands independence across the entire stack—from silicon up to the software services, as well as AI model itself. This panel challenges the misconception that global technology leaders are incompatible with national goals, demonstrating instead how open-source collaboration is the only viable path to technical and data independence as foundation for Sovereign AI.
Speakers
avatar for Alex Kim

Alex Kim

CTO for Strategic Ecosystem Partnership at IBM, IBM
Yongkook (Alex) is an OSS advocate, and a tech leader with 25+ years in R&D and IT architecture. He started as a security chip engineer at IBM Poughkeepsie, then worked as an enterprise IT architect for financial clients like Morgan Stanley and DTCC. Alex co-founded the Linux Foundation's... Read More →
avatar for Hong-Seok Kim

Hong-Seok Kim

Chief Software Architect, Rebellions
Hong-Seok is the Chief Software Architect at Rebellions, an AI accelerator startup based in Korea. He is also one of the maintainers for PyTorch Korea, leading its Core Special Interest Group. Before joining Rebellions, he was at Google as an Engineering Director and worked on it... Read More →
avatar for Rosa (Hyun Kyong) Lee

Rosa (Hyun Kyong) Lee

AI Social Policy Group Leader, Research Fellow, Korea Information Society Development Institute,
Dr. Lee, is a AI Social Policy Group Leader at the Department of AI Policy Research, the Korea Information Society Development Institute (KISDI). Her research area covers policy for human-centered artificial intelligence (AI), AI ethics education, digital transformation and digital... Read More →
avatar for Carlos Costa

Carlos Costa

Distinguished Engineer, IBM Research
Dr. Costa is an IBM Distinguished Engineer leading efforts to build a next-generation cloud-native platform for AI. He has been involved in multiple projects in the areas of large-scale AI/ML, HPC and analytics, including the BlueGene/Q system, the Active Memory Cube (AMC) architecture... Read More →
Tuesday August 11, 2026 13:35 - 14:05 KST
Chrysanthemum

14:15 KST

How I Tricked ArgoCD Into Sharding on a Single Cluster - Faeka Ansari, Slice Financial Bank
Tuesday August 11, 2026 14:15 - 14:45 KST
One controller at 846m CPU. The other at 6m. Both running. Both supposedly doing the same job.

We had two ArgoCD application controllers. Scaling looked solved on paper. Except -- one was doing all the work and the other was just... sitting there.

The thing nobody tells you when you first set up ArgoCD HA is that sharding works at the cluster level, not the application level (sadly). So when you only have one cluster -- it doesn't matter how many controller replicas you spin up. The work doesn't split. It all piles onto one pod.

We were running 300+ apps across 11 EKS clusters in a banking environment. Dropping availability was not an option. So we went digging.

What we found was not in the official docs. It was a trick - something so counterintuitive that the first time I thought of it, I laughed. Then I tried it. Then it worked.

This talk is the story of that fix -- how we found it, why it works, and how we shipped it safely to production using Terraform without touching a single app config.
Speakers
avatar for Faeka Ansari

Faeka Ansari

Senior Software Engineer | CNCF Ambassador, Slice Financial Bank
Faeka is Senior Software Engineer at fintech startup, an International technical speaker and helping maintain open-source K8s-native projects. She is a Kubernetes Release team member and was an Linux Fn. mentee under Istio. She leads several community initiatives across CNCF, Google... Read More →
Tuesday August 11, 2026 14:15 - 14:45 KST
Orchid 2

14:15 KST

Advanced Performance Profiling Using Perf Tools - Namhyung Kim, Google
Tuesday August 11, 2026 14:15 - 14:45 KST
Explore advanced Linux perf techniques for sophisticated performance analysis. This session delves into specialized areas beyond typical CPU profiling:

* Data Type Profiling: Learn how to associate memory access hotspots with specific data structures and fields, gaining insights into data layout efficiency.
* Latency Profiling: Discover methods to identify and analyze serial execution segments within parallel programs that act as scalability bottlenecks.
* Kernel Lock Contention profiling: Understand how to use perf to detect and analyze kernel-level lock contention issues.
* And more..

This talk will cover the concepts and perf tool methodologies to diagnose these critical performance aspects in both user-space and kernel contexts, empowering you to pinpoint elusive performance problems.
Speakers
avatar for Namhyung Kim

Namhyung Kim

Software Engineer, Google
Namhyung Kim is a software engineer at Google. He is a co-maintainer of the Linux perf tools and the creator of uftrace project.
Tuesday August 11, 2026 14:15 - 14:45 KST
Orchid 1

14:15 KST

The Era of Vibe Coding: Why High-Skill Engineers Are More Critical Than Ever - Yongjin Lee, Songnae High-school
Tuesday August 11, 2026 14:15 - 14:45 KST
"Let’s be real: My AI has dementia."
"Vibe-coding in real: Me and My AI is stupid"
Welcome to the "Vibe Coding" era, where anyone can prompt their way to complex code. But as a 17-year-old developer who "bullied" LLMs to build a custom Linux Based OS (MaruxOS) from scratch, I’ve seen the ugly truth: AI is a brilliant assistant, but a chronic liar with a 5-minute memory.
In this session, I expose the messy reality of AI-native development. I’ll share how I battled "Contextual Dementia"—where AI forgets my ARM64 architecture mid-patch—and survived "Mindless Yes-Clicking Syndrome," where a single trusted prompt almost nuked my entire glibc.
Key Takeaways:
The Hallucination Hunter: Sniffing out AI’s "confident bullshit."
The Dementia Doctor: Managing AI’s memory loss to maintain architectural integrity.
The Final Decider: Moving beyond the "Yes-Clicking" trap to take true technical ownership.
AI might be writing the code, but only a real engineer can stop it from burning the house down.
Speakers
avatar for Yongjin Lee

Yongjin Lee

Student, Developer, Songnae High-school
A normal(maybe?) Student from Korea Developer of Korean Transportation app "LIINK" Developer of Open Source Project MaruxOS Loves developing things for societyFounder of MaruLab
Tuesday August 11, 2026 14:15 - 14:45 KST
Grand Ballroom 2-3

14:15 KST

From Contribution To Culture: 14 Years of Building an OSPO That Outgrew Itself - Darae Ahn, Samsung Electronics
Tuesday August 11, 2026 14:15 - 14:45 KST
Over the past decade, many organizations have established OSPOs to manage open source usage and compliance. However, building a sustainable open source culture requires more than policies and processes.

This session shares a 14-year journey of an OSPO that evolved from a contribution-focused group into a broader organization encompassing usage, compliance, and internal enablement.

It explores how open source practices were embedded into engineering culture through project incubation, developer engagement, and internal leadership programs. Over time, these efforts led to a shift where open source activities became self-sustaining, with teams proactively initiating projects and contributions.

The session also reflects on an unexpected outcome: talent mobility. As internal open source leaders grew, many moved on to new opportunities, revealing both retention challenges and the broader impact of cultivating open talent.

Key lessons include the balance between control and autonomy, the role of leadership in cultural change, and how open source can be viewed not only as a compliance requirement, but as a long-term investment in culture and organizational brand.
Speakers
avatar for Darae Ahn

Darae Ahn

Staff Engineer, Open Source Group, Samsung Electronics
I have over 12 years of experience in open source at Samsung Electronics, where I have built and scaled open source programs. My work spans contribution, policy, compliance, and tooling, strengthening organizational capabilities.

I also participate in governance discussions within the community, focusing on how to sustain open source values in evolving development environments... Read More →
Tuesday August 11, 2026 14:15 - 14:45 KST
Chrysanthemum

14:55 KST

The Butterfly Effect of a Broken Disk: Top-Down Ceph Troubleshooting To Upstream Contribution - Sangyun Lee, CJ Olivenetworks
Tuesday August 11, 2026 14:55 - 15:25 KST
Tech blogs usually talk about huge Ceph clusters with thousands of disks. But in reality, many of us run smaller on-prem setups. I will share my real experience of debugging a small Ceph cluster (10 nodes, 10 NVMe, 15 normal SSDs) and how tracking a slow app led me to write an upstream C++ patch.

It started when our Valkey (Redis) pods suffered from severe write latency. We checked CephFS metadata and Istio network metrics, but they were fine. So we dug into the storage layer using ceph osd perf. We saw huge latency on one specific node. Looking at the kernel logs (dmesg -k), we found a failing NVMe disk. I will explain the "Slow OSD" issue—how one broken disk can freeze a 3-replication cluster.

During this outage, reading ceph osd perf was very frustrating because the OSD IDs were completely unordered. Since it made debugging harder, I decided to fix it. I looked into Ceph's C++ code, changed the unordered hash map to a sorted vector (std::sort), and opened PR #67915 (https://github.com/ceph/ceph/pull/67915). I will share my experience discussing the fix with Ceph maintainers and why I believe engineers should fix the open-source tools they use.
Speakers
avatar for Sangyun Lee

Sangyun Lee

AI Platform Engineer @ CJ Olivenetworks AI Research Lab., CJ Olivenetworks
Sangyun Lee is a 1st-year AI Platform Engineer at CJ Olivenetworks AI Research Lab, building on-prem K8s AI platforms using Ceph, Istio, ArgoCD, and Airflow. As a Kubestronaut (aiming for Golden status by August), he explores the depths of vLLM inference and low-level infrastructure... Read More →
Tuesday August 11, 2026 14:55 - 15:25 KST
Orchid 2

14:55 KST

Bridge X86-64 Applications To ARM64 and RISC-V With Dynamic Binary Translation - Jim Huang & Chi-Kuan Chiu, National Cheng Kung University
Tuesday August 11, 2026 14:55 - 15:25 KST
Running unmodified x86-64 binaries on ARM64 and RISC-V Linux systems remains important, but existing solutions often involve high overhead or depend on platform-specific hardware. Box64 addresses this gap with a wrapping-first architecture that prioritizes native execution by routing calls into host libraries through ABI translation, while reserving JIT compilation for guest code that cannot be directly mapped.

This session presents the core design of Box64 in three parts. First, a native wrapping layer that spans hundreds of libraries, handling entry-point interception and callback bridging. Second, a multi-pass JIT compiler that applies optimizations such as flag liveness analysis and deferred flag computation. Third, system-level integration techniques that enable practical deployment, including container bypass, mixed-bitness coordination with Wine, and memory ordering support on weakly ordered architectures.

We will also share performance results on ARM64 and RISC-V platforms, showing how Box64 achieves near-native performance in favorable cases and significant speedups over traditional emulation approaches.
Speakers
avatar for Jim Huang

Jim Huang

Assistant Professor, National Cheng Kung University
Drawing from his contributions to the Android Open Source Project (AOSP), Jim specializes in real-time performance tuning and optimization of Linux-based automations. Additionally, he is a co-founder of the LXDE project, a lightweight desktop environment widely utilized in embedded... Read More →
avatar for Chi-Kuan Chiu

Chi-Kuan Chiu

Student, National Cheng Kung University
Chi-Kuan Chiu is a Computer Science student at National Cheng Kung University, Taiwan. He contributes to Box64, the open-source dynamic binary translator that runs x86-64 applications on ARM64 and RISC-V, and builds eBPF-based tooling to profile it across platforms. He will begin... Read More →
Tuesday August 11, 2026 14:55 - 15:25 KST
Orchid 1

14:55 KST

Open Source AI Agents on User-Owned Infra: K3s, MCP, and GPU Sharing in Practice - Miley Fu, OlaresOS
Tuesday August 11, 2026 14:55 - 15:25 KST
Most open-source AI demos stop at a chatbot, but real users want agents that can work with private files, call tools, and run close to their important data without sending everything to the public cloud.

This talk shows a practical reference architecture for private agent workflows on a single-node K3s-based personal cloud, including local model serving, private knowledge bases, app sandboxing, secure remote access, and multiple GPU allocation modes for competing AI workloads.

Using an open-source personal cloud OS as a case study, I will share what works, what breaks, and which design choices matter when you try to make local AI usable by developers, creators, and small teams rather than just homelab experts.
Speakers
avatar for Miley Fu

Miley Fu

DevRel, OlaresOS
Miley us the co-chair and keynote speaker for KubeCon+Open Source Summit and AI Dev China 2024. She works on WasmEdge runtime under Linux Foundation as the founding member for over 6 years. She talks at KubeCon, KCD Beijing+Shenzhen, CloudDay Italy, DevRelCon, Open Source Summit Japan... Read More →
Tuesday August 11, 2026 14:55 - 15:25 KST
Grand Ballroom 2-3

14:55 KST

How AI Is Changing Open Source Communities: Lessons From OpenEuler - Jianmin Wang, openEuler Community
Tuesday August 11, 2026 14:55 - 15:25 KST
Artificial Intelligence is reshaping how software is developed and maintained. From code generation to automated reviews, AI tools are increasingly influencing how open source communities collaborate. This also introduces new challenges, including how to handle AI-generated contributions, maintain trust and code quality, and define governance for AI-assisted workflows.

In this session, we share experiences from the openEuler community in integrating AI into development processes, including AI-assisted code review, package maintenance, and community guidelines for AI usage, as well as work on frameworks such as Intelligence BooM.

We will discuss how these changes affect contributor workflows, what challenges maintainers face in practice, and what approaches have worked so far. The goal is to provide practical reference points for other open source communities exploring similar directions.
Speakers
avatar for Jimmie Wang

Jimmie Wang

Senior Software Engineer, openEuler Community
Jimmie Wang has over a decade of experience in system software and open source, focusing on operating systems, privacy and data, AI Security. He is a core contributor to the openEuler community, serving on the Technical Committee and maintaining multiple SIGs. He is a frequent speaker... Read More →
Tuesday August 11, 2026 14:55 - 15:25 KST
Chrysanthemum

15:25 KST

Afternoon Break
Tuesday August 11, 2026 15:25 - 15:55 KST

Tuesday August 11, 2026 15:25 - 15:55 KST
Grand Ballroom Foyer

15:55 KST

From Region To Multi-AZ: Building Resilient Cloud Infrastructure With OpenStack, Kubernetes, and OVN - 승진 한, kt cloud
Tuesday August 11, 2026 15:55 - 16:25 KST
As cloud service providers evolve their infrastructure, Multi-AZ architecture becomes essential for service continuity, failure isolation, and operational resilience. However, building a Multi-AZ cloud with open source technologies is not simply about spreading components across data centers. It requires design decisions across compute, networking, storage, observability, automation, and failure validation.

This session shares a CSP’s journey from a region-centric architecture toward a Multi-AZ cloud model using open source technologies such as OpenStack, Kubernetes, OVN, Kube-OVN, OpenStack-Helm, Cluster API, Ceph, and cloud native observability tools.

The talk will cover regional versus AZ-local service design, OpenStack control plane deployment on Kubernetes, zone-aware networking, storage replication, image availability, observability, and DR validation. It will also discuss trade-offs in traffic locality, failover behavior, data consistency, and complexity.

Rather than presenting a vendor-specific platform, this session focuses on reusable architecture patterns and lessons for cloud operators who want to build resilient open cloud infrastructure with open source software.
Speakers
avatar for John Haan

John Haan

cloud engineer, kt cloud
John Haan is a cloud platform engineering leader at kt cloud, focusing on open source-based cloud infrastructure, OpenStack, Kubernetes, and cloud native operations. His work includes OpenStack-on-Kubernetes architecture, Multi-AZ cloud design, automation, observability, and resilient... Read More →
Tuesday August 11, 2026 15:55 - 16:25 KST
Orchid 2

15:55 KST

Maximizing Heterogeneous Memory Bandwidth in Multi-Socket Systems - Rakie Kim, SK Hynix; Yunjeong Mun & Honggyu Kim, SK hynix
Tuesday August 11, 2026 15:55 - 16:25 KST
The advent of heterogeneous memory like CXL allows systems to secure additional bandwidth, but effectively utilizing it remains challenging. While widely adopted memory tiering optimizes latency by migrating hot pages to fast memory, bandwidth-intensive workloads require utilizing multiple memory tiers simultaneously.

To address this, Weighted Interleaving was introduced to distribute pages proportionally based on each node's bandwidth. However, it has a critical limitation: it ignores the physical topology of multi-socket systems, often leading to inefficient cross-socket memory accesses.

To resolve this, we introduce Socket-aware Weighted Interleave, an advanced memory policy currently being upstreamed to the mainline Linux kernel. It recognizes multi-socket boundaries, preventing performance degradation from unintended remote accesses and maximizing total throughput via localized allocation.

This advanced policy is a newly integrated feature of HMSDK, an open-source toolkit introduced last year. This session will focus primarily on this new capability, exploring how HMSDK efficiently manages large-scale CXL and heterogeneous memory systems.
Speakers
avatar for Honggyu Kim

Honggyu Kim

Principal Software Engineer, SK hynix
Honggyu Kim is a principal software engineer and team lead at SK hynix. His interests are memory profiling and management in the Linux kernel especially focusing on CXL memory expansion solutions. He has also worked on tracing, binary analysis tools as well as performance and memory... Read More →
avatar for Yunjeong Mun

Yunjeong Mun

engineer, SK hynix
Yunjeong Mun is a senior engineer at SK hynix. Her research interest is emerging memory system software and performance analysis.
avatar for Rakie Kim

Rakie Kim

Principal Software Engineer, SK Hynix
Rakie Kim is a Principal Software Engineer at SK hynix, specializing in Linux kernel memory management. He focuses on building solutions to maximize system performance using heterogeneous memory architectures like CXL. Notably, he serves as a reviewer for the Linux kernel's Memory... Read More →
Tuesday August 11, 2026 15:55 - 16:25 KST
Orchid 1

15:55 KST

MCP Adoption and Why OSPO Skills Matter - Ana Jiménez Santamaría, Linux Foundation (TODO Group and PyTorch Foundation)
Tuesday August 11, 2026 15:55 - 16:25 KST
As organizations explore the Model Context Protocol (MCP), many of the early conversations naturally happen around AI tooling, integrations, and experimentation. At the same time, MCP may also raise questions that are familiar to Open Source Program Offices (OSPOs) and related teams or professionals with open source management experience, including governance, contribution strategy, standards engagement, and cross-functional coordination

This talk reflects on what organizations might gain by bringing those perspectives into MCP discussions earlier, as well as into their broader Agentic AI strategy. Rather than treating MCP only as a technical AI topic, the session will explore it as an area where open source process knowledge may also add value. Attendees will leave with ideas for how OSPOs can contribute to MCP adoption in practical, collaborative, and organization-aware ways
Speakers
avatar for Ana Jiménez Santamaría

Ana Jiménez Santamaría

Project Manager, Linux Foundation (TODO Group and PyTorch Foundation)
Ana is a Sr. Project Manager at the Linux Foundation, where she supports global open source communities and drives strategic initiatives across the TODO Group and the PyTorch Foundation. She collaborates with CTOs, engineering teams, and business units to promote Open Source management... Read More →
Tuesday August 11, 2026 15:55 - 16:25 KST
Grand Ballroom 2-3

15:55 KST

Skills-as-Packages: A Package Manager for AI Agent Skills - Brahada Srinivas, Amazon
Tuesday August 11, 2026 15:55 - 16:25 KST
AI agents like Claude Code, Cursor, and Codex learn libraries via SKILL.md files, but these skills are currently unversioned, ungoverned, and unshared. We solved code dependency management with pip and npm — now it's time to solve it for AI knowledge.
This talk presents an open-source, package-manager-style system for agent skills. Skills are linked to their packages, versioned with semver, declared in skills.toml, and locked via skills-lock.toml — just like regular dependencies.
The CLI (skills add, install, lock, publish) feels native to any developer using pip or uv.
We'll cover:

The SKILL.md open standard (YAML frontmatter + Markdown) — model-agnostic and runtime-agnostic
Manifest format supporting version constraints, inheritance, and monorepo scoping
Resolver that enforces constraint narrowing across org hierarchies
Registry with publishing, discovery, approval workflows, and security scanning
Real cases where this prevented production incidents by keeping agents on correct, up-to-date patterns

Live demo: Add a skill, resolve dependencies, publish it, and watch a new engineer's agent instantly get the right knowledge - no onboarding docs required.
Speakers
avatar for Brahada Srinivas

Brahada Srinivas

Ms, Amazon
Brahada Srinivas is a senior engineer working at Amazon focused on developer productivity and AI-assisted workflows. She designs systems at the intersection of package management and AI agent governance - making sure agents don't just write code, but write the right code. He is the... Read More →
Tuesday August 11, 2026 15:55 - 16:25 KST
Chrysanthemum

16:35 KST

What Happens When Your AI Agent Meets OPA - Jyoti Bisht, Harness
Tuesday August 11, 2026 16:35 - 17:05 KST
Tom and Jerry has run for over 80 years. Every episode follows the same plot: Tom builds an elaborate trap, Jerry walks straight through it, the house is destroyed, and the owner blames Tom. Sound familiar?

This talk is structured exactly like a Tom and Jerry cartoon except Tom is OPA and Jerry is your AI agent. Jerry is not malicious. He just wants the cheese. He will find every gap in every policy, squeeze through every webhook, and retreat to his mouse hole (MCP) the moment Tom gets close. The audience will root for Jerry. Jerry is still the problem.

We walk through three episodes. Episode one: Jerry discovers he can call kubectl delete and Tom's first policy stops him — but not before he's renamed two deployments. Episode two: Jerry finds a namespace Tom forgot to cover and provisions a GPU node at $8/hour. Episode three: Jerry and Tom finally cooperate — the agent runs a legitimate right-sizing workflow, OPA approves every step, and the cluster is actually better for it.


You'll leave with an OPA policy library for agentic tool-call governance, Argo Workflows, and a threat model built from real agent misbehaviour.
Speakers
avatar for Jyoti Bisht

Jyoti Bisht

Senior Developer Relations Engineer, Harness
Jyoti Bisht is a Senior Developer Relations Engineer with 4+ Years of experience working at the intersection of cloud infrastructure, open source and community building. A CNCF community member, GSoC contributor, and MLH pod leader, she has spoken at DevRelCon, etc. When she is not... Read More →
Tuesday August 11, 2026 16:35 - 17:05 KST
Orchid 2

16:35 KST

Flexible Paging in Linux: Per-process Page Size - Dev Jain, Arm Ltd
Tuesday August 11, 2026 16:35 - 17:05 KST
Applications often run faster on a 64K page size kernel than on 4K. This is because larger pages reduce TLB pressure and pagetable walk overhead, greatly improving memory access speed. But, sysadmins are hesitant to use a 16K/64K kernel as most applications have been historically tuned for 4K pages, and larger pages can increase memory waste due to internal fragmentation. This creates a tradeoff between performance and memory waste.

We propose a design that changes the game by separating the page size of the user and the kernel. Instead of enforcing a single system-wide page size, processes can operate under different page-size ABIs. A performance-critical app can use a 64K page-size ABI while still running on a 4K kernel. We achieve this by enabling Linux to provide memory in 64K chunks via 16 contiguous 4K pages, and by translating operations on the 4K page table into a 64K “native” page table that serves as the actual hardware page table.

At the same time, lightweight or memory-sensitive applications can continue using 4K pages to minimize memory waste. All of this happens on the same Linux kernel, without requiring separate builds of the kernel.
Speakers
avatar for Dev Jain

Dev Jain

Software Engineer, Arm Ltd
Dev is a software engineer at Arm India. He does Linux kernel development as part of the kernel Memory Management Performance team at Arm. Besides the kernel, he is interested in algorithms and mathematics.
Tuesday August 11, 2026 16:35 - 17:05 KST
Orchid 1

16:35 KST

Benchmarking Beyond OpenSearch: Multi-Engine Vector Search Performance With OSB - Mike Oviedo, AWS
Tuesday August 11, 2026 16:35 - 17:05 KST
As vector search becomes foundational to AI workloads, teams need reliable ways to evaluate engine performance before committing to one. We extended OpenSearch Benchmark (OSB) to support Milvus and Vespa alongside OpenSearch, making it possible to compare throughput, latency, and recall across engines using the same datasets and query patterns.

In this talk, we'll walk through how OSB's new engine-as-module architecture lets you benchmark any search engine with a single CLI command. We'll show how customizable workload parameters control everything from HNSW graph construction to bulk ingestion strategies, and demonstrate how to visualize comparative results in OpenSearch Dashboards, turning raw benchmark data into actionable performance insights.

Whether you're evaluating engines for a new project or optimizing an existing deployment, you'll leave with practical knowledge of how to run your own benchmarks and how to extend OSB to support additional engines by implementing five functions.
Speakers
avatar for Mike Oviedo

Mike Oviedo

Software Engineer, AWS
Michael Oviedo is a software engineer at AWS, working on performance and benchmarking tools for OpenSearch. He is a maintainer of the OpenSearch Benchmark project. Outside of work he enjoys playing golf and visiting national parks.
Tuesday August 11, 2026 16:35 - 17:05 KST
Grand Ballroom 2-3

16:35 KST

Computer Programming Is Dead; Long Live AI-First Programming - Stephen Chin, Neo4j & Cassandra Chin, Independent
Tuesday August 11, 2026 16:35 - 17:05 KST
Computer science graduates are facing an increasingly difficult job market. Recent data shows a sharp decline in employment outcomes for computer science majors, highlighting the mismatch between what universities teach and what employers now demand. The traditional model of teaching syntax first and hoping students eventually build something useful is no longer working. In this keynote we argue that programming as we knew it is effectively dead. The future lies in AI-First programming, built on the simple loop of try, learn, and grow. Learners try building code with AI assistance, learn by unpacking the generated code and asking AI for detailed explanations, and grow by testing and extending real applications. This loop not only builds confidence but also ensures we grow the generation of AI engineers that companies are desperate to hire.
Speakers
avatar for Stephen Chin

Stephen Chin

VP of Developer Relations, Neo4j
Stephen Chin is VP of Developer Relations at Neo4j and author of numerous titles including the upcoming GraphRAG: The Definitive Guide for O'Reilly. He has given keynotes and main stage talks at numerous conferences around the world including AI Engineer Summit, AI DevSummit, Devoxx... Read More →
avatar for Cassandra Chin

Cassandra Chin

Java Champion, Book Author, Keynote Speaker, Kids Workshop Instructor, Independent
Cassandra Chin is a keynote speaker, book author, podcast host, children's workshop instructor, and a computer science student. She has been teaching technology kids workshops at international conferences since she was 13 years old and is passionate about helping allow women, minorities... Read More →
Tuesday August 11, 2026 16:35 - 17:05 KST
Chrysanthemum

17:15 KST

From Frankenstein To Kamaji: Lessons in Building a Single CAPI Cluster Across Multiple Providers - Antonia von den Driesch, Giant Swarm
Tuesday August 11, 2026 17:15 - 17:45 KST
At Giant Swarm we use Cluster API to provision and bootstrap our k8s clusters. With this setup, control plane (CP) and worker nodes must run on the same infrastructure which was never an issue so far...

However, in bare-metal environments, using 128-core servers for CP nodes is luxury. It's far more efficient to host them as virtual machines on a hypervisor while keeping workers on physical hardware. But can we get around CAPI's limitations?

We will walk through how we built Frankenstein's cluster by mixing vSphere for the CP and Proxmox for workers as a testing ground. While technically functional, this required "hacky engineering". We will share the hurdles we hit and the operational risks of this hybrid cluster setup.

Finally, we will demonstrate how we solved this challenge with a cleaner, upstream-friendly alternative. Kamaji lets us run the CP as pods in a management cluster. We achieved even better resource optimisation with full native community support and no custom hacks.
Speakers
avatar for Antonia von den Driesch

Antonia von den Driesch

Platform Engineer, Giant Swarm
Antonia has been a platform engineer at Giant Swarm for 5 years and is currently working on development of Giant Swarms Industrial IoT platform which brings their managed Kubernetes product to Smart Factory customers.
Tuesday August 11, 2026 17:15 - 17:45 KST
Orchid 2

17:15 KST

Implementing a Stateless V4L2 Decoder Driver: Architecture and Lessons Learned - SungHo Lee, Chips&Media
Tuesday August 11, 2026 17:15 - 17:45 KST
Stateless video decoding has become the standard approach in V4L2 using the Request API, but implementing a stateless decoder driver remains complex. It requires careful handling of codec parameters, buffer management, and coordination between userspace and the kernel.

This session presents a practical overview of implementing a stateless V4L2 decoder driver based on real development experience. It covers how SPS/PPS and per-frame parameters are passed from userspace, how reference frames are managed, and how hardware constraints are handled within a generic V4L2 framework.

We also discuss common pitfalls such as synchronization issues and buffer lifecycle management, and highlight key differences from traditional stateful decoders.

Attendees will gain a clear understanding of stateless decoder architecture and practical insights for developing robust V4L2 drivers.
Speakers
avatar for Sungho Lee

Sungho Lee

Senior Principal Researcher, Chips&Media
I am a software engineer specializing in Linux media and video codec driver development. I have hands-on experience implementing V4L2 drivers for hardware encoders and decoders, including stateless decoding and advanced codec controls such as ROI and QP mapping. My work focuses on... Read More →
Tuesday August 11, 2026 17:15 - 17:45 KST
Orchid 1

17:15 KST

GitAIOps: A 4-Layer Architecture for Predictable AI-Assisted Operations - Hoon Jo, Megazone
Tuesday August 11, 2026 17:15 - 17:45 KST
AI agents have no memory between sessions. Every conversation starts from zero. Git becomes the only persistent memory an AI agent can rely on. GitAIOps is the pattern built on this principle: Git is the memory, and a 4-layer architecture defines what goes into that memory.

I applied this to a production migration: 15 Helm releases, Kafka ZooKeeper-to-KRaft, Redis-to-Valkey, full observability stack rebuild. The question: what does Git need to contain so any AI session picks up where the last one left off?

The answer is a 4-layer Git structure, each layer born from a production failure.
Layer 1: Human plans in Git (36 files, 23,854 lines). Too verbose for AI.
Layer 2: Distilled AI context in Git (6 files, 1,254 lines). 19:1 compression as a project state dashboard.
Layer 3: Command Guardrails in Git (117 files). Enforced ordering, no AI-generated commands.
Layer 4: Locked values in Git (30 files). Zero interpretation, reviewed like code.

Every AI action reads from Git, executes, and commits back. The loop is closed.

DEV: 2 weeks → 2 days. PROD: 1 week → 1 day. The session covers the architecture, each layer's failure, and real production artifacts.
Speakers
avatar for Hoon Jo

Hoon Jo

AI & Cloud-Native Engineer, Megazone
Hoon Jo is a CNCF Ambassador and Kubestronaut who has spoken at KubeCon North America, Europe, China, and India across multiple years. He is the author of multiple books on Kubernetes and AI-assisted operations. His current work focuses on building operational patterns where AI agents... Read More →
Tuesday August 11, 2026 17:15 - 17:45 KST
Chrysanthemum

17:15 KST

Personalisation and Specialisation of Search With OpenSearch Agentic Search - Cedric Pelvet & Aswath Srinivasan, AWS
Tuesday August 11, 2026 17:15 - 17:45 KST
Agentic search lets you ask in Natural Language and have OpenSearch plan and execute retrieval. The Query Planner re-writes Natural Language to OpenSearch DSL using SOTA LLMs. This works amazingly well but not without important limitations:

1/ The biggest factor is added latency for remote inference. eCommerce Search demands sub-100ms response times. Even the best results when slow lead to search abandonment.
2/ We'll demo overcoming latency by hosting SLMs locally within OpenSearch nodes using vLLM/Ollama. This is faster & cheaper, but SLMs suffer quality decline in query re-writes. We'll explore fine-tuning with domain-specific data—proving fine-tuned SLMs beat SOTA LLMs.
3/ Relevance remains generic even with Agentic Search. We'll show how to improve it using user and business contexts with hybrid search and reranking.

This hands-on talk covers:
1/ Search Latency with Remote Inference
2/ Locally hosting SLMs using vLLM/Ollama
3/ Agentic search with SLM Local Inference
4/ Improving relevance with user and business contexts
5/ Fine-tuning SLMs for domain-specific query re-writing via Axolotl/LLaMA-Factory
Speakers
avatar for Cedric Pelvet

Cedric Pelvet

Principal OpenSearch Specialist SA, Amazon Web Services
Cédric Pelvet is a Principal Specialist Solutions Architect at AWS, focusing on AI and near-realtime distributed systems for data like OpenSearch, Kafka and Flink.
avatar for Aswath Srinivasan

Aswath Srinivasan

Senior Search Engine Architect,, OpenSearch @ AWS
Aswath Srinivasan is a Senior Search Engine Architect at Amazon Web Services currently based in Munich, Germany. With over 18 years of experience in various search technologies, Aswath currently focuses on OpenSearch. He is a search and open-source enthusiast and helps customers and... Read More →
Tuesday August 11, 2026 17:15 - 17:45 KST
Grand Ballroom 2-3

17:45 KST

 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -