Loading…
11-12, August 2026
Seoul, South Korea
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit Korea 2026 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Korea Standard Time (KST), UTC +9. To see the schedule in your preferred timezone, please select from the drop-down menu to the right.
Venue: Grand Ballroom 2-3 clear filter
arrow_back View All Dates
Tuesday, August 11
 

09:00 KST

Keynote: Welcome + Opening Remarks - Jim Zemlin, CEO, The Linux Foundation
Tuesday August 11, 2026 09:00 - 09:25 KST

Speakers
avatar for Jim Zemlin

Jim Zemlin

CEO, The Linux Foundation
Jim Zemlin’s career spans three of the largest technology trends to rise over the last decade: mobile computing, cloud computing, and open source software. Today, as executive director of The Linux Foundation, he uses this experience to accelerate innovation in technology through... Read More →
Tuesday August 11, 2026 09:00 - 09:25 KST
Grand Ballroom 2-3

09:30 KST

Keynote Sessions To Be Announced
Tuesday August 11, 2026 09:30 - 10:15 KST

Tuesday August 11, 2026 09:30 - 10:15 KST
Grand Ballroom 2-3

10:15 KST

Keynote: Priya Nagpurkar, Vice President, Hybrid Cloud and AI Platforms, IBM Research
Tuesday August 11, 2026 10:15 - 10:30 KST

Speakers
avatar for Priya Nagpurkar

Priya Nagpurkar

Vice President, Hybrid Cloud and AI Platform, IBM Research

Tuesday August 11, 2026 10:15 - 10:30 KST
Grand Ballroom 2-3

11:00 KST

Squeezing Every Millisecond: A Practical Guide To Optimizing Time To First Token With OSS Muscle - Hrittik Roy, Platform Advocate
Tuesday August 11, 2026 11:00 - 11:30 KST
Large language models are getting faster GPUs every year, yet users still notice the pause before the first word appears. That pause has a name: Time To First Token (TTFT). And in production LLM systems, shaving even a few hundred milliseconds from it can dramatically change how responsive an application feels.

This talk tells the story of where those milliseconds go.

We will walk through the lifecycle of a request in modern LLM serving systems and explore the practical techniques engineers use to reduce TTFT in real deployments. Using examples from open source stacks like vLLM, TensorRT-LLM, and Hugging Face TGI, we will examine four powerful optimization levers: KV cache strategies, speculative decoding, model quantization, and batching policies.

Instead of focusing only on theory, the session highlights the tradeoffs practitioners face. When does speculative decoding actually help? When does batching hurt latency? When does quantization reduce memory pressure enough to speed up the first token?

Attendees will leave with a practical playbook for diagnosing TTFT bottlenecks and choosing the right optimization strategy for their model, infrastructure, and workload.
Speakers
avatar for Hrittik Roy

Hrittik Roy

vCluster, Platform Advocate
Hrittik is a Platform Advocate at Loft Labs and a CNCF Ambassador, with expertise in cloud native technologies and open source communities. He has contributed extensively to developer advocacy, technical writing, and community engagement. Hrittik has been a featured speaker at events... Read More →
Tuesday August 11, 2026 11:00 - 11:30 KST
Grand Ballroom 2-3

11:40 KST

Stop Trusting a Black Box: The Economic Case for Open, Sovereign AI - Vincent Caldeira, Red Hat
Tuesday August 11, 2026 11:40 - 12:10 KST
As AI matures from a novelty into a strategic asset, 79% of organisations now prioritise sovereignty to mitigate vendor lock-in and secure critical data. Despite this, the market remains paralysed by a paradox: while open models have achieved performance parity with proprietary systems at a fraction of the cost, they remain massively under-utilised due to perceived friction and trust gaps.

This session dissects this market inefficiency, with insights from LF Research revealing how closed-source dominance is often driven by inertia rather than superior capability. We will explore how to operationalise true independence by rejecting "open-washing" in favour of rigorous frameworks that demand full model completeness: verifying everything from training data to weights. Join us to learn how to transition from renting black-box APIs to architecting a transparent, reproducible, and economically superior AI stack.
Speakers
avatar for Vincent Caldeira

Vincent Caldeira

CTO APAC, Red Hat
Vincent Caldeira, Red Hat APAC CTO and Industry Visiting Scholar at Columbia University, drives tech strategy and emerging engineering. A Top 10 APAC CTO (2023) with 20+ years in finance IT, he is an authority on open source, cloud-native technologies and AI. Vincent holds leadership... Read More →
Tuesday August 11, 2026 11:40 - 12:10 KST
Grand Ballroom 2-3

13:35 KST

Optimizing ML Inference Across Heterogeneous Accelerators - Sanjiban Sengupta, CERN, University of Manchester
Tuesday August 11, 2026 13:35 - 14:05 KST
Machine learning is central to high-energy physics,.These environments impose strict latency, throughput, and memory constraints, requiring data processing at unprecedented rates. Addressing these demands requires efficient, hardware-aware inference across heterogeneous architectures.

The ML4EP team at CERN is developing an integrated ecosystem for this purpose. We present recent advances in efficient ML deployment. First, aie4ml ports trained models to next-generation AMD FPGAs for ultra-low-latency inference. PQuantML complements this with hardware-aware pruning and quantization, reducing model size and compute cost while preserving performance.

For CPU and GPU inference, SOFIE translates trained models into optimized C++ for heterogeneous systems. Using alpaka, it enables backend-agnostic execution while minimizing data movement. It integrates with PQuantML to support quantized models and is deployable in both online systems and offline workflows.
Speakers
avatar for Sanjiban Sengupta

Sanjiban Sengupta

Doctoral Student at CERN, University of Manchester, CERN, University of Manchester
Sanjiban is a Doctoral Student at CERN, affiliated with the University of Manchester, researching ML inference optimization for the LHC. He contributed to SOFIE, focusing on Keras/PyTorch parsing, ONNX-based operators, and GNN support. He was a CERN Summer Student (2022) and a GSoC... Read More →
Tuesday August 11, 2026 13:35 - 14:05 KST
Grand Ballroom 2-3

14:15 KST

The Era of Vibe Coding: Why High-Skill Engineers Are More Critical Than Ever - Yongjin Lee, Songnae High-school
Tuesday August 11, 2026 14:15 - 14:45 KST
"Let’s be real: My AI has dementia."
"Vibe-coding in real: Me and My AI is stupid"
Welcome to the "Vibe Coding" era, where anyone can prompt their way to complex code. But as a 17-year-old developer who "bullied" LLMs to build a custom Linux Based OS (MaruxOS) from scratch, I’ve seen the ugly truth: AI is a brilliant assistant, but a chronic liar with a 5-minute memory.
In this session, I expose the messy reality of AI-native development. I’ll share how I battled "Contextual Dementia"—where AI forgets my ARM64 architecture mid-patch—and survived "Mindless Yes-Clicking Syndrome," where a single trusted prompt almost nuked my entire glibc.
Key Takeaways:
The Hallucination Hunter: Sniffing out AI’s "confident bullshit."
The Dementia Doctor: Managing AI’s memory loss to maintain architectural integrity.
The Final Decider: Moving beyond the "Yes-Clicking" trap to take true technical ownership.
AI might be writing the code, but only a real engineer can stop it from burning the house down.
Speakers
avatar for Yongjin Lee

Yongjin Lee

Student, Developer, Songnae High-school
A normal(maybe?) Student from Korea Developer of Korean Transportation app "LIINK" Developer of Open Source Project MaruxOS Loves developing things for societyFounder of MaruLab
Tuesday August 11, 2026 14:15 - 14:45 KST
Grand Ballroom 2-3

14:55 KST

Open Source AI Agents on User-Owned Infra: K3s, MCP, and GPU Sharing in Practice - Miley Fu, OlaresOS
Tuesday August 11, 2026 14:55 - 15:25 KST
Most open-source AI demos stop at a chatbot, but real users want agents that can work with private files, call tools, and run close to their important data without sending everything to the public cloud.

This talk shows a practical reference architecture for private agent workflows on a single-node K3s-based personal cloud, including local model serving, private knowledge bases, app sandboxing, secure remote access, and multiple GPU allocation modes for competing AI workloads.

Using an open-source personal cloud OS as a case study, I will share what works, what breaks, and which design choices matter when you try to make local AI usable by developers, creators, and small teams rather than just homelab experts.
Speakers
avatar for Miley Fu

Miley Fu

DevRel, OlaresOS
Miley us the co-chair and keynote speaker for KubeCon+Open Source Summit and AI Dev China 2024. She works on WasmEdge runtime under Linux Foundation as the founding member for over 6 years. She talks at KubeCon, KCD Beijing+Shenzhen, CloudDay Italy, DevRelCon, Open Source Summit Japan... Read More →
Tuesday August 11, 2026 14:55 - 15:25 KST
Grand Ballroom 2-3

15:55 KST

MCP Adoption and Why OSPO Skills Matter - Ana Jiménez Santamaría, Linux Foundation (TODO Group and PyTorch Foundation)
Tuesday August 11, 2026 15:55 - 16:25 KST
As organizations explore the Model Context Protocol (MCP), many of the early conversations naturally happen around AI tooling, integrations, and experimentation. At the same time, MCP may also raise questions that are familiar to Open Source Program Offices (OSPOs) and related teams or professionals with open source management experience, including governance, contribution strategy, standards engagement, and cross-functional coordination

This talk reflects on what organizations might gain by bringing those perspectives into MCP discussions earlier, as well as into their broader Agentic AI strategy. Rather than treating MCP only as a technical AI topic, the session will explore it as an area where open source process knowledge may also add value. Attendees will leave with ideas for how OSPOs can contribute to MCP adoption in practical, collaborative, and organization-aware ways
Speakers
avatar for Ana Jiménez Santamaría

Ana Jiménez Santamaría

Project Manager, Linux Foundation (TODO Group and PyTorch Foundation)
Ana is a Sr. Project Manager at the Linux Foundation, where she supports global open source communities and drives strategic initiatives across the TODO Group and the PyTorch Foundation. She collaborates with CTOs, engineering teams, and business units to promote Open Source management... Read More →
Tuesday August 11, 2026 15:55 - 16:25 KST
Grand Ballroom 2-3

16:35 KST

Benchmarking Beyond OpenSearch: Multi-Engine Vector Search Performance With OSB - Mike Oviedo, AWS
Tuesday August 11, 2026 16:35 - 17:05 KST
As vector search becomes foundational to AI workloads, teams need reliable ways to evaluate engine performance before committing to one. We extended OpenSearch Benchmark (OSB) to support Milvus and Vespa alongside OpenSearch, making it possible to compare throughput, latency, and recall across engines using the same datasets and query patterns.

In this talk, we'll walk through how OSB's new engine-as-module architecture lets you benchmark any search engine with a single CLI command. We'll show how customizable workload parameters control everything from HNSW graph construction to bulk ingestion strategies, and demonstrate how to visualize comparative results in OpenSearch Dashboards, turning raw benchmark data into actionable performance insights.

Whether you're evaluating engines for a new project or optimizing an existing deployment, you'll leave with practical knowledge of how to run your own benchmarks and how to extend OSB to support additional engines by implementing five functions.
Speakers
avatar for Mike Oviedo

Mike Oviedo

Software Engineer, AWS
Michael Oviedo is a software engineer at AWS, working on performance and benchmarking tools for OpenSearch. He is a maintainer of the OpenSearch Benchmark project. Outside of work he enjoys playing golf and visiting national parks.
Tuesday August 11, 2026 16:35 - 17:05 KST
Grand Ballroom 2-3

17:15 KST

Personalisation and Specialisation of Search With OpenSearch Agentic Search - Cedric Pelvet & Aswath Srinivasan, AWS
Tuesday August 11, 2026 17:15 - 17:45 KST
Agentic search lets you ask in Natural Language and have OpenSearch plan and execute retrieval. The Query Planner re-writes Natural Language to OpenSearch DSL using SOTA LLMs. This works amazingly well but not without important limitations:

1/ The biggest factor is added latency for remote inference. eCommerce Search demands sub-100ms response times. Even the best results when slow lead to search abandonment.
2/ We'll demo overcoming latency by hosting SLMs locally within OpenSearch nodes using vLLM/Ollama. This is faster & cheaper, but SLMs suffer quality decline in query re-writes. We'll explore fine-tuning with domain-specific data—proving fine-tuned SLMs beat SOTA LLMs.
3/ Relevance remains generic even with Agentic Search. We'll show how to improve it using user and business contexts with hybrid search and reranking.

This hands-on talk covers:
1/ Search Latency with Remote Inference
2/ Locally hosting SLMs using vLLM/Ollama
3/ Agentic search with SLM Local Inference
4/ Improving relevance with user and business contexts
5/ Fine-tuning SLMs for domain-specific query re-writing via Axolotl/LLaMA-Factory
Speakers
avatar for Cedric Pelvet

Cedric Pelvet

Principal OpenSearch Specialist SA, Amazon Web Services
Cédric Pelvet is a Principal Specialist Solutions Architect at AWS, focusing on AI and near-realtime distributed systems for data like OpenSearch, Kafka and Flink.
avatar for Aswath Srinivasan

Aswath Srinivasan

Senior Search Engine Architect,, OpenSearch @ AWS
Aswath Srinivasan is a Senior Search Engine Architect at Amazon Web Services currently based in Munich, Germany. With over 18 years of experience in various search technologies, Aswath currently focuses on OpenSearch. He is a search and open-source enthusiast and helps customers and... Read More →
Tuesday August 11, 2026 17:15 - 17:45 KST
Grand Ballroom 2-3
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -