The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon China 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.
Please note: This schedule is automatically displayed in Hong Kong Standard Time (UTC+8:00). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Sign up or log in to add sessions to your schedule and sync them to your phone or calendar.
Since graduating from CNCF, Argo Workflows has seen widespread adoption across industries. But how does it work? What are its latest features? And how can it handle large-scale task orchestration effectively? This talk will answer these questions. The talk begins with an overview of Argo Workflows’ core principles and the latest community developments, including new features like scheduling strategies, dynamic templates, and backfill capabilities. It then dives into best practices for large-scale task orchestration, covering high-availability deployment, workflow partitioning, and more. A key focus is on storage systems, which are critical for efficient large-scale task execution. The talk will share insights on selecting the right file and object storage solutions, implementing read-write separation, and choosing an optimal caching system. These strategies are essential for building scalable, high-performance pipelines.
Yashi Su is a software engineer at Alibaba Cloud, focusing on Kubernetes Container Storage Interface (CSI) for object storage. She maintains OSSFS (a FUSE daemon for Alibaba Object Storage Service) used in Cloud-Native scenarios and researches how to improve the read-write performance... Read More →
ShuangKun Tian is a software engineer at Alibaba Cloud, specializing in Scheduling, Elasticity, Workflow Orchestration and Performance tuning. He is a maintainer of the Argo Community. He has extensive practical experience in MLOps, Data Processing, CI/CD, and Large-scale Storage... Read More →
This meeting is meant to orient you in the Kubernetes community.
Part 1: Presentation and Intro ● Welcome to Kubernetes! ● What is Kubernetes? ● Kubernetes Community Structure ● What does it mean to be a “Contributor”? ● How to Start Contributing ● Current Work Opportunities ● Contribution Pitfalls
Part 2: New contributors journey We will invite some new contributors in the community to share their fresh experience and tips to you. ● How did I get involved with Kubernetes? ● What is most important in participating in Kubernetes community journey? ● Some tip to participate in Kubernetes community ● How to submit a "polite" PR?
Paco is co-chair of KubeCon+CloudNativeCon China 2024, and a member of Kubernetes Steering Committee. Paco is a kubeadm maintainer and an active kubernetes contributor. He is the leader of the open-source team in DaoCloud. He organized KCD Chengdu 2022 and KCS China 2023, and... Read More →
Mengjiao Liu is a Software Engineer. She contributes to Kubernetes and serves as the WG Structured Logging Lead and SIG Instrumentation Reviewer, focusing on enhancing logging quality. Additionally, she actively participates in SIG Docs as a Chinese owner and English reviewer, working... Read More →
Karmada (Kubernetes Armada) is a Kubernetes management system that enables you to run your cloud-native applications across multiple Kubernetes clusters and clouds.
In this presentation, the maintainer of the Karmada project will share:
- A Brief introduction to Karmada, including what it is and why you need it. - Key features and real-world use cases - Overview of the community, including the governance and how it works - New features over the last year * Migration Rollback (with Zendesk, MoMo) *Lightway Stateful Application failover (with Bloomberg) * Manage HA Karmada instance by operator (with Bloomberg) * OverridePolicy support (with Longbridge) * Cluster Level Propagation Pause and Resume (with Zendesk, MoMo) * Karmadactl Enhancements (with Huawei) - Future Plan - QA
Senior Software Engineer(maintainer of Karmada project), Huawei
Hongcai Ren(@RainbowMango) is the CNCF Ambassador, who has been working on Kubernetes and other CNCF projects since 2019, and is the maintainer of the Kubernetes and Karmada projects.
Inference workloads are becoming increasingly prevalent and vital in Cloud Native world. However, it's not easy, one of the biggest challenges is large foundation model can not fit into a single node, like llama 3.1-405B or DeepSeek R1, which brings out the distributed inference with model parallelism, again, make serving inference workloads more complicated.
LeaderWorkerSet, aka. LWS, is a dedicated multi-host inference project aims to solve this problem, it's a project under the guidance of Kubernetes SIG-Apps and Serving Working Group. It offers a couple of features like dual-template for different types of Pods, fine-gained rolling update strategies, topology managements and all-or-nothing failure handlings.
In this session, we'll introduce the capacities of lws and showcase the practice from our adopters like nvidia, google, and we'll demonstrate the integration with the most popular inference engines, such as vLLM, SGLang.
Kante is a senior software engineer and an open source enthusiast from DaoCloud, his work is mostly around scheduling, resource management and LLM inference. He actively contributes to upstream Kubernetes as SIG-Scheduling Maintainer and helps in incubating several projects like Kueue... Read More →
Join us as we celebrate nearly a decade of Cilium, now the de-facto standard CNI for Kubernetes and a cornerstone of cloud native networking, observability, and security. This session provides updates on the latest Cilium release and showcases how its unified eBPF-powered stack is transforming Kubernetes environments by replacing fragmented toolchains with seamless, secure, scalable, and simplified solutions.
Hear about how Cilium is simplifying the cloud native stack and solidifying its role as the comprehensive networking and security solution for modern cloud native architectures from contributors and end users Bytedance and Isovalent.
Kaixi Fan is a Senior Linux Network Engineer at ByteDance, specializing in cloud computing networks and kernel network protocol stacks. With extensive experience in high-performance networking, he has deep expertise in areas such as eBPF, DPDK, and software-defined networking (SDN... Read More →
OpenTelemetry, one of the most active projects within the CNCF, has become the industry standard for observability. Join us for the official project update session at KubeCon+CloudNativeCon China. In this session, contributors from the OpenTelemetry community will share some of the latest project developments and milestones, including SDK/Instrumentation, profiling, Go compile-time instrumentation injection, and the OpenTelemetry Collector. Don't miss this opportunity to stay informed and contribute to the discussion on the exciting advancements within OpenTelemetry.
Jared Tan is a Sr. Software Engineer at DaoCloud responsible for the Observability Platform, which includes contributions to the OpenTelemetry project and with a passion for observability and helping users start their observability journey. He has participated in several well-known... Read More →
Zihao is a software engineer at Alibaba Cloud. Over the past few years, he has participated in several well-known open source projects, he is steering committee member of Spring Cloud Alibaba project, and is a triager for OpenTelemetry Java Instrumentation now.
Computing power requirements is glowing more and more fast. Especially with the development of AI large models (text, images, and videos), the complexity and diversity of computing power are constantly increasing. The traditional computing power management and shedule way no longer meet these challenges. Therefore, computing power supply is shifting from "single and intensive" model to a more flexible and efficient "diverse collaboration" model. So how to integrate computing resources of different architectures and maximize the utilization rate and performance of cluster resources efficiently has become the core challenge faced by enterprises. In this session we will talk about how to build a cloud native system with high efficient computing cluster management capabilities. Projects will be covered like kubernetes, prometheus, volcano, karmada etc.
In order to facilitate networking and business relationships at the event, you may choose to visit a third party’s booth or access sponsored content. You are never required to visit third party booths or to access sponsored content. When visiting a booth or participating in sponsored activities, the third party will receive some of your registration data. This data includes your first name, last name, title, company, address, email, standard demographics questions (i.e. job function, industry), and details about the sponsored content or resources you interacted with. If you choose to interact with a booth or access sponsored content, you are explicitly consenting to receipt and use of such data by the third-party recipients, which will be subject to their own privacy policies.
Imagine your cloud-native applications as a bustling city. To ensure everything runs smoothly, you need to test its resilience by introducing controlled chaos, like planned roadblocks, to spot and fix weaknesses before they cause real trouble.
Join the LitmusChaos team, the folks behind this CNCF Incubating project, as they share the latest and greatest in chaos engineering. They'll walk you through new features from recent updates, like better resilience testing, improved observability, and scalability tools, all designed to tackle the real-world problems developers and SREs face daily.
You'll also get the inside scoop on the project's growth, how the community is shaping its future, and a sneak peek at what's coming next to make chaos engineering easier and more effective.
Sayan Mondal is a Senior Software Engineer II at Harness, building their Chaos Engineering platform and helping them shape the customer experience market. He's the maintainer of a few open-source libraries and is also a maintainer and community manager of LitmusChaos (the Incubating... Read More →
In this session, KubeEdge project maintainers will provide an overview of KubeEdge's architecture and its industry-specific use cases. The session will begin with a brief introduction to edge computing and its growing importance in IoT and distributed systems. The maintainers will then delve into the core components and architecture of KubeEdge, demonstrating how it extends Kubernetes' capabilities to manage edge computing workloads efficiently. They will share success stories and insights from organizations that have deployed KubeEdge in various edge environments, such as smart cities, industrial IoT, edge AI, robotics, and retail, highlighting the tangible benefits and transformational possibilities. Additionally, the session will introduce the certified KubeEdge conformance test, hardware test, KubeEdge course and certification, discuss advancements in technology and community governance within the KubeEdge project, and share the latest updates on the project's graduation status.
Yue Bao serves as a software engineer of Huawei Cloud. She is now working 100% on open source, focusing on lightweight edge for KubeEdge. She is the maintainer of KubeEgde and also the tech leader of KubeEdge SIG Release and Node. Before that, Yue worked on Huawei Cloud Intelligent... Read More →
Hongbing Zhang is Chief Operating Officer of DaoCloud. He is a veteran in open source areas, he founded IBM China Linux team in 2011 and organized team to make significant contributions in Linux Kernel/openstack/hadoop projects. Now he is focusing on cloud native domain and leading... Read More →
I will share the progress of the Ingress-NGINX project in this topic, as well as our newly incubated project, Ingate. Ingate is a project we created to actively adopt the Gateway API, and we will explore the next steps in the Ingate project based on the successes and failures we've experienced in the Ingress-NGINX project, along with user demands for frequently used features.
CNCF Ambassador, Kubernetes Ingress-NGINX maintainer, Kong Inc.
Jintao Zhang is a Microsoft MVP, CNCF Ambassador, Apache PMC, and Kubernetes Ingress-NGINX maintainer, he is good at cloud-native technology and Azure technology stack.
Kubespray, recognized by Kubernetes' SIG Cluster Lifecycle, deploys production-ready Kubernetes clusters on bare metal, enhancing performance for AI applications with robust GPU support. This session covers Kubespray's fundamentals, key features, and updates.
As AI workloads like LLMs grow, scalable GPU clusters are essential. Engineers will share insights from deploying custom GPU clusters at scale with Kubespray, discussing challenges and best practices. Attendees will learn to integrate Kubernetes technologies like LWS, Kueue, Gateway API Inference Extension, DRA, and tensor parallelism to enhance AI workloads like RAG and LoRA, improving resource utilization and performance.
We'll share Kubespray's inventory source code to customize AI clusters and use Kubernetes operators to define infrastructure in private clouds, enabling efficient cluster scaling.
Rong is a software engineer at vivo developing platform services on top of Kubernetes, providing containerized infrastructure. Focus on the closed loop system of scheduling、gpu technology、network and cluster management.
Kay Yan is kubespray maintainer, containerd/nerdctl maintainer. He is the Principal Software Engineer in DaoCloud, and develop the DaoCloud Enterprise Kubernetes Platform since 2016.
Join this interactive session for a brief overview of the Cloud Native Computing Foundation (CNCF) Technical Oversight Committee (TOC), including recent initiatives and opportunities to get involved. Learn how the TOC is helping shape the next decade of cloud native technologies, and how you can get involved. Following the overview, we’ll open the floor to your questions—whether they’re technical, or about building leadership within CNCF. Initial seeding questions include:
What are some of the latest Cloud Native AI initiatives?
How can we encourage more CNCF and TAG contributions from Asian countries?
What are the possible paths to becoming a CNCF TOC member?
Technical Expert, Lead of Cloud Native Open Source, Huawei
Kevin Wang has been an outstanding contributor in the CNCF community since its beginning and is the leader of the cloud native open source team at Huawei. Kevin has contributed critical enhancements to Kubernetes, led the incubation of the KubeEdge, Volcano, Karmada projects in CNCF... Read More →
Lin is the Head of Open Source at Solo.io, and a CNCF TOC member and ambassador. She has worked on the Istio service mesh since the beginning of the project in 2017 and serves on the Istio Steering Committee and Technical Oversight Committee. Previously, she was a Senior Technical... Read More →
Chris Aniszczyk is an open source executive and engineer with a passion for building a better world through open collaboration. He's currently a CTO at the Linux Foundation focused on developer relations and running the Open Container Initiative (OCI) / Cloud Native Computing Foundation... Read More →