Loading…
10-11 June
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon China 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Hong Kong Standard Time (UTC+8:00)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Audience: English clear filter
Tuesday, June 10
 

09:00 HKT

Keynote: Introductory Remarks - Jim Zemlin, Executive Director, The Linux Foundation
Tuesday June 10, 2025 09:00 - 09:10 HKT
Speakers
avatar for Jim Zemlin

Jim Zemlin

Executive Director, The Linux Foundation
Zemlin’s career spans three of the largest technology trends to rise over the last decade: mobile computing, cloud computing and open source software. Today, as executive director of The Linux Foundation, he uses this experience to accelerate the adoption of Linux and support the... Read More →
Tuesday June 10, 2025 09:00 - 09:10 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions

09:12 HKT

Keynote: Community Opening Remarks - Chris Aniszczyk, CTO, Cloud Native Computing Foundation
Tuesday June 10, 2025 09:12 - 09:22 HKT
Speakers
avatar for Chris Aniszczyk

Chris Aniszczyk

CTO, CNCF
Chris Aniszczyk is an open source executive and engineer with a passion for building a better world through open collaboration. He's currently a CTO at the Linux Foundation focused on developer relations and running the Open Container Initiative (OCI) / Cloud Native Computing Foundation... Read More →
Tuesday June 10, 2025 09:12 - 09:22 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions

09:24 HKT

Keynote: Crossplane Is the Answer! but What Is the Question? - Amit Dsouza, Odyssey Cloud & Cortney Nickerson, Nirmata
Tuesday June 10, 2025 09:24 - 09:34 HKT
Why consider Crossplane when so many IaC tools exist—Terraform, Pulumi, CloudFormation, Config Connector, and KRO? What unique challenges does it solve, and is it always the right choice?
Join Cortney & Amit as they explore why Crossplane is gaining traction, not just as an IaC tool but as a Platform Engineering enabler. Learn how Crossplane extends the Kubernetes API to manage both infrastructure and applications declaratively, empowering platform teams.
Beyond provisioning, security and compliance are critical. Discover how the Crossplane + ArgoCD + Kyverno stack enables GitOps-driven automation, ensuring deployments align with organizational compliance and security policies.
Through real-world use cases, we’ll explore:
Where does Crossplane fit among IaC tools?
When is Crossplane NOT the right choice?
How can it enable scalable, self-service platforms?
How does it integrate with ArgoCD & Kyverno for GitOps and security?
Speakers
avatar for Amit DSouza

Amit DSouza

Co-founder, Odyssey Cloud
Amit Dsouza is an IT professional with over 13 years of experience in the industry. He is a co-founder of Odyssey Cloud, Australia. With experience in Fortune 500 companies & startups, he has worked in various locations including Australia, Singapore, & India. Amit specializes in... Read More →
avatar for Cortney Nickerson

Cortney Nickerson

Head of Community, Nirmata
Cortney is Head of Community at Nirmata. As a CNCF and Civo Ambassador, co-organizer of CNCF Bilbao Community, and speaker and organizing member of various KCD events, she is a recognized voice in the cloud native space. Initially, a non-techie, she turned techie as employee 7 at... Read More →
Tuesday June 10, 2025 09:24 - 09:34 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions, Platform Engineering

09:36 HKT

Sponsored Keynote: Towards Clouds of AI Clusters - Bill Ren, Huawei Chief Open Source Liaison Officer, Board member of CNCF
Tuesday June 10, 2025 09:36 - 09:41 HKT
AI is quickly becoming the most important workload in our clouds. However, AI is not like other cloud native workloads. Whereas before, clouds could manage elastic resources that easily and cheaply scaled out, AI workloads do not readily support this. AI hardware infrastructure is moving towards large clusters of processors, is not readily scaled out, is not readily available on-demand, and is much more expensive. This requires significant changes to how we build and
manage our clouds, from the operating system up to our cloud native infrastructure. This talk will highlight how this evolution towards clouds of AI clusters is happening through projects such as Linux, Volcano, and Karmada.


Speakers
avatar for Bill Ren

Bill Ren

Chief Open Source Liaison Officer,Board member of CNCF, Huawei
Bill Ren holds an EMBA and Master Degree from Peking University, and a CS Bachelor Degree from Shanghai Jiaotong University. Since Joining Huawei in 2000, Bill served as an Intelligent Network Research and Development Engineer, Product Manager and Architect of India Branch, General... Read More →
Tuesday June 10, 2025 09:36 - 09:41 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions

09:43 HKT

Keynote: An Optimized Linux Stack for GenAI Workloads - Michael Yuan, WasmEdge
Tuesday June 10, 2025 09:43 - 09:53 HKT
Running GenAI workloads on Linux is a challenge due to the complexity of AI runtime toolchains and dependencies of heterogeneous GPU devices. The problem is especially acute in containers where the host and guest OSes must have compatible versions of GPU drivers and application software stacks.

CNCF’s Flatcar Linux project aims to simplify containerized Linux deployment. It has an immutable system that can be optimized for both host and guest systems. Furthermore, it supports cross-platform and cross-GPU Wasm workloads. As Wasm runtimes such as WasmEdge and LlamaEdge support a wide range of AI models, Flatcar Linux has become a good candidate for running GenAI workloads in containers.

In this talk, we will cover the basics of Flatcar and its support for Wasm runtimes. We will also discuss WasmEdge’s support for portable AI models and inference applications. Finally, we will give a demo of a complete GenAI app running in Flatcar across GPUs and CPUs.
Speakers
avatar for Michael Yuan

Michael Yuan

Founder, Second State
Dr. Michael Yuan is a maintainer of WasmEdge Runtime (a project under CNCF) and a co-founder of Second State. He is the author of 5 books on software engineering published by Addison-Wesley, Prentice-Hall, and O'Reilly. Michael is a long-time open-source developer and contributor... Read More →
Tuesday June 10, 2025 09:43 - 09:53 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions, Emerging + Advanced
  • Content Experience Level Any
  • Presentation Language English

09:55 HKT

Keynote: Scaling Model Training with Volcano: iFlytek’s Kubernetes Breakthrough - Dong Jiang, Platform Architect, iFlytek & Xuzheng Chang, Software Engineer, Huawei Cloud
Tuesday June 10, 2025 09:55 - 10:00 HKT
Training massive AI models at scale is tough—but doing it efficiently in Kubernetes is even tougher. In this keynote, we’ll share how iFlytek tackled key challenges in large-scale model training, including low GPU utilization, fragile workflows, and resource contention across teams. By leveraging Volcano, they boosted GPU usage by over 40%, and cut failure recovery time by 70%. This talk offers a quick but powerful look at how intelligent scheduling and orchestration can unlock performance, reliability, and fairness in multi-tenant AI platforms.
Speakers
avatar for Xuzheng Chang

Xuzheng Chang

Software Engineer, Huawei Cloud
Xuzheng Chang is a maintainer of the Volcano community, with in-depth research and practical experience in the fields of batch computing and cloud-native AI scheduling. Xuzheng has spearheaded several significant features within the Volcano community. Actively contributing to open-source... Read More →
avatar for Dong Jiang

Dong Jiang

Platform Architect, iFlytek
Tuesday June 10, 2025 09:55 - 10:00 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions

10:02 HKT

Keynote: The Future of AI in Hong Kong: From Local Innovation to Global Influence - Prof. Yike Guo, HKUST Provost and HKGAI Director & Roby Chen, CEO and Founder, DaoCloud
Tuesday June 10, 2025 10:02 - 10:12 HKT
The release of HKGAI V1 marks a new chapter in the development of AI in Hong Kong. Leveraging the strengths of both local connectivity and global outreach, the HKGAI team embraces the open-source community to tackle challenges ranging from optimizing high-performance computing clusters to exploring cutting-edge AI models. Looking ahead, Hong Kong aims to further integrate resources from mainland China and the international community, deepening technological innovation and application expansion, and contributing a "Hong Kong Solution" to global AI standards and use cases.
Speakers
avatar for Yike Guo

Yike Guo

HKUST Provost and HKGAI Director
Professor Guo Yike assumed office as the Provost of the Hong Kong University of Science and Technology (HKUST) on December 1, 2022. He is concurrently a Chair Professor in the Department of Computer Science and Engineering and also the Director of Hong Kong Generative AI Research... Read More →
avatar for Roby Chen

Roby Chen

CEO, DaoCloud
Roby Chen, Founder and CEO of DaoCloud, Master of Computer Science from Fudan University, CNCF Ambassador, has a deep understanding of cloud-native business models and technologies. Roby is an evangelist of open source cloud computing technology, gaining valuable experience in building... Read More →
Tuesday June 10, 2025 10:02 - 10:12 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions

10:13 HKT

Keynote: Closing Remarks
Tuesday June 10, 2025 10:13 - 10:15 HKT
Tuesday June 10, 2025 10:13 - 10:15 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions, Platform Engineering

11:45 HKT

Defining a Specification for AI/ML Artifacts - Fog Dong, BentoML; Gorkem Ercan, Jozu; Peng Tao & Chlins Zhang, Ant Group; Xudong Wang, Paypal
Tuesday June 10, 2025 11:45 - 12:15 HKT
AI has become a prominent figure in the cloud native ecosystem and there continues to be massive adoption in this emerging field. As frameworks and approaches are introduced, a pattern has emerged which threatens the ability to manage at scale: each implementation introduces their own format, runtime, and different ways of working, fragmenting the ecosystem. On other hand, open standards are the backbone of cohesive and scalable ecosystems.

This panel discussion seeks to explore the importance of defining standards within the CNCF ecosystem, particularly focusing on AI/ML artifacts. Beyond the advantages of the standard in facilitating integration with existing cloud native tools, this conversation will delve into how the standards can serve as a foundation for innovation. Join us to understand how standardization with innovative approaches can advance the cloud native AI landscape.
Speakers
avatar for Chlins Zhang

Chlins Zhang

Software Engineer, Ant Group
Chenyu Zhang is a software engineer at Ant Group, currently mainly responsible for the development and maintenance of project harbor, and also has some experience in devops and cloud native related technology stacks.
avatar for Peng Tao

Peng Tao

Staff Engineer, Ant Group
Kata Containers architecture committee member, Nydus maintainer, and Linux kernel developer.
avatar for Fog Dong

Fog Dong

Senior Software Engineer, BentoML
董天欣目前在 BentoML担任资深工程师,同时,她也是 KubeVela 的核心维护者以及 CNCF 大使。她致力于开源社区的建设,并不遗余力地为推动开源项目的发展而努力,尤其是在云原生 DevOps 领域。目前,她在 BentoML... Read More →
avatar for Gorkem Ercan

Gorkem Ercan

CTO, Jozu
Gorkem Ercan is a co-founder and CTO of Jozu. Gorkem has experience working and leading teams with various technologies ranging from building IDEs, to building mobile phones, and CI/CD systems. He is an avid contributor and supporter of open source and previously served at the Eclipse... Read More →
Tuesday June 10, 2025 11:45 - 12:15 HKT
Level 19 | Crystal Court I
  AI + ML

13:45 HKT

Antipatterns in Observability: Lessons Learned and How OpenTelemetry Solves Them - Steve Flanders, Splunk
Tuesday June 10, 2025 13:45 - 14:15 HKT
Observability is essential, but common antipatterns like over-collecting data, siloed tools, and poorly instrumented code can derail your efforts. This session uncovers the most frequent observability pitfalls and shows how OpenTelemetry addresses these challenges with its standardized approach. From eliminating vendor lock-in to streamlining telemetry pipelines, you’ll gain insights into building a more effective and sustainable observability strategy. Real-world examples will highlight how teams have successfully overcome these antipatterns, empowering you to avoid costly mistakes and maximize OpenTelemetry’s potential.
Speakers
avatar for Steve Flanders

Steve Flanders

Senior Director of Engineering, Splunk
Steve Flanders is a Senior Director of Engineering at Splunk responsible for the Observability Platform team, which includes contributions to the OpenTelemetry project. Previously, he was the Head of Product and Experience at Omnition, which Splunk acquired. Prior to Omnition, he... Read More →
Tuesday June 10, 2025 13:45 - 14:15 HKT
Level 16 | Grand Ballroom I
  Observability

13:45 HKT

From Bottleneck To Breakthrough: Conquering Applications Startup Peaks in Kubernetes - Hexi Guo, Alibaba Cloud & Rentian Zhou & Zhuoqi Liu, CloudPilot AI
Tuesday June 10, 2025 13:45 - 14:15 HKT
A variety of applications in Kubernetes typically require higher memory or compute resources during startup—such as Java, .NET, and Node.js applications, as well as those utilizing large data processing frameworks or machine learning models—due to the need to load substantial dependencies and perform complex initialization tasks. To prevent startup failures from resource contention, these applications typically have their resource requests set based on peak startup demands. However, this often leads to resource waste after startup is complete.
To address this challenge, this session presents a queue-based approach using Karpenter. This method allows applications set resource requests based on typical usage instead of peak startup needs. It temporarily spreads applications across multiple smaller nodes during startup, preventing single-node overload. After startup, it smoothly consolidates them onto fewer but larger nodes to optimize resource usage while maintaining service stability.
Speakers
avatar for Zhuoqi Liu

Zhuoqi Liu

Senior Software Engineer, CloudPilot AI Inc
avatar for Hexi Guo

Hexi Guo

Software Engineer, Alibaba Cloud
Alibaba Cloud technical expert, maintainer of Kubernetes elastic scaling component cluster-autoscaler, initiator of open source elastic component kubernetes-cronhpa-controller, responsible for the design and implementation of elastic solutions for Alibaba Cloud industry customers... Read More →
avatar for Rentian Zhou

Rentian Zhou

Software Engineer, CloudPilot AI
Rentian, a Software Engineer at CloudPilot AI, focuses on the Karpenter open-source project, contributing to karpenter-provider-alibabacloud and -aws. He has also contributed to various projects and serves as a Karmada Reviewer, the Member of the Volcano and Hwmaeistor communities... Read More →
Tuesday June 10, 2025 13:45 - 14:15 HKT
Level 19 | Crystal Court II
  Operations + Performance
  • Content Experience Level Any
  • Presentation Language English

14:30 HKT

Advancing Observability With Compile-Time Auto-Instrumentation in Golang - Liu Ziming, Alibaba Cloud & Przemek Delewski, Quesma
Tuesday June 10, 2025 14:30 - 15:00 HKT
Observability for cloud-native software applications requires efficient and reliable methods to gain insights into distributed systems. This talk will explore various instrumentation approaches for Golang, focusing on the concept of compile-time auto-instrumentation with OpenTelemetry. We will unveil implementation details of compile-time auto-instrumentation, highlighting the revolutionary features including flexible custom plugin capabilities, enhanced context propagation, trace-log correlation, and etc. The talk will cover examples of using compile-time auto instrumentation, lessons learned from the practice and scenarios that benefit from such an implementation. The audience will take away a solid understanding of how compile-time auto instrumentation works and why it presents an efficient and more performant solution for achieving observability.
Speakers
avatar for Przemek Delewski

Przemek Delewski

Principal Architect, Quesma
Przemek is a founding engineer at Quesma, working in the data transformation space and responsible for architectural direction. An observability veteran with over 15 years of experience at Dynatrace and Sumo Logic. OpenTelemetry Maintainer. Designs programming languages for fun
avatar for Liu Ziming

Liu Ziming

Engineer, Alibaba Cloud
Alibaba R&D Engineer
Tuesday June 10, 2025 14:30 - 15:00 HKT
Level 16 | Grand Ballroom I
  Observability

15:37 HKT

⚡ Lightning Talk: Advanced GPU-Orchestrated Workflows and HPC Integrations on K8s for Distributed AI/ML at Scale - Brandon Kang, Akamai Technologies
Tuesday June 10, 2025 15:37 - 15:42 HKT
As AI/ML workloads continue to scale in complexity, developers and platform engineers are pushing Kubernetes beyond typical MLOps boundaries.

This talk dives into strategies for orchestrating GPU-accelerated training and inference across large-scale clusters -integrating HPC principles, operator-based scheduling, and novel debugging workflows.

Attendees will learn how to implement fine-grained GPU partitioning, harness ephemeral containers to probe and adjust multi-node training in real time, and adopt eBPF-driven instrumentation for low-overhead kernel-level performance insights. We’ll explore cutting-edge scheduling optimizations—like reinforcement-learning approaches and HPC-inspired batch-queuing orchestration on Kubernetes that dynamically respond to heterogeneous job demands.

Real-world case studies will highlight HPC integration scenarios (RDMA, GPU Direct) for data-parallel workloads and complex training frameworks such as Horovod, Ray, and Spark on Kubernetes.
Speakers
avatar for Brandon Kang

Brandon Kang

Principal Technical Solutions Architect, Akamai Technologies
Brandon Kang is a Principal Technical Solutions Architect at Akamai Technologies, specializing in cloud-native projects across Asia as a compute specialist.Before joining Akamai, he served as a Lead Software Engineer at Samsung, a Senior Program Manager at Microsoft, and a Service... Read More →
Tuesday June 10, 2025 15:37 - 15:42 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Application Development

15:44 HKT

⚡ Lightning Talk: AI-Powered Kubernetes Diagnostics With K8sGPT - Kay Yan, DaoCloud
Tuesday June 10, 2025 15:44 - 15:49 HKT
In this Lightning Talk, we’ll dive into K8sGPT, a CNCF sandbox project that uses AI to enhance Kubernetes management. K8sGPT leverages LLMs to diagnose cluster issues, offering root cause analysis and solutions in simple terms. It encodes SRE expertise into analyzers, extracting key insights and enriching them with AI-powered explanations.
Key highlights:
- Core Features: Learn to use the CLI and K8sGPT Operator for cluster error analysis and contextualized insights.
- AI Integration & Security: Explore integration with AI models like OpenAI, Azure, and Ollama, with data anonymization for security.
- Real-world Demos: See how K8sGPT simplifies Kubernetes troubleshooting.
- Enterprise Strategies: Discover techniques like LoRA and RAG to tailor K8sGPT for specific environments.
Whether you're new to Kubernetes or an expert, K8sGPT can streamline cluster management, reduce troubleshooting time, and boost efficiency.
Speakers
avatar for Kay Yan

Kay Yan

Principal Software Engineer, DaoCloud
Kay Yan is kubespray maintainer, containerd/nerdctl maintainer. He is the Principal Software Engineer in DaoCloud, and develop the DaoCloud Enterprise Kubernetes Platform since 2016.
Tuesday June 10, 2025 15:44 - 15:49 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Operations + Performance

16:05 HKT

⚡ Lightning Talk: Disaster Recovery - How IaCaC and Kubernetes Enables Cost Efficiency and Fast Recovery - Sandy Wang, KPMG Australia
Tuesday June 10, 2025 16:05 - 16:10 HKT
Tech startup in early stage normally aim low running cost on infrastructure spend but fast development and delivery. When there are a first few clients onboard, disaster recovery plan is a must have. When DR is required and an agreed RTO is 6 hours for example, how to not only remain low running cost but also to meet agreed RTO and SLA, our DR plan and implementation is a success to share with the audience. We onboarded container orchestration platform Kubernetes, DevOps best practices, for example Infrastructure-and-Configuration-as-Code and Pipeline-as-Code. Our DR implementation only spends a minimum cost on always-on resources. When a DR incident happens, automated pipelines will bring up on-demand resources that include a Kubernetes cluster, and geo-recover database and storage, then deploy the latest applications into kubernetes cluster, production DR can be live within 2 hours.
Speakers
avatar for Pei (Sandy) Wang

Pei (Sandy) Wang

Senior DevSecOps Engineer, KPMG Australia
As a Senior DevSecOps Engineer at KPMG Australia, I have been leading the cloud operations and security for Origins, a blockchain-based SaaS solution for supply chain traceability, since May 2022. I have brought the best practices of DevSecOps into day-to-day development and delivery... Read More →
Tuesday June 10, 2025 16:05 - 16:10 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Operations + Performance

16:12 HKT

⚡ Lightning Talk: Empowering Sustainable Living With ORES: A Cloud Native Approach To Software-Defined Home Energy Net - Chris Xie, Futurewei & Karl Xiofeng Yang, DEGCent
Tuesday June 10, 2025 16:12 - 16:17 HKT
Discover how the LF Energy working group is driving innovation in sustainable living with the Open Renewable Energy Systems (ORES) project. This session will explore how ORES leverages cloud-native technologies to build an open architecture, open standards, and APIs for software-defined home energy networks. By embracing Kubernetes and other cloud-native principles, ORES enables seamless integration of renewable energy sources, energy storage, and smart devices for a future-proof, scalable, and sustainable energy ecosystem. Learn how ORES promotes collaboration, interoperability, and innovation to shape the next generation of energy solutions in the cloud-native era.
Speakers
avatar for Karl Xiofeng Yang

Karl Xiofeng Yang

CEO, DEGCent
20+ years' embedded software engineer background.
avatar for Chris Xie

Chris Xie

Head of Open Source Strategy, Futurewei
Chris Xie, Head of Open Source Strategy at Futurewei, is a prominent advocate for global open source collaboration. With a background that includes roles at both Fortune 500 companies and startups, he brings a unique combination of technical and strategic business expertise. Recently... Read More →
Tuesday June 10, 2025 16:12 - 16:17 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Emerging + Advanced
  • Content Experience Level Any
  • Presentation Language English

16:15 HKT

Introducing AIBrix: Cost-Effective and Scalable Kubernetes Control Plane for VLLM - Jiaxin Shan & Liguang Xie, ByteDance
Tuesday June 10, 2025 16:15 - 16:45 HKT
Managing large-scale LLM inference workloads on Kubernetes requires more than just high-performance inference engines like vLLM. It demands a comprehensive control plane that integrates deeply with engines while addressing the complexities of large-scale operations. This need inspired the creation of AIBrix, a Kubernetes-native control plane designed to scale LLM inference with modularity, flexibility, and cutting-edge algorithms.

AIBrix introduces a pluggable architecture with components for LLM specific autoscaling, high-density lora management, distributed KV cache, heterogenous serving, model loading etc. AIBrix emphasizes deep co-design with inference engines, enabling advanced features and optimizations. This talk will demonstrate AIBrix in action, showcasing its ability to improve scalability and optimize resource utilization. Additionally, we will present detailed benchmarks to evaluate the performance of these components, providing actionable insights for practitioners.
Speakers
avatar for Jiaxin

Jiaxin

Software Engineer, Bytedance
Jiaxin works at ByteDance Infrastructure Lab, focusing on serverless and AI infrastructure. He is also a co-chair of Kubernetes WG-Serving, Jiaxin drives innovations and contributes to the future of scalable AI systems.
avatar for Liguang Xie .

Liguang Xie .

Director of Engineering, ByteDance
Liguang Xie is an Engineering Lead at ByteDance’s Compute Infrastructure Team, leading next-gen serverless infrastructure design and overseeing open-source, research, and engineering efforts. He has extensive experience in large-scale distributed systems, AI/ML platforms, and LLM/GNN... Read More →
Tuesday June 10, 2025 16:15 - 16:45 HKT
Level 19 | Crystal Court I
  AI + ML

16:15 HKT

Guardians of the Gateway: Keeping Chaos Out of Your Cloud Highway - Sayan Mondal, Harness & Jintao Zhang, Kong Inc.
Tuesday June 10, 2025 16:15 - 16:45 HKT
Imagine an API gateway standing tall as the guardian of your cloud-native applications - directing traffic, enforcing policies, and ensuring everything runs smoothly. The Kong Gateway Operator orchestrates the control and data planes in Kubernetes, ensuring this process stays on track. But what happens when things start to wobble? A misstep here, a failure there and suddenly, chaos!

In this session, we’ll dive into the twists and turns of API gateway resilience. Think of it as an adventure where the operator faces unexpected disruptions, configuration hiccups, control plane mysteries, and unexpected traffic surges. We’ll explore what happens under the hood, how the gateway responds, and what we can learn from its behavior.

By the end, you’ll walk away with a deeper understanding of how to prepare your gateways for the unexpected and turn "uh-oh" moments into "we've got this" wins.
Speakers
avatar for Jintao Zhang

Jintao Zhang

CNCF Ambassador, Kubernetes Ingress-NGINX maintainer, Kong Inc.
Jintao Zhang is a Microsoft MVP, CNCF Ambassador, Apache PMC, and Kubernetes Ingress-NGINX maintainer, he is good at cloud-native technology and Azure technology stack.
avatar for Sayan Mondal

Sayan Mondal

Senior Software Engineer II, Harness
Sayan Mondal is a Senior Software Engineer II at Harness, building their Chaos Engineering platform and helping them shape the customer experience market. He's the maintainer of a few open-source libraries and is also a maintainer and community manager of LitmusChaos (the Incubating... Read More →
Tuesday June 10, 2025 16:15 - 16:45 HKT
Level 19 | Crystal Court II
  Connectivity
  • Content Experience Level Any
  • Presentation Language English

16:19 HKT

⚡ Lightning Talk: Dynamic GPU Fraction and Sharing With Cloud Native Principle - Tiejun Chen, Individual Contributor
Tuesday June 10, 2025 16:19 - 16:24 HKT
As we see, organizations are investing heavily in bringing AI accelerators into their data centers or using them on the public cloud but continue to struggle with the cost-effective and efficient management of these critical resources. There are some existing approaches to address them but heavy and inflexible. Here, we'd like to take this chance to review if-how we can address the challenges of expensive and limited machine learning compute resources like GPU and identifies solutions for GPU fractional optimization with our technical PoC - GPU.x by transparent backend Python hooker within ML upstream frameworks running Kubernetes. It's lightweight, easy and flexible without any code changes to your AI applications towards cloud native.
Speakers
avatar for Tiejun Chen

Tiejun Chen

Sr. Technical Lead, Individual Contributor
Tiejun Chen was Sr. technical leader. He ever worked at several tech companies such as VMware, Intel, Wind River Systems and so on, involved in - cloud native, edge computing, ML/AI, WebAssembly, etc. He ever made many presentations at AI.Dev NA 2023, kubecon China 2021 & 2024, Kube... Read More →
Tuesday June 10, 2025 16:19 - 16:24 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Emerging + Advanced

16:33 HKT

⚡ Lightning Talk: Supercharge Agentic AI Apps: A DevEx-Driven Approach To Cloud Native Scaffolding - Daniel Oh, Red Hat
Tuesday June 10, 2025 16:33 - 16:38 HKT
Agentic AI is revolutionizing how we create intelligent agents that can interact with the real world. However, building and deploying these systems often involves significant complexity and time investment. This demo-driven session introduces a cloud-native scaffolding approach, leveraging software templates to streamline and simplify the development of agentic AI projects. This results in a more efficient and developer-friendly experience. Through live demonstrations, attendees will see firsthand how this innovative scaffolding framework accelerates the development lifecycle of agentic AI applications. It provides automated code generation and pre-configured infrastructure. Seamless integration with popular AI libraries reduces overhead and complexity. By the end of the session, participants will have a clear understanding of how to adopt cloud-native scaffolding to revolutionize their development process and gain practical skills to drive innovation in their projects.
Speakers
avatar for Daniel Oh

Daniel Oh

Senior Principal Developer Advocate, Red Hat
Daniel Oh is a Java Champion and Senior Principal Developer Advocate at Red Hat to evangelize developers for building cloud-native apps and serverless ob Kubernetes ecosystems. He's also contributing to various cloud open-source projects and ecosystems as a CNCF ambassador for accelerating... Read More →
Tuesday June 10, 2025 16:33 - 16:38 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, AI + ML
  • Content Experience Level Any
  • Presentation Language English

16:54 HKT

⚡ Lightning Talk: Kata Confidential Containers Meet Persistent Storage: Overcoming CSI Driver Challenges - Andy Zhang & Archana Choudhary, Microsoft
Tuesday June 10, 2025 16:54 - 16:59 HKT
Kata Confidential Containers (CoCo) is a technology that provides hardware-based isolation for containerized workloads. It’s built on top of the Kata Containers project, which uses lightweight VMs to provide container isolation. It has the ability to disable file system sharing between host nodes and pods, which helps to reduce attack surfaces. However, such protection ability limits usage of Persistent Volumes. During this session, we will provide an introduction to Kata Confidential Containers and discuss the typical volume mount workflow of CSI drivers. We will cover the challenges that arise when supporting Kata CoCo in CSI drivers. We will explore the solutions we have developed to overcome these challenges and support Kata CoCo in our open source Azure File CSI driver. By the end of this session, you will have a comprehensive understanding of Kata confidential containers and be able to use them with persistent volumes including all the necessary details.
Speakers
avatar for Archana Choudhary

Archana Choudhary

Ms, Microsoft
A software engineer who has been exploring cloud-native technologies, particularly focusing on confidential containers over the past several months.
avatar for Andy Zhang (OSTC)

Andy Zhang (OSTC)

Principal Software Engineer, Microsoft
Andy Zhang is the storage lead in Azure Kubernetes Service team at Microsoft, maintainer of multiple Kubernetes projects, including Windows csi-proxy project, Azure CSI drivers, SMB, NFS, iSCSI CSI drivers, etc. Andy focuses on improving the experience of using storage in Kuberne... Read More →
Tuesday June 10, 2025 16:54 - 16:59 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Data Processing + Storage

17:01 HKT

⚡ Lightning Talk: WASM Vs Docker: Partners, Not Rivals - Pradumna V Saraf, Independent
Tuesday June 10, 2025 17:01 - 17:06 HKT
The rise of WebAssembly (WASM) has sparked comparisons with Docker which often leads to questions and confusion: Are WASM and Docker competing technologies?

In this talk, we will see how this is far from the truth. On one side, Docker revolutionised how we bundle and deploy applications, offering unparalleled portability and simplifying workflows across environments. On the other hand, WASM brings speed, security, and efficiency, enabling the execution of code written in languages like C, C++, and Rust almost at native speed, performance, and rapid startup time even in the browser.

We will explore how these two technologies bring the best of both worlds and help developers achieve portability, efficiency, security, and flexibility. We will also look at how Docker is actively working to make WASM mainstream by allowing WASM container images to be hosted on DockerHub and run WASM containers alongside traditional Linux and Windows containers.
Speakers
avatar for Pradumna Saraf

Pradumna Saraf

Open Source Developer, Independent
Pradumna is a Developer Advocate, Docker Captain, and a DevOps and Go Developer. He is passionate about Open Source and has mentored hundreds of people to break into the ecosystem. He also creates content on X (formerly Twitter) and LinkedIn, educating others about Open Source and... Read More →
Tuesday June 10, 2025 17:01 - 17:06 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Emerging + Advanced
  • Content Experience Level Any
  • Presentation Language English

17:08 HKT

⚡ Lightning Talk: Scaling AI With Wasm and Edge Computing - Miley Fu, WasmEdge
Tuesday June 10, 2025 17:08 - 17:12 HKT
What does the future of AI look like when we push the boundaries of cloud-based models and take it to the edge? In this talk, we’ll explore how Wasm and edge computing power AI deployment by providing developers with a fast, lightweight, and secure framework for running machine learning models across devices.

We’ll focus on how Wasm enables AI models to run efficiently on edge devices like NVIDIA GPUs, Mac, etc, driving LLM agents that require low latency and high throughput. This session will demonstrate the scalability of Wasm when integrated into distributed systems for AI processing, showing how the combination of edge computing and Wasm allows for faster, responsive AI applications that don’t rely on centralized cloud resources.

We’ll showcase real life use cases such as AI streamers commenting in real time, video translation agents deployment. Developers will walk away with an understanding of how to combine Wasm with edge infra to build and deploy AI apps that scale seamlessly
Speakers
avatar for Miley Fu

Miley Fu

CNCF Ambassador, Founding member at WasmEdge, WasmEdge
Miley is a Dev Advocate who build & contribute to open source. She is the co-chair and keynote speaker for KubeCon+Open Source Summit and AI Dev China 2024. With 6 years of experience working on WasmEdge runtime in CNCF sandbox as the founding member, she talks at KubeCon, KCD Shenzhen... Read More →
Tuesday June 10, 2025 17:08 - 17:12 HKT
Level 16 | Grand Ballroom I
  ⚡ Lightning Talks, Emerging + Advanced
 
Wednesday, June 11
 

09:00 HKT

Keynote: Welcome Back + Opening Remarks - Keith Chan, Director of Strategic Planning, The Linux Foundation APAC
Wednesday June 11, 2025 09:00 - 09:10 HKT
Speakers
avatar for Keith Chan

Keith Chan

Director of Strategic Planning, The Linux Foundation APAC
Wednesday June 11, 2025 09:00 - 09:10 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions

09:24 HKT

Keynote: Key Cloud Native Technologies in its Next Decade - Lin Sun, Head of Open Source, Solo.io
Wednesday June 11, 2025 09:24 - 09:34 HKT
When we started CNCF in 2015 to help advance container technology, Kubernetes was the seeding technology to provide a de facto container orchestration platform for all cloud native applications. Almost a decade later, the community has exploded with 200+ open source projects building on top of cloud native technologies. Looking ahead, what challenges will we have in the next decade? What gaps remain for users and contributors? And how do we evolve to meet the demands of an increasingly complex and connected world?

Let us review some of the key CNCF projects today and lay out some possible avenues for where cloud native is going for the next decade, AI, agentic network, sustainability and beyond.

Speakers
avatar for Lin Sun

Lin Sun

Head of Open Source & CNCF TOC, Solo.io
Lin is the Head of Open Source at Solo.io, and a CNCF TOC member and ambassador. She has worked on the Istio service mesh since the beginning of the project in 2017 and serves on the Istio Steering Committee and Technical Oversight Committee. Previously, she was a Senior Technical... Read More →
Wednesday June 11, 2025 09:24 - 09:34 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions
  • Content Experience Level Any
  • Presentation Language English

10:10 HKT

Keynote: Closing Remarks
Wednesday June 11, 2025 10:10 - 10:15 HKT
Wednesday June 11, 2025 10:10 - 10:15 HKT
Level 16 | Grand Ballroom I
  Keynote Sessions, Platform Engineering

11:00 HKT

Unified Observability in GRPC: Metrics and Tracing Using OpenTelemetry Plugin - Purnesh Dixit, Google
Wednesday June 11, 2025 11:00 - 11:30 HKT
gRPC’s performance advantages hinge on minimizing latency, but its binary protocol and streaming capabilities make debugging and monitoring inherently opaque. While distributed tracing identifies bottlenecks, metrics like error rates and throughput are critical for holistic insights. Yet, manual instrumentation for these signals in gRPC is complex, error-prone, and lacks standardization.

In this talk, Purnesh Dixit from the gRPC team unveils the new OpenTelemetry plugin for gRPC, developed by the gRPC team at Google, which provides unified metrics and tracing out-of-the-box to monitor retries, diagnose streaming bottlenecks, and optimize performance without invasive code changes.
1) Client-per-call: Track overall RPC lifecycle (e.g., grpc.client.call.duration).

2) Client-per-call-attempt: Analyze individual retries/hedges (e.g., grpc.client.attempt.duration).

3) Server-instruments: Measure concurrency, request queuing, and stream lifetimes (e.g., grpc.server.call.started).
Speakers
avatar for Purnesh Dixit

Purnesh Dixit

Purnesh Dixit (gRPC Team, Google), Google
Purnesh is a software engineer on the gRPC team at Google. He is a contributor to the OpenTelemetry support in gRPC-go.
Wednesday June 11, 2025 11:00 - 11:30 HKT
Level 16 | Grand Ballroom I
  Observability

11:00 HKT

Resilient Multiregion Global Control Planes With Crossplane and K8gb - Yury Tsarev & Steven Borrelli, Upbound
Wednesday June 11, 2025 11:00 - 11:30 HKT
Ensuring resilience in control planes is critical for organizations managing infrastructure and applications across multiple regions with Kubernetes. This talk presents a reference architecture for creating a Crossplane-based Global Control Plane, enhanced with k8gb for DNS-based failover and leveraging an Active/Passive setup.
We’ll explore how Crossplane’s declarative infrastructure provisioning integrates with k8gb to build robust, scalable, and resilient multicluster environments. Key takeaways include:

- Architecting resilient multiregion control planes with Active/Passive roles
- Demonstrating failover mechanisms where the Passive control plane transitions to Active during failures
- Strategies for optimizing failover times while maintaining availability

This session will guide attendees through proven methods and real-world challenges of building resilient Global Control Planes, empowering them to manage critical workloads across geographically distributed regions confidently.
Speakers
avatar for Steven Borrelli

Steven Borrelli

Principal Soutions Architect, Upbound
Steven is a Principal Solutions Architect for Upbound, where he helps customers adopt Crossplane.
avatar for Yury Tsarev

Yury Tsarev

Principal Solutions Architect, Upbound
Yury is an experienced software engineer who strongly focuses on open-source, software quality and distributed systems. As the creator of k8gb (https://www.k8gb.io) and active contributor to the Crossplane ecosystem, he frequently speaks at conferences covering topics such as Control... Read More →
Wednesday June 11, 2025 11:00 - 11:30 HKT
Level 19 | Crystal Court I
  Operations + Performance

11:00 HKT

Peer Group Mentoring
Wednesday June 11, 2025 11:00 - 12:00 HKT
Peer Group Mentoring allows participants to meet with experienced open source veterans across many CNCF projects. Mentees are paired with 2 – 10 other people in a pod-like setting to explore technical, community, and career questions together.

Sign-up to be a Mentee

Sign-up to be a Mentor
Wednesday June 11, 2025 11:00 - 12:00 HKT
Level 20 | Salon 4
  Inclusion + Accessibility
  • Content Experience Level Any
  • Presentation Language English

11:45 HKT

How Bloomberg Creates a Resilient Data Analytics Platform Using Karmada - Michas Szacillo & Ilan Filonenko, Bloomberg
Wednesday June 11, 2025 11:45 - 12:15 HKT
Bloomberg’s Data Analytics Platform Engineering team supports a wide-range of real-time streaming, large batch ETL, and data exploration use-cases by using Apache Flink, Apache Spark, and Trino across multi-cluster Kubernetes. However, deploying and managing these workflows at scale efficiently can be challenging due to varying resource requirements and uptime needs. For stateful applications like Apache Flink, ensuring recovery and state conservation after downtime is especially important.

This session will discuss how Bloomberg uses Karmada, a multi-cluster management system, to deploy and manage Apache Flink. We’ll also explore how Karmada’s capabilities can be expanded to handle additional data analytics workloads, including Apache Spark and Trino. The session will cover the unique requirements and real-life use-cases for each, including:

- Resource-aware workload scheduling
- Custom resource requirements and health interpretation
- State conservation during application failover
Speakers
avatar for Ilan Filonenko

Ilan Filonenko

Engineering Group Lead, Bloomberg
Ilan Filonenko is an Engineering Group Lead focusing on Cloud Native Data Analytics Infrastructure at Bloomberg - where he has designed and implemented distributed systems at both the application and infrastructure level. Previously, Ilan was an engineering consultant and technical... Read More →
avatar for Michas Szacillo

Michas Szacillo

Tech Lead, Bloomberg L.P.
Michas is a senior software engineer and tech lead on Bloomberg’s Streaming Analytics engineering team. The platform, which is running on Kubernetes, serves as the foundation for many of Bloomberg's data streaming use cases. Michas is also a frequent collaborator to the CNCF community... Read More →
Wednesday June 11, 2025 11:45 - 12:15 HKT
Level 19 | Crystal Court II
  Data Processing + Storage

11:45 HKT

Kube Intelligence - A Metric Based Insightful Remediation Recommender - Yash Bhatnagar, Google
Wednesday June 11, 2025 11:45 - 12:15 HKT
Not everything can be thought about while designing or developing the applications, and as such lot of the design decisions are based on estimates and potential usage patterns.

More often that not, these estimates differ from reality and introduce inefficiencies in the system across several fronts - and if at all visible, it always much later in the lifecycle when you already have several customers & high footprint.

And hence, unless there is a clear sign of performance degradation or unjustified costs, there is often no incentive to invest time & effort for some unknown gains.

In this session Yash will outline a real world case study about how they went about building an internal platform for handling several aspects of post deployment challenges like

1. rightsizing opportunities,
2. architecture migrations like moving to serverless,
3. finding right maintenance windows, etc

by using a wide range of metrics, and how impactful these minor optimizations turned out to be.
Speakers
avatar for Yash Bhatnagar

Yash Bhatnagar

Software Engineer, Google
Yash is working with Google as Software Engineer, and has 9 years of industrial experience with cloud architectures and micro-service development across Google and VMware. He has been a speaker at several international conferences such as KubeCon + CloudNativeCon and Open Source... Read More →
Wednesday June 11, 2025 11:45 - 12:15 HKT
Level 19 | Crystal Court I
  Platform Engineering
  • Content Experience Level Any
  • Presentation Language English

13:45 HKT

Progressive Delivery Made Easy With Argo Rollouts - Kevin Dubois, Red Hat
Wednesday June 11, 2025 13:45 - 14:15 HKT
ou might already be using a CI/CD solution, but are you 100% sure things will roll out without a glitch once you go to production?  Unfortunately differences between testing/staging and production environments are virtually unavoidable. There’s always a risk for unforeseen issues related to your production environment and/or actual load which can lead to potential disruptions to your users.

Progressive delivery is the next step after Continuous Delivery to roll out your application in a controlled and automated way so you can verify and test your application *in production* before it becomes fully available to all your user bases.

Embrace GitOps and Progressive Delivery with techniques like blue-green, canary release, shadowing traffic, dark launches and automatic metrics-based rollouts to validate the application in production using Kubernetes and tools like Istio, Prometheus, ArgoCD, and Argo Rollouts.

Come to this session to learn about Progressive Delivery in action using Kubernetes.
Speakers
avatar for Kevin Dubois

Kevin Dubois

Senior Principal Developer Advocate, Red Hat
Kevin is a Java Champion, software engineer, author and international speaker with a passion for Open Source, Java, and Cloud Native Development & Deployment practices. He currently works as developer advocate at Red Hat where he gets to enjoy working with Open Source projects and... Read More →
Wednesday June 11, 2025 13:45 - 14:15 HKT
Level 19 | Crystal Court I
  Platform Engineering

13:45 HKT

Women's Community Gathering
Wednesday June 11, 2025 13:45 - 14:45 HKT
Strong communities foster a feeling of belonging by providing opportunities for interaction, collaboration, and shared experiences. We hope to do just that with a gathering of attendees who identify as women and non-binary individuals at KubeCon + CloudNativeCon China! Join fellow women community members for networking and connection.
Wednesday June 11, 2025 13:45 - 14:45 HKT
Level 20 | Salon 5
  Inclusion + Accessibility
  • Content Experience Level Any
  • Presentation Language English

14:30 HKT

The Past, the Present, and the Future of Platform Engineering - Mauricio "Salaboy" Salatino, Diagrid & Viktor Farcic, Upbound
Wednesday June 11, 2025 14:30 - 15:00 HKT
Do you think platform engineering is too hard? Or is it just a buzzword? Is the CNCF landscape too tricky to visualize? If you’ve been in this industry long enough, you should know that platform engineering has been around for a long time.

Most of us have been trying to build developer platforms for decades, and most of us have failed at that. That begs the questions: “What is different now?” “Why will this time be different?” and “Do we have a chance to succeed?”

We’ll take a look at the past, the present, and the future of platform engineering. We’ll see what we were doing in the past, what we did wrong, and why we failed. Further on, we’ll see what we (the industry as a whole) are doing now and, more importantly, where we might go from here.

Get ready for the hard truths and challenges you will face when trying to build a platform based on Kubernetes. Join us for a pain-infused journey filled with challenges teams will face when building platforms to enable other teams.
Speakers
avatar for Viktor Farcic

Viktor Farcic

Viktor Farcic, Upbound
Viktor Farcic is a lead rapscallion at Upbound, a member of the CNCF Ambassadors, Google Developer Experts, CDF Ambassadors, and GitHub Stars groups, and a published author. He is a host of the YouTube channel DevOps Toolkit and a co-host of DevOps Paradox.
avatar for Mauricio Salatino

Mauricio Salatino

Software Engineer, Diagrid
Mauricio works as an Open Source Software Engineer at @Diagrid, contributing to and driving initiatives for the Dapr OSS project. Mauricio also serves as a Steering Committee member for the Knative Project and Co-Leading the Knative Functions initiative. He published a book titled... Read More →
Wednesday June 11, 2025 14:30 - 15:00 HKT
Level 19 | Crystal Court I
  Platform Engineering

14:30 HKT

Guardians of Multi-Tenancy: Enhanced Authorization To Prevent Lateral Node Escape - Dahu Kuang & Cheng Gao, Alibaba Cloud
Wednesday June 11, 2025 14:30 - 15:00 HKT
Maximizing security in multi-tenant clusters while maintaining cost-effectiveness is crucial for enterprise OPS. Most enterprise clusters deploy multiple daemonsets, which are attractive targets for attackers seeking to escape and move laterally, ultimately taking over the entire cluster.

The SIG community has introduced several advanced security features recently, such as CRD Field Selectors, Field and Label Selector Authorization, validating admission policy (VAP), and Structured Authorization Config. These allow users to define more flexible authorization configurations, addressing filtering and authorization needs for CRDs, kubelet, and other resources in multi-tenant environments.

We will share the lessons learned from the node escape incidents and demonstrate how to implement these new features and show how to use the Common Expression Language (CEL) to configure customized policies in Authorization Webhook and VAP, resulting more node-specific restrictions within clusters.
Speakers
avatar for Dahu Kuang

Dahu Kuang

Senior Engineer, Alibaba Cloud
Dahu Kuang is a Security Tech Lead on the Alibaba Cloud Container Service for Kubernetes (ACK) team, focusing on the design and implementation of container security-related work, especially within the context of secure supply chain.
avatar for Cheng Gao

Cheng Gao

Senior Security Engineer, Alibaba Cloud
Cheng Gao, Senior Security Engineer at Alibaba Cloud, focuses on the Security Development Lifecycle (SDL) for cloud-native applications. With expertise in container services, observability, and Serverless architectures, Cheng has led security assurance for several internal container... Read More →
Wednesday June 11, 2025 14:30 - 15:00 HKT
Level 16 | Grand Ballroom I
  Security
  • Content Experience Level Any
  • Presentation Language English

15:30 HKT

Policy as Code: Past, Present and Future for Novice - Hoon Jo, Megazone
Wednesday June 11, 2025 15:30 - 16:00 HKT
When you're new to Kubernetes, Policy as Code (PaC) can be a very unfamiliar topic. But as you get more familiar with Kubernetes, you'll probably be interested in how you can use it securely, especially since Kubernetes is essentially a declarative system via YAML, so having security also be done in code will help with usability and reducing human error.

In order to make PaC easier to understand, I'll demonstrate the Admission Control part directly in Kubernetes. Until recently, this part was based on webhooks, but since v1.23, the decision to actively embrace the Common Expression Language (CEL) has made it possible to apply it as code directly inside Kubernetes. Validating Admission Policy became GA in v1.30, and Mutating Admission Policy is in Alpha in v1.32.

Based on this outline, I'll talk about how PaC has been applied to Kubernetes in the past, how it works today, and finally, how we can expect it to be integrated into Kubernetes in the future.

See you at the session! 🙂
Speakers
avatar for Hoon Jo

Hoon Jo

Cloud Solutions Architect, Cloud Native Engineer, Megazone
Hoon Jo is Cloud Solutions Architect as well as Cloud Native engineer at Megazone. He has many times of speaker experience for cloud native technologies. And spread out Cloud Native Ubiquitous in the world. He has written several books and latest books is 『CONTAINER INFRASTRUCTURE... Read More →
Wednesday June 11, 2025 15:30 - 16:00 HKT
Level 16 | Grand Ballroom I
  Cloud Native Novice

15:30 HKT

Composable Platforms: Modular Platform Engineering With Kratix and Backstage - Hossein Salahi, Liquid Reply
Wednesday June 11, 2025 15:30 - 16:00 HKT
Constructing and managing platforms for diverse teams and workloads presents a significant challenge in today's cloud-native environment. This session introduces the concept of composable platforms, using modular, reusable components as the foundation for platform engineering. This talk will demonstrate how using Kratix, a workload-centric framework, and Backstage an extensible developer portal enables the creation of self-service platforms that balance standardization with adaptability.

The session will detail platform design for scalability and governance, streamlining developer workflows through Backstage, and using Kratix Promises for varied workload requirements. Attendees will gain practical insights into building scalable and maintainable platforms through real-world examples, architectural patterns, and a live demonstration of a fully integrated Kratix-Backstage deployment.
Speakers
avatar for Hossein Salahi

Hossein Salahi

Tech Lead, Liquid Reply
Hossein is an experienced cloud computing professional with nearly a decade of expertise in distributed systems and cloud technologies. He began as a student specializing in cloud automation and progressed to a full-time role focusing on on-premises cloud infrastructure and containers... Read More →
Wednesday June 11, 2025 15:30 - 16:00 HKT
Level 19 | Crystal Court I
  Platform Engineering

16:15 HKT

Taming Dependency Chaos for LLM in K8s - Peter Pan, Neko Ayaka & Kebe Liu, DaoCloud
Wednesday June 11, 2025 16:15 - 16:45 HKT
AI developer in K8S: either in Jupyter notebook or LLM serving: Python Dependency is always a headache :
- Prepare a set of base Images? The maintenance amounts & efforts will be a nightmare: Since (1) packages in AI world are rapidly version bumping, (2) diff llm codes require diff packages permutation/combination.
- Leave users to `pip install` by themselves ? The resigned waiting blocks productivity and efficiency. You may agree if you did it.
- If on a GPU Cloud, the pkg preparation time may even cost a lot: you rent a GPU but wasted in waiting pip downloading...
- you may choose to D.I.Y: docker-commit your own base-images, but you have to worry about the Dockerfile, registry and additional cloud cost if you don't have local docker env.

----
So we introduce https://github.com/BaizeAI/dataset.

The solution:
1. A CRD to describe the dependency and env.
2. K8S Job to pre-load the packages.
3. PVC to store and mount
4. `conda` to switch from envs
5. share between namespaces
Speakers
avatar for Peter Pan

Peter Pan

R&D Engineering VP, Daocloud
- DaoCloud Software Engineering VP- Regular KubeCon "Program Committee" : 2023 EU, 2024 HK, 2024 India, 2025 EU- Regular KubeCon Speaker: 2023 SH, 2024 EU, 2024 HK- Maintainer of below CNCF projects : cloudtty, kubean, hwameistor- CNCF WG-AI (AI Working-Group) Member + CNAI white-paper... Read More →
avatar for Kebe Liu

Kebe Liu

DaoCloud, Senior software engineer, DaoCloud
AI Infra and Service Mesh Team Lead at DaoCloud. Member of Istio Steering Committee. Creator of open source projects such as Merbridge and kcover.
avatar for Neko Ayaka

Neko Ayaka

Senior Software Engineer, DaoCloud
Cloud native developer, AI researcher, Gopher with 5 years of experience in loads of development fields across AI, data science, backend, frontend. Co-founder of https://github.com/nolebase
Wednesday June 11, 2025 16:15 - 16:45 HKT
Level 19 | Crystal Court I
  Application Development
  • Content Experience Level Any
  • Presentation Language English
 
Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.