The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Japan 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.
Please note: This schedule is automatically displayed in Japan Standard Time (UTC+9:00). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Sign up or log in to add sessions to your schedule and sync them to your phone or calendar.
Kubernetes cluster upgrades are frequent and can lead to unforeseen problems, including application downtime. Therefore, it is essential to understand the roles of each component that makes up a Kubernetes cluster and how they behave during an upgrade in order to safely upgrade the Kubernetes cluster and achieve zero downtime for your applications.
This session will explain the basics and practical practices to improve safety, and ensure zero downtime for your applications. In 30 minutes, we will provide an easy-to-understand explanation of everything from the roles of the main Kubernetes components to points to note during upgrades and recommended configurations to minimize or eliminate application downtime during the upgrade process. This session is recommended for those who are going to operate Kubernetes in earnest and those who are troubled by upgrades and want to learn how to prevent application downtime during Kubernetes cluster upgrades.
Kazuki Uchima is a Technical Solutions Engineer at Google Cloud, specializing in Cloud Native technologies with a focus on Kubernetes. He provides consulting, architecture design, and technical support for Cloud Native solutions.
Kakeru Ishii is a Technical Solutions Engineer in Japan who helps Google Cloud customers especially when they have troubles with Kubernetes cluster, or applications on it. Background of his technical skills are around computer graphics, but now his enthusiasm is towards infrastructure... Read More →
At PlayStation Network, our Kubernetes platform with 50+ clusters handles massive amounts of user traffic every day, and the platform team consists of engineers in several global locations with different technological and cultural backgrounds. Despite such scale and organizational complexity, we achieved remarkable stability in FY2024 so far, maintaining a notable 99.995% uptime for our platform. In this session, we will share the key practices behind this success, including a controlled deployment strategy, robust scaling techniques, minimized manual intervention, and 24/7 operations spanning global regions. While these approaches may not be special individually, their consistent and disciplined application has been the foundation of our platform's stability. Those who strive to achieve stable platform operation and organizations looking to expand or consolidate their platforms will leave with actionable strategies to enhance the reliability of their platform.
Engineering Manager, Sony Interactive Entertainment
I'm an engineering manager at Sony Interactive Entertainment. I joined in 2017 and led a team that provided CI/CD pipelines and ECS infrastructure for PlayStation Network developers on the Japan site. Since 2021, I have been part of the global Internal Developer Platform team, leading... Read More →
I am a Software Engineer at Sony Interactive Entertainment, where I develop and operate the Kubernetes-based application platform for PlayStation Network. Although I am in my first year as a professional, my passion for Kubernetes and cloud-native technologies began during my student... Read More →
While typical web applications do not require large amounts of resources constantly, there are cases where specific processes consume significant CPU and memory. In this session, we will introduce an architecture that offloads such resource-intensive processes to Kubernetes Jobs. We will explain specific methods for Job management, how to integrate web applications (Next.js, @kubernetes/client-node) with the Kubernetes API, methods for data integration between Jobs and web applications, and real-time tracking of Job progress in the UI, all while sharing practical examples. Furthermore, we will provide a detailed introduction to a pattern where Kubernetes Job definitions generated from applications are managed using ConfigMaps, enabling quick configuration switching between environments, and offer hints to optimize your applications in terms of cost, performance, and management.
Focuses on cloud-native application development and operations centered on Kubernetes, holding CKAD and CKA certifications. Specializes in automation using GitHub Actions and loves reducing tedious work. Has expertise in designing and operating systems combining Web Application... Read More →
Excessive DNS query pressure can cause CoreDNS failures, leading to a cluster-wide meltdown. Thus, per-pod or per-node DNS proxy is a common high-availability solution. By binding the clusterIP of the kube-dns service to the local node, node-local DNS can provide seamless DNS interception and proxying, meeting the demands of low latency and high throughput. However, proxy's breakdown can directly lead to access failures, lacking high availability. To solve this issue, we use cgroup eBPF to implement enhanced service redirection for node-local DNS, inspired by Cilium's LocalRedirectPolicy. Based on the health status of the local proxy or the redirection QoS strategy, the query of the kube-dns service can be dynamically resolved to the local node-local DNS, otherwise normally forwarded to CoreDNS. This scheme is independent of the CNI, and could cooperate with kube-proxy, offer multiple service redirection strategies and provide production-grade quality assurance for node-local DNS.
I currently serve as a Senior Tech Lead at DaoCloud, with over 14 years of engineering experience. I have had the experience of being a speaker at KubeCon three times. I am the initiator and maintainer of the CNCF sandbox project Spiderpool, and an active community member of Cilium... Read More →
"I've started to understand the basics of Kubernetes, but when it comes to running it in production, I can't quite imagine what kind of issues might arise..." To ease these concerns, this session will use original characters, illustrations, and animations in a “cute manga” style to visually demonstrate how production applications can break and how to troubleshoot them. In our story, the main application as a character takes center stage as the hero, venturing out on a grand journey—only to be "suddenly attacked by a monster" at the most inopportune moment. By following this storyline, you will learn both how applications fail and how to fix them. Additionally, just as an adventurer equips better armor to prepare for future battles, we will explore common issues and explain how to prevent them from happening in the first place. Join us on an exciting adventure in "manga-style troubleshooting" and gain the confidence to tackle production Kubernetes challenges head-on!
I am an SRE at a major technology company, specializing in ensuring the reliability, scalability, and performance of a large-scale online learning platform. My work focuses on monitoring, automation, and incident response to maintain high availability for a wide user base. I also... Read More →
Kubernetes is widely recognized as a platform for building platforms, but even with modern platform engineering techniques, managing an end-to-end release and deployment lifecycle remains challenging.
This talk will first analyze key pain points in platform engineering and propose a new perspective on infrastructure code by separating different personas’ views. We advocate for a tenant-centric API that prioritizes user experience—minimizing input, reducing learning curves, and abstracting cloud provider details to decouple desired resources from underlying specs.
Next, we’ll introduce a design for fanning out tenant resource claims, enhancing flexibility and extensibility through a code generation component and a Kubernetes-like labeling system. Finally, we’ll cover the glue that binds everything together: Pkl for templating and validation, Prow for GitHub event-driven automation, and Crossplane + ArgoCD as the claim realization engine.
Wei Huang is a Software Engineer at Apple, focusing on Kube scheduling and control plane. He has served as a co-chair of Kubernetes SIG-Scheduling for years. He is also the founder of two Kubernetes sub-projects, scheduler-plugins, and kwok.
Got an open source project? Considering submitting it to the CNCF (or another foundation)? Whether and when to do this is one of the biggest decisions you'll make in the life of your project. Joining a foundation changes things fundamentally, and whether or not this is a good decision is going to depend on where you are in your project's development and what your goals for the project currently are.
As a community architect in the Red Hat Open Source Program Office, I help projects with this decision. During this talk, I will share common things that projects need to consider before they make this important decision. I will also talk about common benefits and challenges that projects usually experience. After this talk you should have a better understanding of whether you should contribute the project to CNCF or not.
I am a community architect at Red Hat, Open Source Program Office, where I work with open source projects related to Red Hat cloud native technologies and help them grow their communities. I enjoy a good cup of coffee and exploring cities on a bicycle :)
As cloud-native applications accelerate in complexity, managing APIs end to end is a top priority. This session introduces a next-generation, Kubernetes-native approach using Envoy Proxy and WebAssembly (Wasm) for dynamic, high-performance traffic control.
We’ll show how Wasm powers real-time policy enforcement, AI-assisted traffic analysis, and advanced rate limiting—without rebuilding Envoy. Attendees will learn how to implement Zero Trust principles, secure multi-tenant API gateways, and deliver external routes with minimal overhead.
Observability enhancements using eBPF and OpenTelemetry will demonstrate how to align trace data with runtime metrics for deeper insights. Finally, we’ll discuss developer onboarding, version governance, and lifecycle management to ensure long-term API success.
A live demo will illustrate ephemeral testing environments, rapid iteration, and how to achieve a future-proof, high-throughput API management layer in Kubernetes.
Mostafa is a technologist and consultant specializing in cloud native computing. He started his career as a software engineer before getting into the trenches of application and production support. He enjoys helping enterprise companies successfully adopt DevOps and cloud-native technologies... Read More →
Principal Technical Solutions Architect, Akamai Technologies
Brandon Kang is a Principal Technical Solutions Architect at Akamai Technologies, specializing in cloud-native projects across Asia as a compute specialist.Before joining Akamai, he served as a Lead Software Engineer at Samsung, a Senior Program Manager at Microsoft, and a Service... Read More →
At PlayStation Network, our Kubernetes platform with 50+ clusters handles massive amounts of user traffic every day, and the platform team consists of engineers in several global locations with different technological and cultural backgrounds. Despite such scale and organizational complexity, we achieved remarkable stability in FY2024 so far, maintaining a notable 99.995% uptime for our platform. In this session, we will share the key practices behind this success, including a controlled deployment strategy, robust scaling techniques, minimized manual intervention, and 24/7 operations spanning global regions. While these approaches may not be special individually, their consistent and disciplined application has been the foundation of our platform's stability. Those who strive to achieve stable platform operation and organizations looking to expand or consolidate their platforms will leave with actionable strategies to enhance the reliability of their platform.
Engineering Manager, Sony Interactive Entertainment
I'm an engineering manager at Sony Interactive Entertainment. I joined in 2017 and led a team that provided CI/CD pipelines and ECS infrastructure for PlayStation Network developers on the Japan site. Since 2021, I have been part of the global Internal Developer Platform team, leading... Read More →
I am a Software Engineer at Sony Interactive Entertainment, where I develop and operate the Kubernetes-based application platform for PlayStation Network. Although I am in my first year as a professional, my passion for Kubernetes and cloud-native technologies began during my student... Read More →
OpenTelemetry has become the go-to framework for unifying observability signals across metrics, logs, and traces. However, implementing OpenTelemetry often comes with its own set of challenges: broken instrumentation, missing signals, and misaligned semantic conventions that undermine its effectiveness. Debugging these issues can be daunting, leaving teams stuck with incomplete or unreliable observability data.
In this session, Kasper will demystify the debugging process for OpenTelemetry. Attendees will learn how to identify and troubleshoot common issues, ensure signals are transferred correctly, and align instrumentation with semantic conventions for consistent insights. Through live demos, Kasper will showcase techniques for validating resource configurations, debugging signal pipelines, and building confidence in your observability setup. This session is designed for anyone looking to unlock the full potential of OpenTelemetry and create robust observability practices.
Kasper is a Co-Chair for KubeCon+CloudNativeCon EU/NA, Kubestronaut, CNCF Ambassador, and co-founder of the Nordic meetup alliances, Cloud Native Nordics, where he also serves as Community Lead. He works in Developer Relations at Dash0, previously Lunar where he and his team built... Read More →
OpenMetrics (OM) had a wild journey: it started as a project to standardize the Prometheus exposition format, and it became an entirely separate CNCF Incubating project. Even though the project had high maturity, it struggled for years to find tools to comply with the first version of the spec. Finally, in 2025, it was incorporated back into the Prometheus Github organization so Prometheus developers could lead the efforts for OM 2.0.
In this talk, Arthur, a Prometheus maintainer and OpenMetrics contributor, will walk you through the main challenges that tools like Prometheus and OpenTelemetry face when trying to comply with OpenMetrics 1.0 and how the community plans to address these challenges in OM 2.0.
The audience will also learn how changing an exposition format can make Prometheus and OpenTelemetry-Collector more memory-efficient while making their specifications easier to translate into each other!
Arthur Sens is a Software Engineer at Grafana, focusing on Prometheus and OpenTelemetry interoperability. He is also an active member and maintainer for both communities. The only things that can take Arthur away from the computer are his passion for lifting unnecessarily heavy... Read More →