KubeCon + CloudNativeCon Japan 2025: Full Schedule

16-17 June

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Japan 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: All times in this schedule are shown in Japan Standard Time (UTC+9:00). The schedule is subject to change and session seating is available on a first-come, first-served basis.

11:30 JST

A Journey and Lessons Learned To Enable IBM AIU Accelerators in Kubernetes - Takuya Mishina & Tatsuhiro Chiba, IBM Research

Monday June 16, 2025 11:30 - 12:00 JST

Level 1 | Pegasus Ballroom A+B1

In this presentation, we will unveil the secrets of, and our journey toward, enabling IBM's AI accelerator in Kubernetes. We use a wide range of tools and frameworks, such as a device plugin, custom scheduler, metrics exporter, webhooks, and custom resources, together to satisfy real requirements from various stakeholders: general users, cluster administrators, and driver/runtime developers. Our device plugin and custom scheduler accept special preferences, such as topology-aware allocation to enable RDMA, and a webhook-based validator guides users to follow specification changes. To sync allocation status among the components, we carefully defined a custom resource after performance estimation. From the developer perspective, we provide various debugging capabilities: for example, device allocation by PCI address for inspection, and a pseudo-device mode for testing without real devices. In addition, multi-architecture support gives all participants freedom of platform choice.

Speakers

Takuya Mishina

Researcher, IBM Research

Takuya Mishina is a Staff Research Scientist at IBM Research - Tokyo. He has been working on enhancing cloud infrastructure lifecycle management, such as security and compliance posture management. Recent interests include extending the automation mechanism to provide usable AI hardware...

Tatsuhiro Chiba

Senior Technical Staff Member, IBM Research

Tatsuhiro Chiba is an STSM and Manager at IBM Research, specializing in performance optimization and acceleration of large-scale AI and HPC workloads on Hybrid Cloud. He is leading a project to enhance OpenShift performance and sustainability for AI and HPC by exploiting various cloud...

AI + ML

Content Experience Level Intermediate
Presentation Language English

12:10 JST

Beyond Stock Outs: Scaling Inference on Mixed GPU Hardware With DRA - John Belamaric, Google & Jack Francis, Microsoft

Monday June 16, 2025 12:10 - 12:40 JST

Level 1 | Pegasus Ballroom A+B1

Pods are PENDING?!? Ugh, none of the latest GPUs are available. But there are tons of older ones! If only you could tell Kubernetes “use the best GPU available, as long as it has 20GB+”...(enter: DRA).

Kubernetes’ Dynamic Resource Allocation (DRA) system, beta since 1.32, allows variations on which GPUs get allocated to Pods. You can write a flexible spec so that when the Deployment scales, Pods can land on whatever nodes have available GPUs. DRA works even if you need different numbers of devices for different hardware! This enables a new level of utilization and efficiency, saving your organization real money.

Combined with an advanced Node Autoscaler like Google’s Custom Compute Classes or Karpenter, you can spin up more VMs with whatever GPUs are available - or the most economical - all for a single Deployment. Scaling is simpler and more reliable, and your workload can scale even when your preferred type of GPU is stocked out.

Come learn how, and see it in action with a demo!

Speakers

John Belamaric

Senior Staff Software Engineer, Google

John is a Sr Staff SWE, co-chair of K8s SIG Architecture and of K8s WG Device Management, helping lead efforts to improve how GPUs, TPUs, NICs and other devices are selected, shared, and configured in Kubernetes. He is also co-founder of Nephio, an LF project for K8s-based automation...

Jack Francis

Principal Software Engineer, Microsoft

Jack works on open source Kubernetes from his basement in Portland, Oregon, USA. When he’s not working, he’s usually upstairs hanging out with his family. On occasion he straps on a guitar and turns the amp to 11.

AI + ML

Content Experience Level Intermediate
Presentation Language English

14:10 JST

Access AI Models Anywhere: Scaling AI Traffic With Envoy AI Gateway - Dan Sun, Bloomberg & Takeshi Yoneda, Tetrate.io

Monday June 16, 2025 14:10 - 14:40 JST

Level 1 | Pegasus Ballroom A+B1

As Generative AI adoption increases, organizations face accelerating challenges in deploying, scaling, and managing access to diverse AI models across cloud and on-prem environments. Envoy AI Gateway utilizes Envoy Proxy’s powerful filter architecture and extensibility through ext-proc to deliver key features such as centralized credential management, intelligent model routing, and LLM token usage control.

As the first CNCF-backed open source AI gateway, Envoy AI Gateway is built on top of a robust, high-performance Envoy Gateway to help democratize AI infrastructure for organizations of all sizes.

In this talk, we will dive into the architecture of Envoy AI Gateway to learn how it extends Envoy’s capabilities to efficiently manage AI-driven workloads for enterprise needs, while providing robustness, scalability, and adaptability in the rapidly changing generative AI landscape. We will also showcase a demo of an AI agent seamlessly accessing models anywhere through a unified API.

Speakers

Dan Sun

Team Lead, Bloomberg

Dan Sun is a software engineer team lead at Bloomberg. He is the co-founder and maintainer of KServe, an open source Serverless AI inference platform project. He is a co-founder of the Envoy AI Gateway project.

Takeshi Yoneda

Open Source Software Engineer, Tetrate.io

Takeshi Yoneda is a software engineer at Tetrate.io, with contributions to numerous open source projects, including compilers and network proxies. He is a co-founder of the Envoy AI Gateway project.

AI + ML

Content Experience Level Any
Presentation Language English

14:50 JST

Zero-Extraction Cold Starts: How FUSE-Streaming Slashed ComfyUI Cold Starts by 10x - Fog Dong, BentoML

Monday June 16, 2025 14:50 - 15:20 JST

Level 1 | Pegasus Ballroom A+B1

Cold-start delays for GPU-heavy GenAI apps like ComfyUI aren’t just about speed—they’re architectural failures. While others optimize incremental steps, we eliminate entire phases: no image downloads, no layer extraction, no redundant model copies.

We introduce a radical Kubernetes-native pattern: Direct-to-GPU streaming via FUSE-mounted object storage (S3/GCS), bypassing legacy container workflows. By rearchitecting the snapshotter to support seekable, on-demand FUSE streaming, we enable:

- Instant container boot: Models/CUDA dependencies mount directly from object storage, avoiding registry bottlenecks (40MB/s → 900MB/s throughput)
- Zero-extraction overhead: Layers load incrementally via range-optimized fetches, eliminating Zstd unpack/copy latency
- True cold start elimination: ComfyUI pods activate in 90s (vs. 8+ mins) by co-locating model mounting and inference prep

We’ll dissect a live ComfyUI deployment using 100% OSS primitives to hack container internals in the session.

Speakers

Fog Dong

Senior Software Engineer, BentoML

Fog Dong, a Senior Engineer at BentoML, KubeVela maintainer, CNCF Ambassador, and LFAPAC Evangelist, has a rich background in cloud native and AI infra. Previously instrumental in developing Alibaba's large-scale Serverless workflows and ByteDance's cloud-native CI/CD platform, she...

AI + ML

Content Experience Level Advanced
Presentation Language English

15:50 JST

Scaling AI Responsibly: Building Ethical, Sustainable, and Cloud Native AI Systems - Amita Sharma & Vincent Caldeira, Red Hat; Mohit Suman, Salesforce; Shamsher Ansari, Platform9; Anusha Hegde, Nirmata

Monday June 16, 2025 15:50 - 16:20 JST

Level 1 | Pegasus Ballroom A+B1

Panel Discussion - As AI continues to reshape industries, organizations face mounting pressure to scale AI systems responsibly while addressing challenges in efficiency, sustainability, and trust. This panel convenes leading experts to discuss how cloud-native technologies and CNCF projects are paving the way for scalable, ethical, and resource-efficient AI. Attendees will gain actionable insights into optimizing AI workflows, reducing environmental impact, and ensuring transparency in AI decision-making. From leveraging open-source tools to implementing cost-effective and ethical AI practices, this session will equip you with the knowledge to build AI systems that are both innovative and responsible. Discover how to harness the power of cloud-native ecosystems to drive AI transformation without compromising on sustainability or trust.
This session is intended for AI/ML engineers and data scientists looking to scale AI systems in cloud-native environments.

Speakers

Amita Sharma

AI/ML Engineering Manager, Red Hat

Amita is an Engineering Manager at Red Hat, leading Kubeflow Training and Feature Store. With 20 years of industry experience, including 14 years at Red Hat, she has held various roles. She is an active open-source contributor. Since 2011, she has contributed to the Fedora Project and...

Vincent Caldeira

CTO APAC, Red Hat

Vincent Caldeira, CTO of Red Hat in APAC, is responsible for strategic partnerships and technology strategy. Named a top CTO in APAC in 2023, he has 20+ years in IT, excelling in technology transformation in finance. An authority in open source and cloud-native technologies, Vincent...

Shamsher Ansari

Technical Product Manager, Platform9

Shamsher Ansari is a Technical Product Manager at Platform9, driving cloud-native infrastructure and Kubernetes solutions. With extensive experience in cloud, edge computing, and open-source technologies, he focuses on delivering scalable and cost-efficient products. Previously, at...

Anusha Hegde

Senior Technical Product Manager, Nirmata

Anusha Hegde is a Senior Technical Product Manager at Nirmata, focusing on cloud security, Kubernetes policy management, policy-as-code automation, and building AI-first products while analyzing AI’s impact on her product and customers. Previously, she was a Tech Lead at VMware...

Mohit Suman

Senior Product Manager, Salesforce

Mohit Suman is a Product Management Leader at Salesforce, driving AI Observability, MLOps, and AI App Dev. With 12+ years in product strategy, engineering, and architecture, he builds scalable solutions for developer productivity. A passionate advocate for open source and public speaking...

AI + ML

Content Experience Level Intermediate
Presentation Language English

16:30 JST

Optimizing Data Locality and GPU Utilization for Training Workloads in Kubernetes - Haoyuan Li, Alluxio

Monday June 16, 2025 16:30 - 17:00 JST

Level 1 | Pegasus Ballroom A+B1

As organizations scale their model training workloads in cloud-native environments, they face significant data processing and storage challenges: managing massive training datasets across distributed storage systems while ensuring optimal I/O performance. While Kubernetes excels at compute orchestration, the increasing distribution of data across multiple storage backends creates bottlenecks that impact training performance and infrastructure costs.

This presentation introduces a Kubernetes-native distributed caching system that utilizes NVMe storage to overcome data locality challenges. Haoyuan Li will also share real-world, large-scale production use cases to show how this architecture lowers data infrastructure costs, increases GPU utilization, and enables workload portability to navigate GPU scarcity challenges.

Speakers

Haoyuan Li

Founder and CEO, Alluxio

Haoyuan Li is the Founder and CEO of Alluxio. He graduated with a Computer Science Ph.D. from the AMPLab at UC Berkeley. At the AMPLab, he co-created and led Alluxio (formerly Tachyon), an open-source virtual distributed file system. Before UC Berkeley, he received an M.S. from Cornell...

Data Processing + Storage

Content Experience Level Intermediate
Presentation Language English

17:10 JST

Addons Need Love Too: Maintaining Addons for Better Cluster Security - Stevie Caldwell & Andy Suderman, Fairwinds

Monday June 16, 2025 17:10 - 17:40 JST

Level 1 | Pegasus Ballroom A+B1

Projects both within and outside of the CNCF ecosystem provide additional capabilities for Kubernetes clusters. These "addons" become integral to the functioning of our clusters, but we don't often talk about their impact as a whole or managing them holistically as first-class citizens.

We know there are barriers to keeping things like addons up to date, and that it can be difficult to get buy-in for allocating the time and resources to update something that is working just fine (for now), especially if you’re multiple major versions behind. In this session we will help you understand and articulate the benefits of catching up and keeping addons updated, and how to be proactive moving forward. You will walk away with some tools and strategies for navigating the complexity of the addon ecosystem and making the process as painless as possible. You will be able to create an action plan for improving the stability and security of your clusters and share that with stakeholders.

Speakers

Stevie Caldwell

Senior Tech Lead, Fairwinds

Stevie Caldwell is a Senior Site Reliability Engineering Technical Lead at Fairwinds. Stevie also participates in the R&D arm of Fairwinds where she contributes to Fairwinds’s open source projects. She has worked with Kubernetes for 6+ years, has presented at a number of webinars...

Andy Suderman

CTO, Fairwinds

Andy Suderman is CTO at Fairwinds, a managed Kubernetes-as-a-Service provider. Andy has worked with cloud native technologies for the last eight years helping organizations adopt and manage Kubernetes. Andy is the creator and primary developer of Goldilocks, an open source tool that...

Security

Content Experience Level Intermediate
Presentation Language English

11:30 JST

Multi Cluster Magics With Argo CD and Cluster Inventory or Don't Get Lost in the Clusterverse: Navig - Nick Eberts, Google

Tuesday June 17, 2025 11:30 - 12:00 JST

Level 1 | Pegasus Ballroom A+B1

You probably have more than one cluster, and there is a decent chance you are using Argo CD. Additionally, it is quite likely that you have a few other variations of Kubernetes cluster lists. We posit that writing glue code to stitch together these cluster lists is not an awesome use of your time. Thankfully, the good folks in SIG-Multicluster built this super cool API for cluster lists: cluster profile/cluster inventory! We are going to show you how to use said fancy new list with Argo CD, along with other multi-cluster tools, across Kubernetes clusters hosted by different providers. There will be demos. Possibly mustaches. And a decent amount of awful puns. So come on down to bear witness to some sweet multi-cluster abstractions that will surely get your heart rate up.

Speakers

Nick Eberts

Product Manager, Google

Nick is currently the product manager for GKE Fleets & Teams, focusing on multi-cluster capabilities that streamline GCP customers' experience while building platforms on GKE. He is also a Kubernetes contributor, participates in SIG-Multicluster, and has been part of the community since...

Platform Engineering

Content Experience Level Intermediate
Presentation Language English

12:10 JST

Cloud Native Scalability for Internal Developer Platforms - Hiroshi Hayakawa, LY Corporation

Tuesday June 17, 2025 12:10 - 12:40 JST

Level 1 | Pegasus Ballroom A+B1

Platform Engineering enables developers to focus on business value-aligned tasks by providing internal developer platforms (IDPs) that automate non-essential tasks. Kubernetes is widely used as a foundation for IDPs thanks to its scalability and flexibility.

However, Kubernetes was designed as a general workload orchestrator, not a platform component. As a result, IDP builders must integrate additional Cloud Native technologies and customizations, which can create scalability bottlenecks. At LY Corporation, Hiroshi Hayakawa's team has developed a Kubernetes-based, multi-tenant IDP running over 140K pods, and has faced such scalability challenges.

In this session, he will discuss scalability bottlenecks faced in the IDP, including observability pipelines, access control, etc. He will also explore scaling strategies for IDPs and how they address real-world scalability issues. By the end of this session, you will gain deeper insights into scalability challenges from a platform builder’s perspective.

Speakers

Hiroshi Hayakawa

Platform Engineer, LY Corporation

Hiroshi is a lead engineer for Kubernetes-based application platforms in LY Corporation's Private Cloud Division. The company operates numerous large-scale applications on its Kubernetes-based platform, and he excels in ensuring stable operations at scale on Kubernetes and driving...

Platform Engineering

Content Experience Level Intermediate
Presentation Language English

14:10 JST

BGP Peering Patterns for Kubernetes Networking at Preferred Networks - Sho Shimizu, Preferred Networks, Inc. & Yutaro Hayakawa, Isovalent at Cisco

Tuesday June 17, 2025 14:10 - 14:40 JST

Level 1 | Pegasus Ballroom A+B1

BGP (Border Gateway Protocol) is increasingly being used to connect Kubernetes networking with the rest of the IT estate, especially in large-scale and on-premises environments. However, the complexity of many network architectures requires users to have more flexibility and control over how they deploy BGP. Based on the experience at Preferred Networks, this session introduces key BGP peering patterns that enhance Kubernetes networking while maintaining operational simplicity, including:

1. The Sidecar BGP Peering Pattern: A method of running a dedicated BGP speaker alongside Kubernetes networking components, balancing automation with fine-grained control.
2. Native Routing over IP Clos Networks: A tunneling-free approach that integrates Kubernetes with large-scale BGP-based datacenter fabrics for better performance.

Based on real-world experience, we will share best practices and lessons learned, helping attendees design scalable and reliable Kubernetes networking with BGP.

Speakers

Sho Shimizu

Software Engineer, Preferred Networks, Inc.

Sho Shimizu, software engineer at Preferred Networks, Inc., specializes in Kubernetes networking for AI/ML workloads. Since joining in 2019, he has developed a custom CNI plugin and is responsible for container networking architecture across the company's AI/ML infrastructure. Previously...

Yutaro Hayakawa

Software Engineering Technical Leader, Isovalent at Cisco

Working for Cilium at Isovalent. Linux Networking & BPF enthusiast.

Connectivity

Content Experience Level Intermediate
Presentation Language English

14:50 JST

Green OpenTelemetry: Have Your Cake and Eat It Too - Adriana Villela, Dynatrace & Nancy Chauhan, Student

Tuesday June 17, 2025 14:50 - 15:20 JST

Level 1 | Pegasus Ballroom A+B1

It’s a not-so-dirty little secret that the technology that we so heavily rely on comes at an environmental cost. As technology becomes more complex, we need Observability to better understand it, and yet this too contributes to an increasing global tech carbon footprint.

Luckily, we have tools at our disposal, such as Kepler, Kube-Green, and green reviews, that can help us understand our carbon footprint and take mitigating actions.

In this talk, attendees will learn about Kepler, Kube-Green, and green reviews, and how to use these tools to make tweaks to their OpenTelemetry Collectors and other Kubernetes infrastructure. The goal is to keep systems observable while keeping the environment in mind.

Speakers

Nancy Chauhan

Engineer | Co-Chair, CNCF TAG Environmental Sustainability | Founder, Women in Cloud Native Community, Student

I am Nancy Chauhan, a software engineer passionate about solving complex problems and enhancing software reliability. As a CNCF Ambassador, I engage with a global cloud-native community, contributing to open-source projects and fostering collaboration. I also founded the Women in...

Adriana Villela

Principal Developer Advocate, Dynatrace

Adriana Villela is a Principal Developer Advocate, helping companies achieve reliability greatness through Observability, SRE, & DevOps practices. Previously, she managed a Platform Engineering team & an Observability Practices team at Tucows. Adriana has worked at various large-scale...

Observability

Content Experience Level Intermediate
Presentation Language English

15:50 JST

The Future of Prometheus Exposition Format - Arthur Sens, Grafana Labs

Tuesday June 17, 2025 15:50 - 16:20 JST

Level 1 | Pegasus Ballroom A+B1

OpenMetrics (OM) had a wild journey: it started as a project to standardize the Prometheus exposition format, and it became an entirely separate CNCF Incubating project. Even though the project had high maturity, it struggled for years to find tools to comply with the first version of the spec. Finally, in 2025, it was incorporated back into the Prometheus GitHub organization so Prometheus developers could lead the efforts for OM 2.0.

In this talk, Arthur, a Prometheus maintainer and OpenMetrics contributor, will walk you through the main challenges that tools like Prometheus and OpenTelemetry face when trying to comply with OpenMetrics 1.0 and how the community plans to address these challenges in OM 2.0.

The audience will also learn how changing an exposition format can make Prometheus and the OpenTelemetry Collector more memory-efficient while making their specifications easier to translate into each other!

Speakers

Arthur Silva Sens

Senior Software Engineer, Grafana Labs

Arthur Sens is a Software Engineer at Grafana, focusing on Prometheus and OpenTelemetry interoperability. He is also an active member and maintainer for both communities. The only things that can take Arthur away from the computer are his passion for lifting unnecessarily heavy...

Observability

Content Experience Level Beginner
Presentation Language English

16:30 JST

From Moon Prism Power To eBPF Super Saiyan: A Guide To Cloud Native Security & Observability - Carla Gaggini, Isovalent at Cisco & Kenta Tada, Toyota Motor Corporation

Tuesday June 17, 2025 16:30 - 17:00 JST

Level 1 | Pegasus Ballroom A+B1

Join us on an anime-powered journey to explore the power of eBPF in cloud native. Inspired by Sailor Moon and Dragon Ball GT, this talk breaks down how eBPF enhances observability, networking, and security, using the Sailor Guardians and Gohan to make eBPF approachable. We’ll break down topics like real-time observability by comparing it to psychic flames.
And just as unchecked power in anime can lead to chaos, eBPF’s immense capabilities need to be used wisely. Drawing from Dragon Ball GT's arc where the Dragon Balls become a threat, we’ll examine best practices for safe deployment of eBPF (e.g., LSM, verifier constraints, Linux Capability restrictions). Spoiler alert: our Sailor Guardians and Gohan will come out victorious, worry not!

By the end, you’ll gain a clear understanding of how to wield eBPF safely and effectively in cloud native environments, no Super Saiyan transformation required!

Speakers

Carla Gaggini

Head of Global Community Events, Isovalent at Cisco

Carla has been managing events and communities since 2011, producing and running many conferences, meetups, webinars and hackathons. Formerly managing everything “WTF is Cloud Native” and “Software Circus”, she now buzzes around eBPF at Isovalent at Cisco. She is one of the...

Kenta Tada

Project Manager, Toyota Motor Corporation

Kenta Tada is an experienced Linux engineer and team lead who has worked with various organizations. He has contributed to the Linux kernel and BPF tools and has served as a reviewer for the system validator used in kubeadm. He also reviewed the Japanese translation of the book...

Cloud Native Novice

Content Experience Level Any
Presentation Language English