Loading…
16-17 June
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Japan 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Japan Standard Time (UTC+9:00)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Venue: Level 1 | Pegasus Ballroom A+B1 clear filter
Monday, June 16
 

11:30 JST

A Journey and Lessons Learned To Enable IBM AIU Accelerators in Kubernetes - Takuya Mishina & Tatsuhiro Chiba, IBM Research
Monday June 16, 2025 11:30 - 12:00 JST
In this presentation, we will unveil the secrets and journey to enable an IBM's AI Accelerator in Kubernetes. We utilize wide range of tools and frameworks such as device plugin, custom scheduler, metrics exporter, webhooks and custom resources together to satisfy real requirements from various stakeholders - general users, cluster administrators and driver/runtime developers. Our device plugin and custom scheduler can accept special preference such as topology-aware allocation to enable RDMA, and webhook-based validator guides them to follow specification changes. To sync up allocation status among the components, we carefully defined a custom resource after performance estimation. From developer perspective, we provide various debug-purpose capabilities: for example, device allocation by PCI address for inspection, and pseudo device mode to achieve non-real-device test. In addition, multi-architecture support gives freedom of platform choice to all of the participants.
Speakers
avatar for Takuya Mishina

Takuya Mishina

Researcher, IBM Research
Takuya Mishina is a Staff Research Scientist at IBM Research - Tokyo. He has been working for enhancing cloud infrastructure lifecycle management such as security and compliance posture management. Recent interests include extending the automation mechanism to provide usable AI hardware... Read More →
avatar for Tatsuhiro Chiba

Tatsuhiro Chiba

Senior Technical Staff Member, IBM Research
Tatsuhiro Chiba is a STSM and Manager at IBM Research, specialized in performance optimization and acceleration of large scale AI and HPC workloads on Hybrid Cloud. He is leading a project to enhance OpenShift performance and sustainability for AI and HPC by exploiting various cloud... Read More →
Monday June 16, 2025 11:30 - 12:00 JST
Level 1 | Pegasus Ballroom A+B1
  AI + ML

12:10 JST

Beyond Stock Outs: Scaling Inference on Mixed GPU Hardware With DRA - John Belamaric, Google & Jack Francis, Microsoft
Monday June 16, 2025 12:10 - 12:40 JST
Pods are PENDING?!? Ugh, none of the latest GPUs are available. But there are tons of older ones! If only you could tell Kubernetes “use the best GPU available, as long as it has 20GB+”...(enter: DRA).

Kubernetes’ Dynamic Resource Allocation (DRA) system, beta since 1.32, allows variations on which GPUs get allocated to Pods. You can write a flexible spec so that when the Deployment scales, Pods can land on whatever nodes have available GPUs. DRA works even if you need different numbers of devices for different hardware! This enables a new level of utilization and efficiency, saving your organization real money.

Combined with an advanced Node Autoscaler like Google’s Custom Compute Classes or Karpenter, you can spin up more VMs with whatever GPUs are available - or the most economical - all for a single Deployment. Scaling is simpler and more reliable, and your workload can scale even when your preferred type of GPU is stocked out.

Come learn how, and see it in action with a demo!
Speakers
avatar for John Belamaric

John Belamaric

Senior Staff Software Engineer, Google
John is a Sr Staff SWE, co-chair of K8s SIG Architecture and of K8s WG Device Management, helping lead efforts to improve how GPUs, TPUs, NICs and other devices are selected, shared, and configured in Kubernetes. He is also co-founder of Nephio, an LF project for K8s-based automation... Read More →
avatar for Jack Francis

Jack Francis

Principal Software Engineer, Microsoft
Jack works on open source Kubernetes from his basement in Portland, Oregon, USA. When he’s not working, he’s usually upstairs hanging out with his family. On occasion he straps on a guitar and turns the amp to 11.
Monday June 16, 2025 12:10 - 12:40 JST
Level 1 | Pegasus Ballroom A+B1
  AI + ML

14:10 JST

Access AI Models Anywhere: Scaling AI Traffic With Envoy AI Gateway - Dan Sun, Bloomberg & Takeshi Yoneda, Tetrate.io
Monday June 16, 2025 14:10 - 14:40 JST
As Generative AI adoption increases, organizations face accelerating challenges in deploying, scaling, and managing access to diverse AI models across cloud and on-prem environments. Envoy AI Gateway utilizes Envoy Proxy’s powerful filter architecture and extensibility through ext-proc to deliver key features such as centralized credential management, intelligent model routing, and LLM token usage control.
As the first CNCF-backed open source AI gateway, Envoy AI Gateway is built on top of a robust, high performance Envoy Gateway to help democratize AI infrastructure for organizations of all sizes.

In this talk, we will dive into the architecture of Envoy AI Gateway to learn how it extends Envoy’s capabilities to efficiently manage AI-driven workloads for enterprise needs, while providing robustness, scalability, and adaptability in the rapidly-changing generative AI landscape. We will also showcase a demo of an AI agent seamlessly accessing models anywhere through a unified API.
Speakers
avatar for Dan Sun

Dan Sun

Team Lead, Bloomberg
Dan Sun is a software engineer team lead at Bloomberg. He is the co-founder and maintainer of KServe, an open source Serverless AI inference platform project. He is a co-founder of the Envoy AI Gateway project.
avatar for Takeshi Yoneda

Takeshi Yoneda

Open Source Software Engineer, Tetrate.io
Takeshi Yoneda is a software engineer at Tetrate.io, with contributions to numerous open source projects, including compilers and network proxies. He is a co-founder of the Envoy AI Gateway project.
Monday June 16, 2025 14:10 - 14:40 JST
Level 1 | Pegasus Ballroom A+B1
  AI + ML
  • Content Experience Level Any
  • Presentation Language English

14:50 JST

Zero-Extraction Cold Starts: How FUSE-Streaming Slashed ComfyUI Cold Starts by 10x - Fog Dong, BentoML
Monday June 16, 2025 14:50 - 15:20 JST
Cold-start delays for GPU-heavy GenAI apps like ComfyUI aren’t just about speed—they’re architectural failures. While others optimize incremental steps, we eliminate entire phases: no image downloads, no layer extraction, no redundant model copies.

We introduce a radical Kubernetes-native pattern: Direct-to-GPU streaming via FUSE-mounted object storage (S3/GCS), bypassing legacy container workflows. By rearchitecting the snapshotter to support seekable, on-demand FUSE streaming, we enable:

- Instant container boot: Models/CUDA dependencies mount directly from object storage, avoiding registry bottlenecks (40MB/s → 900MB/s throughput)
- Zero-extraction overhead: Layers load incrementally via range-optimized fetches, eliminating Zstd unpack/copy latency
- True cold start elimination: ComfyUI pods activate in 90s (vs. 8+ mins) by co-locating model mounting and inference prep

We’ll dissect a live ComfyUI deployment using 100% OSS primitives to hack container internals in the session.
Speakers
avatar for Fog Dong

Fog Dong

Senior Software Engineer, BentoML
Fog Dong, a Senior Engineer at BentoML, KubeVela maintainer, CNCF Ambassador, and LFAPAC Evangelist, has a rich background in cloud native and AI infra. Previously instrumental in developing Alibaba's large-scale Serverless workflows and Bytedance's cloud-native CI/CD platform, she... Read More →
Monday June 16, 2025 14:50 - 15:20 JST
Level 1 | Pegasus Ballroom A+B1
  AI + ML

15:50 JST

Scaling AI Responsibly: Building Ethical, Sustainable, and Cloud Native AI Systems - Amita Sharma & Vincent Caldeira, Red Hat; Mohit Suman, Salesforce; Shamsher Ansari, Platform9; Anusha Hegde, Nirmata
Monday June 16, 2025 15:50 - 16:20 JST
Panel Discussion - As AI continues to reshape industries, organizations face mounting pressure to scale AI systems responsibly while addressing challenges in efficiency, sustainability, and trust. This panel convenes leading experts to discuss how cloud-native technologies and CNCF projects are paving the way for scalable, ethical, and resource-efficient AI. Attendees will gain actionable insights into optimizing AI workflows, reducing environmental impact, and ensuring transparency in AI decision-making. From leveraging open-source tools to implementing cost-effective and ethical AI practices, this session will equip you with the knowledge to build AI systems that are both innovative and responsible. Discover how to harness the power of cloud-native ecosystems to drive AI transformation without compromising on sustainability or trust.
AI/ML engineers and data scientists looking to scale AI systems in cloud-native environments.
Speakers
avatar for Amita Sharma

Amita Sharma

AI/ML Engineering Manager, Red Hat
Amita is an Engineering Manager at Red Hat, leading Kubeflow Training, Feature Store. With 20 years of industry experience, including 14 years at Red Hat, she has held various roles. She is an active open-source contributor. Since 2011, she has contributed to the Fedora Project and... Read More →
avatar for Vincent Caldeira

Vincent Caldeira

CTO APAC, Red Hat
Vincent Caldeira, CTO of Red Hat in APAC, is responsible for strategic partnerships and technology strategy. Named a top CTO in APAC in 2023, he has 20+ years in IT, excelling in technology transformation in finance. An authority in open source and cloud-native technologies, Vincent... Read More →
avatar for Shamsher Ansari

Shamsher Ansari

Technical Product Manager, Platform9
Shamsher Ansari is a Technical Product Manager at Platform9, driving cloud-native infrastructure and Kubernetes solutions. With extensive experience in cloud, edge computing, and open-source technologies, he focuses on delivering scalable and cost-efficient products. Previously, at... Read More →
avatar for Anusha Hegde

Anusha Hegde

Senior Technical Product Manager, Nirmata
Anusha Hegde is a Senior Technical Product Manager at Nirmata, focusing on cloud security, Kubernetes policy management, policy-as-code automation, and building AI-first products while analyzing AI’s impact on her product and customers. Previously, she was a Tech Lead at VMware... Read More →
avatar for Mohit Suman

Mohit Suman

Senior Product Manager, Salesforce
Mohit Suman is a Product Management Leader at Salesforce, driving AI Observability, MLOps, and AI App Dev. With 12+ years in product strategy, engineering, and architecture, he builds scalable solutions for developer productivity. A passionate advocate for open source and public speaking... Read More →
Monday June 16, 2025 15:50 - 16:20 JST
Level 1 | Pegasus Ballroom A+B1
  AI + ML

16:30 JST

Optimizing Data Locality and GPU Utilization for Training Workloads in Kubernetes - Haoyuan Li, Alluxio
Monday June 16, 2025 16:30 - 17:00 JST
As organizations scale their model training workloads in cloud-native environments, they face significant data processing and storage challenges: managing massive training datasets across distributed storage systems while ensuring optimal I/O performance. While Kubernetes excels at compute orchestration, the increasing distribution of data across multiple storage backends creates bottlenecks that impact training performance and infrastructure costs.

This presentation introduces a Kubernetes-native distributed caching system that utilizes NVMe storage to overcome data locality challenges. Haoyuan Li will also share real-world, large-scale production use cases to show how this architecture lowers data infrastructure costs, increases GPU utilization, and enables workload portability to navigate GPU scarcity challenges.
Speakers
avatar for Haoyuan Li

Haoyuan Li

Founder and CEO, Alluxio
Haoyuan Li is the Founder and CEO of Alluxio. He graduated with a Computer Science Ph.D. from the AMPLab at UC Berkeley. At the AMPLab, he co-created and led Alluxio (formerly Tachyon), an open-source virtual distributed file system. Before UC Berkeley, he got a M.S. from Cornell... Read More →
Monday June 16, 2025 16:30 - 17:00 JST
Level 1 | Pegasus Ballroom A+B1
  Data Processing + Storage

17:10 JST

Addons Need Love Too: Maintaining Addons for Better Cluster Security - Stevie Caldwell & Andy Suderman, Fairwinds
Monday June 16, 2025 17:10 - 17:40 JST
Projects both within and outside of the CNCF ecosystem provide additional capabilities for Kubernetes clusters. These "addons" become integral to the functioning of our clusters, but we don't often talk about their impact as a whole or managing them holistically as first-class citizens.

We know there are barriers to keeping things like addons up-to-date and that it can be difficult to get buy-in for allocating the time and resources for updating something that is working just fine (for now), especially if you’re multiple major versions behind. In this session we will help you understand and articulate the benefits of catching up and keeping addons updated and how to be proactive moving forward. You will walk away with some tools and strategies for navigating the complexity of the addon ecosystem and make the process as painless as possible. You will be able to create an action plan for improving the stability and security of your clusters and share that with stakeholders.
Speakers
avatar for Stevie Caldwell

Stevie Caldwell

Senior Tech Lead, Fairwinds
Stevie Caldwell is a Senior Site Reliability Engineering Technical Lead at Fairwinds. Stevie also participates in the R&D arm of Fairwinds where she contributes to Fairwinds’s open source projects. She has worked with Kubernetes for 6+ years, has presented at a number of webinars... Read More →
avatar for Andy Suderman

Andy Suderman

CTO, Fairwinds
Andy Suderman is CTO at Fairwinds, a managed Kubernetes-as-a-Service provider. Andy has worked with cloud native technologies for the last eight years helping organizations adopt and manage Kubernetes. Andy is the creator and primary developer of Goldilocks—an open source tool that... Read More →
Monday June 16, 2025 17:10 - 17:40 JST
Level 1 | Pegasus Ballroom A+B1
  Security
 
Tuesday, June 17
 

11:30 JST

Multi Cluster Magics With Argo CD and Cluster Inventory or Don't Get Lost in the Clusterverse: Navig - Nick Eberts, Google
Tuesday June 17, 2025 11:30 - 12:00 JST
You probably have more than one cluster and there is a decent chance you are using Argo CD. Additionally, it is quite likely that you have a few other variations of Kubernetes cluster lists. We posit that writing glue code to stitch together these clusters lists is not an awesome use of your time. Thankfully the good folks in SIG-Multicluster built this super cool api for cluster lists, cluster profile/cluster inventory! We are going to show you how to use said fancy new list with Argo CD along with other multi-cluster tools across Kubernetes clusters hosted by different providers. There will be demos. Possibly Mustaches. And a decent amount of awful puns. So come on down to bear witness to some sweet multi-cluster abstractions that will surely get your heart rate up.
Speakers
avatar for Nick Eberts

Nick Eberts

Product Manager, Google
Nick is currently the product manager for GKE Fleets & Teams focusing on multi-cluster capabilities that streamline GCP customers experience while building platforms on GKE. He also is a Kubernetes contributor, participates in SIG-Multicluster, and has been part of the community since... Read More →
Tuesday June 17, 2025 11:30 - 12:00 JST
Level 1 | Pegasus Ballroom A+B1
  Platform Engineering

12:10 JST

Cloud Native Scalability for Internal Developer Platforms - Hiroshi Hayakawa, LY Corporation
Tuesday June 17, 2025 12:10 - 12:40 JST
Platform Engineering enables developers to focus on business value-aligned tasks by providing internal developer platforms (IDPs) that automate non-essential tasks. Kubernetes is widely used as a foundation for IDPs thanks to its scalability and flexibility.

However, Kubernetes was designed as a general workload orchestrator, not a platform component. As a result, IDP builders must integrate additional Cloud Native technologies and customizations, which can create scalability bottlenecks. At LY Corporation, his team has developed a Kubernetes-based, multi-tenant IDP running over 140K pods, and they faced such scalability challenges.

In this session, he will discuss scalability bottlenecks faced in the IDP, including observability pipelines, access control, etc. He will also explore scaling strategies for IDPs and how they address real-world scalability issues. By the end of this session, you will gain deeper insights into scalability challenges from a platform builder’s perspective.
Speakers
avatar for Hiroshi Hayakawa

Hiroshi Hayakawa

Platform Engineer, LY Corporation
Hiroshi is a lead engineer for Kubernetes-based application platforms in LY Corporation's Private Cloud Division. The company operates numerous large-scale applications on its Kubernetes-based platform, and he excels in ensuring stable operations at scale on Kubernetes and driving... Read More →
Tuesday June 17, 2025 12:10 - 12:40 JST
Level 1 | Pegasus Ballroom A+B1
  Platform Engineering

14:10 JST

BGP Peering Patterns for Kubernetes Networking at Preferred Networks - Sho Shimizu, Preferred Networks, Inc. & Yutaro Hayakawa, Isovalent at Cisco
Tuesday June 17, 2025 14:10 - 14:40 JST
BGP (Border Gateway Protocol) is increasingly being used to connect Kubernetes networking with the rest of the IT estate, especially in large-scale and on-premises environments. However, the complexity of many network architectures requires users to have more flexibility and control over how they deploy BGP. Based on the experience at Preferred Networks, this session introduces key BGP peering patterns that enhance Kubernetes networking while maintaining operational simplicity, including:

1. The Sidecar BGP Peering Pattern: A method of running a dedicated BGP speaker alongside Kubernetes networking components, balancing automation with fine-grained control.
2. Native Routing over IP Clos Networks – A tunneling-free approach that integrates Kubernetes with large-scale BGP-based datacenter fabrics for better performance.

Based on real-world experience, we will share best practices and lessons learned, helping attendees design scalable and reliable Kubernetes networking with BGP.
Speakers
avatar for Sho Shimizu

Sho Shimizu

Software Engineer, Preferred Networks, Inc.
Sho Shimizu, software engineer at Preferred Networks, Inc., specializes in Kubernetes networking for AI/ML workloads. Since joining in 2019, he has developed a custom CNI plugin and is responsible for container networking architecture across the company's AI/ML infrastructure. Previously... Read More →
avatar for Yutaro Hayakawa

Yutaro Hayakawa

Software Engineering Technical Leader, Isovalent at Cisco
Working for Cilium at Isovalent. Linux Networking & BPF enthusiast.
Tuesday June 17, 2025 14:10 - 14:40 JST
Level 1 | Pegasus Ballroom A+B1
  Connectivity

14:50 JST

Green OpenTelemetry: Have Your Cake and Eat It Too - Adriana Villela, Dynatrace & Nancy Chauhan, Student
Tuesday June 17, 2025 14:50 - 15:20 JST
It’s a not-so-dirty little secret that the technology that we so heavily rely on comes at an environmental cost. As technology becomes more complex, we need Observability to better understand it, and yet this too contributes to an increasing global tech carbon footprint.

Luckily, we have tools at our disposal that can help us understand our carbon footprint, and take mitigating actions. Tools like Kepler, Kube-Green, and green reviews.

In this talk, attendees will learn about Kepler, Kube-Green and green reviews. They will learn how to use these tools to make tweaks to their OpenTelemetry Collectors and other Kubernetes infrastructure. This will therefore keep systems observable while keeping the environment in mind.
Speakers
avatar for Nancy Chauhan

Nancy Chauhan

Engineer | Co-Chair, CNCF TAG Environmental Sustainability | Founder, Women in Cloud Native Community, Student
I am Nancy Chauhan, a software engineer passionate about solving complex problems and enhancing software reliability. As a CNCF Ambassador, I engage with a global cloud-native community, contributing to open-source projects and fostering collaboration. I also founded the Women in... Read More →
avatar for Adriana Villela

Adriana Villela

Principal Developer Advocate, Dynatrace
Adriana Villela is a Principal Developer Advocate, helping companies achieve reliability greatness through Observability, SRE, & DevOps practices. Previously, she managed a Platform Engineering team & an Observability Practices team at Tucows. Adriana has worked at various large-scale... Read More →
Tuesday June 17, 2025 14:50 - 15:20 JST
Level 1 | Pegasus Ballroom A+B1
  Observability

15:50 JST

The Future of Prometheus Exposition Format - Arthur Sens, Grafana Labs
Tuesday June 17, 2025 15:50 - 16:20 JST
OpenMetrics (OM) had a wild journey: it started as a project to standardize the Prometheus exposition format, and it became an entirely separate CNCF Incubating project. Even though the project had high maturity, it struggled for years to find tools to comply with the first version of the spec. Finally, in 2025, it was incorporated back into the Prometheus Github organization so Prometheus developers could lead the efforts for OM 2.0.

In this talk, Arthur, a Prometheus maintainer and OpenMetrics contributor, will walk you through the main challenges that tools like Prometheus and OpenTelemetry face when trying to comply with OpenMetrics 1.0 and how the community plans to address these challenges in OM 2.0.

The audience will also learn how changing an exposition format can make Prometheus and OpenTelemetry-Collector more memory-efficient while making their specifications easier to translate into each other!
Speakers
avatar for Arthur Silva Sens

Arthur Silva Sens

Senior Software Engineer, Grafana Labs
Arthur Sens is a Software Engineer at Grafana, focusing on Prometheus and OpenTelemetry interoperability. He is also an active member and maintainer for both communities. The only things that can take Arthur away from the computer are his passion for lifting unnecessarily heavy... Read More →
Tuesday June 17, 2025 15:50 - 16:20 JST
Level 1 | Pegasus Ballroom A+B1
  Observability

16:30 JST

From Moon Prism Power To eBPF Super Saiyan: A Guide To Cloud Native Security & Observability - Carla Gaggini, Isovalent at Cisco & Kenta Tada, Toyota Motor Corporation
Tuesday June 17, 2025 16:30 - 17:00 JST
Join us on an anime-powered journey to explore the power of eBPF in cloud native. Inspired by Sailor Moon and Dragon Ball GT, this talk breaks down how eBPF enhances observability, networking, and security, using the Sailor Guardians and Gohan to make eBPF approachable. We’ll break down topics like real-time observability by comparing it to psychic flames.
And just as unchecked power in anime can lead to chaos, eBPF’s immense capabilities need to be used wisely. Drawing from Dragon Ball GT's arc where the Dragon Balls become a threat, we’ll examine best practices for safe deployment of eBPF (e.g., LSM, verifier constraints, Linux Capability restrictions). Spoiler alert: our Sailor Guardians and Gohan will come out victorious, worry not!

By the end, you’ll gain a clear understanding of how to wield eBPF safely and effectively in cloud native environments, no Super Saiyan transformation required!
Speakers
avatar for Carla Gaggini

Carla Gaggini

Head of Global Community Events, Isovalent at Cisco
Carla has been managing events and communities since 2011, producing and running many conferences, meetups, webinars and hackathons. Formerly managing everything “WTF is Cloud Native” and “Software Circus”, she now buzzes around eBPF at Isovalent at Cisco. She is one of the... Read More →
avatar for Kenta Tada

Kenta Tada

Project Manager, Toyota Motor Corporation
Kenta Tada is an experienced Linux engineer and team lead who has worked with various organizations. He has contributed to the Linux kernel and BPF tools and has served as a reviewer for the system validator used in kubeadm. He also reviewed the Japanese translation of the book... Read More →
Tuesday June 17, 2025 16:30 - 17:00 JST
Level 1 | Pegasus Ballroom A+B1
  Cloud Native Novice
  • Content Experience Level Any
  • Presentation Language English
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Content Experience Level
  • Presentation Language
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.