Loading…
16-17 June
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Japan 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Japan Standard Time (UTC+9:00)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Company: Advanced clear filter
Monday, June 16
 

12:10 JST

New Cache Hierarchy for Container Images and OCI Artifact in Kubernetes Clusters Using Containerd - Toru Komatsu & Hidehito Yabuuchi, Preferred Networks, Inc.
Monday June 16, 2025 12:10 - 12:40 JST
One of the key bottlenecks in Kubernetes pod startup is the time taken to pull container images and OCI artifacts. It’s also costly to fetch large container images from the registry often. To tackle this problem, we developed a cache system with the following features:

* New Cache Hierarchy: Images pulled by pods are shared across the entire cluster, enabling cluster-wide optimization, not only cluster-local cache.
* Ninja: Users experience faster container image pulls without any changes on their part. Just like a ninja, the system stealthily enhances performance.
* Preheating: It supports pushing images to preheat the cache for subsequent pulls.

Deployed in a production cluster, the cache system has achieved a cache hit rate of around 95%, significantly reducing pod startup times and network communication with registries. Attendees will learn practical insights into leveraging cache and CRI to optimize image and OCI artifact pulls, ultimately enhancing cluster efficiency.
Speakers
avatar for Toru Komatsu

Toru Komatsu

Software Engineer, Preferred Networks, Inc.
Toru is the creator and lead developer of one of the CNCF Projects, Youki, an OCI runtime written in Rust. He is also a maintainer of the OCI Runtime Specification. Additionally, he is a member of Kubernetes and containerd and serves as a reviewer for runwasi. Toru is involved in... Read More →
avatar for Hidehito Yabuuchi

Hidehito Yabuuchi

Software Engineer, Preferred Networks, Inc.
Hidehito Yabuuchi is a Software Engineer at Preferred Networks, Inc. He has worked on on-premises Kubernetes clusters primarily for ML and HPC. His main interests are Kubernetes schedulers, container image registries, cooperation with public clouds, among others. He also has led the... Read More →
Monday June 16, 2025 12:10 - 12:40 JST
Level 1 | Pegasus B2-C
  Operations + Performance

14:50 JST

Zero-Extraction Cold Starts: How FUSE-Streaming Slashed ComfyUI Cold Starts by 10x - Fog Dong, BentoML
Monday June 16, 2025 14:50 - 15:20 JST
Cold-start delays for GPU-heavy GenAI apps like ComfyUI aren’t just about speed—they’re architectural failures. While others optimize incremental steps, we eliminate entire phases: no image downloads, no layer extraction, no redundant model copies.

We introduce a radical Kubernetes-native pattern: Direct-to-GPU streaming via FUSE-mounted object storage (S3/GCS), bypassing legacy container workflows. By rearchitecting the snapshotter to support seekable, on-demand FUSE streaming, we enable:

- Instant container boot: Models/CUDA dependencies mount directly from object storage, avoiding registry bottlenecks (40MB/s → 900MB/s throughput)
- Zero-extraction overhead: Layers load incrementally via range-optimized fetches, eliminating Zstd unpack/copy latency
- True cold start elimination: ComfyUI pods activate in 90s (vs. 8+ mins) by co-locating model mounting and inference prep

We’ll dissect a live ComfyUI deployment using 100% OSS primitives to hack container internals in the session.
Speakers
avatar for Fog Dong

Fog Dong

Senior Software Engineer, BentoML
Fog Dong, a Senior Engineer at BentoML, KubeVela maintainer, CNCF Ambassador, and LFAPAC Evangelist, has a rich background in cloud native and AI infra. Previously instrumental in developing Alibaba's large-scale Serverless workflows and Bytedance's cloud-native CI/CD platform, she... Read More →
Monday June 16, 2025 14:50 - 15:20 JST
Level 1 | Orion
  AI + ML
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Content Experience Level
  • Presentation Language
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.