Spotlight

What Happens When You Run Java at Scale on Kubernetes

Michael Preston

This article explains the challenges of running Java at scale on Kubernetes, covering JVM memory management with container limits, heap sizing with MaxRAMPercentage, CPU throttling, and garbage collector selection for containers.

More articles →

Tools and utilities

  • Infralens: eBPF observability

    InfraLens is a zero-instrumentation observability tool that uses eBPF to automatically discover and visualize service-to-service communication in Kubernetes clusters without requiring code changes or sidecars.

  • KubeAttention

    KubeAttention is a machine learning-powered Kubernetes scheduler plugin that uses eBPF telemetry to detect noisy neighbor interference and place latency-sensitive workloads on optimal nodes.

  • K8up: Kubernetes backup operator

    K8up is a Kubernetes Operator that helps you:

  • OpenEBS

    OpenEBS is a modern Block-Mode storage platform, a Hyper-Converged Software Storage System, and a virtual NVMe-oF SAN (vSAN) Fabric that is natively integrated into Kubernetes' core.

  • AgentDiscover Scanner: AI agent detection

    AgentDiscover Scanner detects autonomous AI agents and Shadow AI in codebases using static analysis for Python and JavaScript, network monitoring for active LLM traffic, and Kubernetes runtime detection via Cilium Tetragon eBPF.

More projects →

Events starting soon

Discover more events onn Kube Events →

GPU Containers as a Service
GPU Containers as a Service

Running GPU workloads on Kubernetes sounds straightforward until you need to isolate multiple tenants on the same server. The moment you virtualize GPUs for security, you lose access to NVIDIA kernel drivers — and almost every tool in the ecosystem assumes those drivers exist.

Landon Clipp built a GPU-based Containers as a Service platform from scratch, solving each isolation layer — from kernel separation with Kata Containers + QEMU to NVLink fabric partitioning to network policies with Cilium/eBPF — and shares exactly what broke along the way.

In this interview:

  • Why standard NVIDIA tooling (GPU Operator) fails in multi-tenant setups, and how to use CDI with PCI topology scanning to make GPUs visible to Kubernetes without kernel drivers
  • How to partition the NVLink fabric between tenants using a trusted service VM running Fabric Manager, and why the physical PCIe wiring differs between Supermicro HGX and NVIDIA DGX systems
  • Why gVisor doesn't work for GPU workloads — NVIDIA's unstable ioctl ABI means Google has to update gVisor for every driver release, and they only support a handful of GPUs
  • What caused 8-GPU VMs to take 30+ minutes to boot, and the specific fixes (IOMMUFD, cold plugging, kernel upgrades) that brought it down to minutes
  • How Cilium network policies enforce tenant isolation at the Kubernetes identity level instead of fragile IP-based rules

Where Containers as a Service fits best: inference workloads where AI teams want to ship an OCI image without managing infrastructure or signing multi-million dollar cluster contracts.

Learn from production

More case studies →

Matching jobs

    • Data Engineer with ILLUIN Technology

    • Salary: $90K to $412.61K a year

    • Location: based in the office in Paris La Défense, FR

    • Tech stack: Kubernetes, On-premise, Docker, SQL, Java, Kotlin, Python, Scala, Snowflake, Terraform

    • Data Engineer with XBOW

    • Salary: $72K to $188.18K a year

    • Location: remote from

    • Tech stack: Kubernetes, C#, Go, Java, Javascript, Python, Shell, SQL, Typescript, PostgreSQL

    • DevOps Engineer with Cosine

    • Salary: US$49.5K to US$275K a year

    • Location: based in the office in London, GB

    • Tech stack: Kubernetes, AWS, On-premise, Helm, Docker, Go, Javascript, Python, Typescript, Redis

    • DevOps Engineer with Meritis

    • Salary: $99.9K to $275K a year

    • Location: based in the office in Sophia Antipolis, FR

    • Tech stack: Kubernetes, AWS, Azure, GCP, Docker, Python, Terraform, GitHub Actions, Jenkins, Ansible

    • DevOps Engineer with Septeo

    • Salary: $99.9K to $275K a year

    • Location: based in the office in Montpellier, FR

    • Tech stack: Kubernetes, AWS, Docker, Terraform, Ansible

Discover more Kubernetes jobs on Kube Careers →

Subscribe to Learn Kubernetes Weekly

Trusted by 77K engineers. Delivered 176 issues and counting.

or subscribe via

Build something

More tutorials →

Call for Papers closing soon

  1. 2

    days

    Devopsdays Amsterdam

    The Call For Paper is open until 29 March 2026 at GMT-4. More info →
    • Location: Amsterdam, NL

    • In-person conference organized by Devopsdays.

    • The conference starts on the 19 June 2026.

    • Apply here
  2. 2

    days

    KubeCon + CloudNativeCon Japan 2026

    The Call For Paper is open until 29 March 2026 at GMT-4. More info →
    • Location: Yokohama, JP

    • In-person conference organized by Linux Foundation.

    • The conference starts on the 30 July 2026.

    • Apply here
  3. 4

    days

    WeAreDevelopers World Congress 2026 North America

    The Call For Paper is open until 31 March 2026 at GMT-4. More info →
    • Location: San Jose, CA, USA

    • In-person conference organized by WeAreDevelopers.

    • The conference starts on the 25 September 2026.

    • Apply here
  4. 4

    days

    J On the Beach

    The Call For Paper is open until 31 March 2026 at GMT-4. More info →
    • Location: Malaga, ES and virtual

    • Online & in-person conference organized by Yay Yay Events.

    • The conference starts on the 29 October 2026.

    • Apply here
  5. 4

    days

    Øredev

    The Call For Paper is open until 31 March 2026 at GMT-4. More info →
    • Location: MALMÖ, SE

    • In-person conference organized by Øredev.

    • The conference starts on the 4 November 2026.

    • Apply here
  6. 4

    days

    Cloud Native Summit Munich 2026

    The Call For Paper is open until 31 March 2026 at GMT-4. More info →
    • Location: Munich, DE

    • In-person conference organized by Cloud Native Summit Munich.

    • The conference starts on the 30 June 2026.

    • Apply here
  7. 4

    days

    Kubernetes Community Days Czech & Slovak - Prague 2026

    The Call For Paper is open until 1 April 2026 at GMT-4. More info →
    • Location: Bratislava, SK

    • In-person conference organized by KCD Czech & Slovak.

    • The conference starts on the 21 May 2026.

    • Apply here

Thanks to our sponsors who make Kube Today possible

Find out more about being a sponsor →

More articles

Even more articles →