Spotlight

The Memory Time Bomb Hiding in Your Kubernetes Cluster

Naveen Krishnan

This article shows that default Kubernetes system pods such as kube-proxy and CoreDNS ship with incomplete or missing memory configurations, making them vulnerable to OOM kills during memory pressure.

More articles →

Tools and utilities

  • Gemini: automated backups

    Gemini is a Kubernetes CRD and operator for managing VolumeSnapshots.

  • scaf: streamlined development

    scaf generates a new project structure with Kubernetes manifests in three Kustomize layers for dev, sandbox, and production.

  • Benchmark Suite for Gateway API Implementations

    This tool provides a comprehensive test suite to evaluate real-world behavior (latency, scale, route propagation, traffic) of Kubernetes Gateway API implementations, beyond basic conformance.

  • Kubernetes Operator for MCP Servers

    MCP Operator deploys and validates MCP servers on Kubernetes with automatic protocol detection for SSE and Streamable HTTP, built-in monitoring via Prometheus and Grafana dashboards, optional metrics sidecar for request tracking, and HPA support.

  • k8s-iac-framework – GitOps IaC Framework for Multi-Cluster Kubernetes

    This framework helps you provision Kubernetes clusters via OpenTofu/Terragrunt, manage apps and system components via Helm + safehelm, and standardize secret encryption and lifecycle commands across environments.

More projects →

Events starting soon

Discover more events on Kube Events →

Intelligent Kubernetes Load Balancing

You're running gRPC services in Kubernetes and load balancing looks fine on the dashboard, but some pods are burning at 80% CPU while others sit idle, and adding more replicas only partially helps.

Rohit Agrawal, a Staff Software Engineer on the traffic platform team at Databricks, explains why this happens and how his team replaced Kubernetes's default networking with a proxy-less, client-side load-balancing system built on the xDS protocol.

In this episode:

  • Why kube-proxy's Layer 4 routing breaks down under high-throughput gRPC: it picks a backend once per TCP connection, not per request, and gRPC multiplexes many requests over a single long-lived HTTP/2 connection
  • How Databricks built an Endpoint Discovery Service (EDS) that watches Kubernetes directly and streams real-time pod metadata to every client
  • How zone-aware spillover cut cross-availability-zone costs without sacrificing availability
  • Why CPU-based routing failed (monitoring lag creates oscillation) and what signals to use instead

The system has been running in production for three years across hundreds of services, handling millions of requests.
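
To make the per-connection versus per-request distinction concrete, here is a minimal simulation sketch. It is not Databricks's implementation: the function names, pod names, and the use of plain round robin for the per-request case are all assumptions chosen for illustration.

```python
import random
from collections import Counter
from itertools import cycle


def per_connection(backends, clients, requests_per_client, seed=0):
    """Layer 4 style: a backend is picked once when the connection opens.

    Every request sent on that long-lived connection lands on the same pod.
    """
    rng = random.Random(seed)
    load = Counter({b: 0 for b in backends})
    for _ in range(clients):
        backend = rng.choice(backends)          # chosen once per TCP connection
        load[backend] += requests_per_client    # all requests follow the connection
    return load


def per_request(backends, clients, requests_per_client):
    """Client-side style: each request is balanced on its own.

    Round robin is used here only as a simple stand-in policy (an assumption).
    """
    load = Counter({b: 0 for b in backends})
    next_backend = cycle(backends)
    for _ in range(clients * requests_per_client):
        load[next(next_backend)] += 1
    return load


if __name__ == "__main__":
    pods = ["pod-a", "pod-b", "pod-c", "pod-d"]  # hypothetical pod names
    print("per connection:", dict(per_connection(pods, 6, 1000)))
    print("per request:   ", dict(per_request(pods, 6, 1000)))
```

With only a handful of long-lived connections, the per-connection counts move in blocks of 1,000 requests and some pods can end up with none, while the per-request counts stay within one request of each other. Adding replicas does not fix the first case unless clients also open new connections, which matches the "only partially helps" symptom described above.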

Learn from production

More case studies →

Matching jobs

    • Software Engineer with New Relic

    • Salary: $4.5K to $5.34L a year

    • Location: based in the office in Hyderabad, IN

    • Tech stack: Kubernetes, AWS, Azure, GCP, Docker, Go, GraphQL, Java, Javascript

    • Technical Success Manager I with New Relic

    • Salary: $47.97K to $321.2K a year

    • Location: remote from

    • Tech stack: Kubernetes, AWS, Azure, Java, Python, Ruby, SQL

    • Technical writer with Nebius

    • Salary: US$108K to US$198K a year

    • Location: based in the office in Amsterdam, NL

    • Tech stack: Kubernetes

    • Agentic Engineer with Hedra

    • Salary: $0 to $771.1K a year

    • Location: based in the office in San Francisco, CA, USA

    • Tech stack: Kubernetes, Python, Typescript

Discover more Kubernetes jobs on Kube Careers →

Subscribe to Learn Kubernetes Weekly

Trusted by 77K engineers. Delivered 178 issues and counting.


Build something

More tutorials →

Call for Papers closing soon

  1. Open Conf 2026 (closes in 5 days)

    The Call for Papers is open until 19 April 2026 (GMT-4). More info →
    • Location: Athens, GR

    • In-person conference organized by Open Conf.

    • The conference starts on 21 November 2026.

    • Apply here
  2. SREday Munich 2026 (closes in 7 days)

    The Call for Papers is open until 21 April 2026 (GMT-4). More info →
    • Location: Munich, DE

    • In-person conference organized by SREday.

    • The conference starts on 15 May 2026.

    • Apply here
  3. CLC26 (closes in 7 days)

    The Call for Papers is open until 21 April 2026 (GMT-4). More info →
    • Location: Mannheim, DE

    • In-person conference organized by Rheinwerk Verlag.

    • The conference starts on 11 November 2026.

    • Apply here
  4. Tech Fuse Des Moines 2026 (closes in 16 days)

    The Call for Papers is open until 30 April 2026 (GMT-4). More info →
    • Location: Des Moines, IA, USA

    • In-person conference organized by Tech Fuse DSM.

    • The conference starts on 16 October 2026.

    • Apply here
  5. Devopsdays Graz (closes in 16 days)

    The Call for Papers is open until 30 April 2026 (GMT-4). More info →
    • Location: Graz, AT

    • In-person conference organized by Devopsdays.

    • The conference starts on 4 September 2026.

    • Apply here
  6. bit summit 2026 (closes in 16 days)

    The Call for Papers is open until 30 April 2026 (GMT-4). More info →
    • Location: Hamburg, DE

    • In-person conference organized by bit summit.

    • The conference starts on 23 September 2026.

    • Apply here
  7. IT-Tage (closes in 16 days)

    The Call for Papers is open until 30 April 2026 (GMT-4). More info →
    • Location: Frankfurt, DE

    • In-person conference organized by Alkmene Verlag.

    • The conference starts on 10 December 2026.

    • Apply here

Thanks to our sponsors who make Kube Today possible

Find out more about being a sponsor →

More articles

Even more articles →