Spotlight
Kalyan Josyula
This case study shows how a team traced repeated pod OOM kills in ASP.NET Core to native memory growth from zombie SignalR connections, glibc fragmentation, and kernel socket buffers.
Moeid Heidari
This tutorial teaches how to deploy Crossview on Kubernetes with Helm and secure it for enterprise use with session auth, SSO, proxy header auth, RBAC, TLS, and high-availability settings.
David Kornel
This tutorial shows how to test Kubernetes deployments and operators from Java on real clusters without heavy boilerplate by using kubetest4j on top of the Fabric8 client.
This tutorial explains how Amazon EKS Pod Identity session policies let teams restrict pod IAM permissions with inline policies.
Tools and utilities
k10s is a terminal dashboard for watching multiple Kubernetes clusters at once, with side-by-side views, health signals, warnings, and recent logs in one screen.
KubeDiagrams is a tool that automatically generates visual architecture diagrams from Kubernetes manifests, Helm charts, and live clusters.
H8s is a home infrastructure project combining Kubernetes with Talos OS security, running on 2 N100 mini PCs with GitOps deployment via ArgoCD.
Nelm is meant to be a direct replacement for Helm 3, providing first-class Helm chart support yet improving on what Helm 3 offers.
Valkey Operator is a Kubernetes operator that automates deployment and lifecycle management of Valkey clusters and instances with features like automated installation and configuration management.
Events starting soon
May 14, 2026
Location: Nashville, TN, USA
This event requires an entrance fee
May 14, 2026
This is a virtual event
This is a free event.
May 14, 2026
This is a virtual event
This is a free event.
May 14, 2026
Location: Portland, OR, USA
This is a free event.
May 15, 2026
Location: TASHKENT, UZ
This event requires an entrance fee
May 15, 2026
Location: İstanbul, TR
This event requires an entrance fee
Most teams scale Kubernetes by thinking about pods and nodes. At Render, Brian Stack ran into a different dimension: hundreds of thousands of namespaces per cluster, multiplied across DaemonSets that list-watch every namespace.
Brian explains how Render traced the issue through Calico and Vector, worked with upstream maintainers, and turned memory profiling into operational wins: lower node costs, lighter API-server load, and faster rollouts.
In this interview:
Learn from production
Nick Roan
This case study shows how a single RAG chunk size change collapsed vLLM prefix-cache hit rate from 85% to 4%, triggering an 80% GPU replica increase while latency stayed flat.
It also includes the fix: adding a two-phase cache replay gate in CI.
Dat Ton
This case study explains how cURL 65 errors and DNS resolution failures on AWS EKS were caused by Linux kernel network limits being exceeded, resolved by increasing netdev_budget, netdev_budget_usecs, and netdev_max_backlog parameters.
Matt Camp
This case study shows how Unitary built Osmia, an open-source orchestration layer on EKS to run autonomous AI coding agents safely at scale using pod isolation, Karpenter, IRSA-based secrets, and real-time trajectory scoring.
Aditya Suryawanshi
This is a war story about a 3-person startup that replaced a $14,850/month over-engineered Kubernetes setup on AWS with Fly.io for $680, cutting P99 latency from 320ms to 180ms and deploy time from 8 minutes to 45 seconds.
Matching jobs
Data Engineer with Brillio
Salary: $64.8K to $2.48L a year
Location: based in the office in Bangalore, KA, IN
Tech stack: Kubernetes, AWS, Docker, Scala, Shell, SQL, DynamoDB, Spark, Cloudformation
DevOps Engineer with 3Pillar
Salary: $126.9K to $302.5K a year
Location: remote from
Tech stack: Kubernetes, AWS, Azure, GCP, Docker, C++, Java, Redis, Cloudformation, Terraform
DevOps Engineer with Aircall
Salary: $99.9K to $275K a year
Location: based in the office in Paris, FR
Tech stack: Kubernetes, AWS, Docker, Go, Python, Typescript, Cloudformation, Terraform, Datadog
DevOps Engineer with Aledade
Salary: $49.5K to $539K a year
Location: remote from
Tech stack: Kubernetes, AWS, Docker, Go, Python, Pulumi, Terraform, GitHub Actions
DevOps Engineer with Apply Digital
Salary: $67.5K to $539K a year
Location: remote from
Tech stack: Kubernetes, GCP, Docker, Python, Shell, Terraform, GitHub Actions, Gitlab
Build something
Shawrup K Suter
This tutorial shows how CRaC can cut Spring Boot startup time on Kubernetes from 23 seconds to 2.8 seconds and explains the real production issues around AWS SDK checkpointing and OpenTelemetry.
augusthottie
This tutorial shows how to add Prometheus, Grafana, Alertmanager, custom metrics, ServiceMonitors, dashboards, and alert rules to an EKS cluster through GitOps.
Felix Hoang
This tutorial teaches how to eliminate static kubeconfig files by configuring HashiCorp Vault as an OIDC provider for authentication with dynamic, short-lived tokens.
Sajosam
This tutorial shows how to build a self-service IDP where developers provision real AWS S3 buckets via a Backstage form, with Crossplane handling AWS API calls through Kubernetes CRDs.
Call for Papers closing soon
1
days
Location: Hamburg, DE
In-person conference organized by code.talks.
The conference starts on the 5 November 2026.
2
days
Location: Denver, CO, USA
In-person conference organized by Devopsdays.
The conference starts on the 22 September 2026.
2
days
Michigan Technology Conference 2026
Location: Rochester, MI, USA
In-person conference organized by The Michigan Technology Conference Association.
The conference starts on the 30 October 2026.
3
days
Location: San Jose, CA, USA
In-person conference organized by TechEx Events.
The conference starts on the 19 May 2026.
4
days
Location: London, UK
In-person conference organized by Devopsdays.
The conference starts on the 17 September 2026.
4
days
Location: Rio de Janeiro, BR
In-person conference organized by Devopsdays.
The conference starts on the 15 August 2026.
5
days
Kubernetes Community Days Lima 2026
Location: Lima, PE
In-person conference organized by KCD Lima, Perú.
The conference starts on the 18 July 2026.
More articles
Alexandr Ivenin
This article explains how GKE 1.33 and 1.34 improve node auto provisioning with ComputeClasses, workload specific scaling, parallel node pool creation, and smarter consolidation for targeted autoscaling.
Marco Piraccini
This article explains how Kubernetes skew protection routes traffic based on app version to prevent frontend and backend mismatches during deployments, and version-aware routing using the Gateway API.
Dillon
This article shows how to maintain VM-level network security during KubeVirt live migration by using Calico labels and policy enforcement rather than node or pod IPs.
Abhishek Gupta
This article explains how the DocumentDB Kubernetes Operator delivers high availability with automatic failover, replica promotion, and optional zone, region, and multi-cloud resilience.