Spotlight
Andrew Pitt
This tutorial shows how to run an open source LLM on OpenShift with Red Hat AI Inference Server based on vLLM, using a PVC, GPU-backed deployment, OpenAI-compatible endpoint, model switching, and an optional AnythingLLM UI.
Devansh Jain
This article explains how to run Temporal on Kubernetes in production, covering GKE deployment, Cassandra repair and backups, Istio mTLS, resource sizing, PodDisruptionBudgets, and Prometheus-based monitoring.
Matt Camp
This case study shows how Unitary built Osmia, an open-source orchestration layer on EKS to run autonomous AI coding agents safely at scale using pod isolation, Karpenter, IRSA-based secrets, and real-time trajectory scoring.
Aditya Suryawanshi
This is a war story about a 3-person startup that replaced a $14,850/month over-engineered Kubernetes setup on AWS with Fly.io for $680, cutting P99 latency from 320ms to 180ms and deploy time from 8 minutes to 45 seconds.
Tools and utilities
Radar provides Kubernetes cluster visibility through:
Siclaw is an open source AI SRE platform for read-only infrastructure diagnostics, root cause analysis, team workflows, Kubernetes access, and MCP-based investigation without changing live systems directly.
GoKubeDownscaler is a horizontal autoscaler, written in Go, that automatically scales down Kubernetes deployments, statefulsets, and other resources on time schedules to save costs.
This tool benchmarks Kubernetes log collectors by measuring throughput, CPU, memory, and log loss with a built-in verifier across agents like Vector, Fluent Bit, OpenTelemetry Collector, and Grafana Alloy.
Kube-Argus is a single-binary Kubernetes dashboard that combines live cluster state, log streaming, YAML editing, drain workflows, cost analysis, and AI-assisted diagnosis in one web interface.
Events starting soon
May 2, 2026
Location: Cambridge, MA, USA
This event requires an entrance fee
May 5, 2026
This is a virtual event
This is a free event.
May 5, 2026
Location: Austin, TX, USA
This event requires an entrance fee
May 5, 2026
Location: Oslo, NO
This is a free event.
May 5, 2026
Location: Offenbach, DE
This is a free event.
May 6, 2026
Location: Zurich, CH
This event requires an entrance fee
A single Kubernetes CRD for every service request turns small changes into full-platform reconciliations.
Alexander Held, former platform engineer at Mercedes-Benz Tech Innovation, describes a production refactor from a 2,000-line CRD to purpose-built resources and controllers. He shows how teams can model business workflows as Kubernetes APIs and then use owner references, finalizers, and events to keep platform operations predictable.
You will learn:
Learn from production
Ejiroghene Laurel Dafe
This case study shows how one engineer resolved two real Kubernetes production incidents involving an overly aggressive Ingress rate limit and Istio breaking non-HTTP socket traffic.
Maxim Nazarenko
This case study explains how to migrate bound Kubernetes volumes from deprecated in-tree Azure Disk provisioning to CSI with in-place PVC re-binding, minimal restarts, and no data loss across production disks.
DV Engineering
This case study shows how DoubleVerify built a Kubernetes and Ray serving platform to deploy and scale ML models in production.
It also covers RayService wrapped with Helm, fault tolerance with external Redis, and platform gains like 30% lower GPU cost.
Varun Arora
This case study shows how to build a centralized multi-account AWS monitoring platform that manages 25+ accounts, using Python Boto3 to fetch resource configurations into MongoDB, with a Flask API and a Next.js frontend, achieving $30k in annual savings.
Matching jobs
Data Engineer with Agile Defense
Salary: $54K to $297.88K a year
Location: based in the office in Chantilly, VA, USA
Tech stack: Kubernetes, Docker, Python, Kafka, Spark, GitHub Actions
Data Engineer with Teads
Salary: US$45K to US$275K a year
Location: based in the office (and remote from home) in Netanya, IL
Tech stack: Kubernetes, AWS, GCP, Docker, SQL, Java, Python, Scala, TypeScript, Flink
DevOps Engineer with zooplus SE
Salary: $116.1K to $302.5K a year
Location: based in the office (and remote from home) in Krakow, PL; Wroclaw, PL
Tech stack: Kubernetes, AWS, Helm, Docker, Python, Rust, Terraform, Jenkins, Ansible, Sensu
Engineering Manager with SupplyHouse.com
Salary: $65K to $92K a year
Location: remote from
Tech stack: Kubernetes, Docker, Java, JavaScript, TypeScript, Redis, MySQL
Machine Learning Engineer with Gen Digital Inc.
Salary: $45K to $462K a year
Location: based in the office in New York, NY, USA
Tech stack: Kubernetes, AWS, Docker, Terraform
Build something
M. Syawal Saputra
This tutorial teaches how to build an elastic Kubernetes platform on VMware vSphere using Cluster API, Talos, and Cluster Autoscaler for declarative provisioning and automatic node scaling.
Alexandr Ivenin
This tutorial shows how to automate EKS deployments with Argo CD, Argo CD Image Updater, GitHub, and Amazon ECR so new container images flow to the cluster through GitOps without manual deployment steps.
Charles Sullivan
This tutorial teaches how to build a cert-manager external issuer that uses a YubiHSM 2 to sign TLS certificates via Go's crypto.Signer interface.
Saeed Anwar Ansari
This tutorial explains how to use KEDA on GKE to autoscale workloads based on event-driven signals rather than just CPU or memory.
Call for Papers closing soon
1
day
Location: Shanghai, CN
In-person conference organized by CNCF.
The conference starts on 9 September 2026.
1
day
Location: Berlin, DE
In-person conference organized by Devopsdays.
The conference starts on 29 September 2026.
6
days
Location: Hamburg, DE
In-person conference organized by techcamp.
The conference starts on 26 August 2026.
8
days
Location: Kraków, PL
In-person conference organized by Devopsdays.
The conference starts on 4 July 2026.
13
days
Location: Hamburg, DE
In-person conference organized by code.talks.
The conference starts on 5 November 2026.
14
days
Location: Denver, CO, USA
In-person conference organized by Devopsdays.
The conference starts on 22 September 2026.
14
days
Michigan Technology Conference 2026
Location: Rochester, MI, USA
In-person conference organized by The Michigan Technology Conference Association.
The conference starts on 30 October 2026.
More articles
Kai Burjack
This case study explains why Nginx Ingress Controller causes TCP connection timeouts on GKE during pod termination.
It traces:
Fabián Sellés Rosa
This article explains how one team evaluated Crossplane and KRO to replace KIAM with EKS Pod Identities, balancing flexibility, maturity, and operational overhead after outages.
This article explains how the Kubernetes Image Promoter was rewritten to improve rate limiting, observability, and resilience in the pipeline that publishes and signs images for registry.k8s.io.
Nick Krestakos
This article explains a real homelab migration from Docker Compose to a three-node K3s cluster, covering: