Spotlight
Razeen Abdal-Rahman
This tutorial shows how to run a small, security-focused k3s cluster on a Raspberry Pi inside a normal home network with tight hardware and networking limits.
Alexandr Ivenin
This article explains how GKE 1.33 and 1.34 improve node auto-provisioning with ComputeClasses, workload-specific scaling, parallel node pool creation, and smarter consolidation for targeted autoscaling.
Nick Roan
This case study shows how a single RAG chunk size change collapsed vLLM prefix-cache hit rate from 85% to 4%, triggering an 80% GPU replica increase while latency stayed flat.
It also includes the fix: adding a two-phase cache replay gate in CI.
Dat Ton
This case study explains how cURL 65 errors and DNS resolution failures on AWS EKS were caused by exceeding Linux kernel network limits, and how they were resolved by increasing the netdev_budget, netdev_budget_usecs, and netdev_max_backlog parameters.
Tools and utilities
Node Healthcheck Operator automatically detects unhealthy nodes and triggers pluggable remediators such as BMC, Cluster API, or software reboots to recover workloads without manual intervention.
Clabernetes is a Kubernetes controller that deploys valid containerlab topologies into a Kubernetes cluster, effectively bringing containerlab workflows into a distributed Kubernetes environment.
H8s is a home infrastructure project combining Kubernetes with Talos OS security, running on two N100 mini PCs with GitOps deployment via ArgoCD.
ayaFlow is an eBPF-based Rust tool that runs as a sidecarless DaemonSet to capture node-wide network traffic, expose metrics, and provide lightweight kernel-level visibility for troubleshooting and observability.
kubevirt-benchmark is a vendor-neutral performance testing toolkit for KubeVirt VMs on OpenShift or any Kubernetes distribution, covering VM provisioning, boot storms, live migration, chaos benchmarking, and failure recovery.
Events starting soon
May 19, 2026
Location: Soltau, DE
This event requires an entrance fee.
May 19, 2026
Location: Aarhus, DK
This is a free event.
May 19, 2026
This is a virtual event.
This is a free event.
May 19, 2026
Location: San Jose, CA, USA
This event requires an entrance fee.
May 19, 2026
Location: San Jose, CA, USA
This event requires an entrance fee.
May 20, 2026
This is a virtual event.
This event requires an entrance fee.
Learn from production
Matt Camp
This case study shows how Unitary built Osmia, an open-source orchestration layer on EKS, to run autonomous AI coding agents safely at scale using pod isolation, Karpenter, IRSA-based secrets, and real-time trajectory scoring.
Ejiroghene Laurel Dafe
This case study shows how one engineer resolved two real Kubernetes production incidents involving an overly aggressive Ingress rate limit and Istio breaking non-HTTP socket traffic.
Varun Arora
This case study shows how to build a centralized multi-account AWS monitoring platform that manages 25+ accounts, using Python Boto3 to fetch resource configurations into MongoDB, with a Flask API and a Next.js frontend, achieving $30k in annual savings.
Rob Sherling
This case study shows how EMC Healthcare built an on-premises CI/CD pipeline using K3s, ArgoCD, and Argo Workflows to automate testing and deployments with preview environments.
Matching jobs
Machine Learning Engineer with iFood
Salary: $27K to $246.4K a year
Location: remote from
Tech stack: Kubernetes, AWS, Rust
Software Engineer with iFood
Salary: $30.24K to $247.5K a year
Location: remote from
Tech stack: Kubernetes, AWS, Docker, Go, Kotlin, SQL, RabbitMQ, Kafka
Data Engineer with Filevine
Salary: $18K to $297.88K a year
Location: remote from
Tech stack: Kubernetes, AWS, Azure, GCP, Docker, C#, Python, SQL, Snowflake, Airflow
Data Engineer with Firmus Technologies
Salary: US$88.2K to US$286K a year
Location: based in the office in Singapore, SG
Tech stack: Kubernetes, AWS, Azure, GCP, Helm, Python, SQL, Snowflake, Kafka, Airflow
Head Of Engineering with Cyara
Salary: $185.4K to $385K a year
Location: remote from
Tech stack: Kubernetes, AWS, Docker
Build something
M. Syawal Saputra
This tutorial teaches how to build an elastic Kubernetes platform on VMware vSphere using Cluster API, Talos, and Cluster Autoscaler for declarative provisioning and automatic node scaling.
George Zefkilis
This tutorial shows how to build a local data platform using KinD for Kubernetes and Terraform for infrastructure provisioning, with Argo CD as the GitOps engine for deploying and managing applications.
Josh Woolbright
This tutorial shows how to add observability to an EKS cluster by migrating to ArgoCD's App of Apps pattern and deploying kube-prometheus-stack with a properly configured EBS CSI driver and kubelet cAdvisor configuration.
Joseignacio Carretero
This tutorial teaches how to set up a local DNS server specifically for demo environments using dnsmasq and Docker containers.
More articles
Netflix Technology Blog
This article explains how Netflix traced severe container launch slowdowns to Linux mount lock contention, image layer mount storms, and CPU architecture differences while scaling containers on modern Kubernetes infrastructure.
Anish Kumar – The DevOps Guy
This article explains how Kubeshark provides packet-level visibility in Kubernetes by capturing live pod traffic, decoding protocols such as HTTP and gRPC, and mapping requests back to workloads for debugging.
Kai Burjack
This case study explains why Nginx Ingress Controller causes TCP connection timeouts on GKE during pod termination.
It traces:
Nick Krestakos
This article explains a real homelab migration from Docker Compose to a three-node K3s cluster, covering: