Spotlight
This article explains five Ingress-NGINX behaviors that can break migrations, including path-matching differences, regex quirks, rewrite behavior, and annotation mismatches when migrating to another ingress solution.
Kalyan Josyula
This case study shows how a team traced repeated pod OOM kills in ASP.NET Core to native memory growth from zombie SignalR connections, glibc fragmentation, and kernel socket buffers.
This article explains that BuildKit is not just Docker’s build engine but a general-purpose framework that can turn custom frontend definitions into images, tarballs, local artifacts, and package outputs.
Nick Roan
This case study shows how a single RAG chunk size change collapsed vLLM prefix-cache hit rate from 85% to 4%, triggering an 80% GPU replica increase while latency stayed flat.
It also includes the fix: adding a two-phase cache replay gate in CI.
Tools and utilities
eksup analyzes your EKS cluster and generates a step-by-step upgrade playbook, flagging deprecated APIs, add-on version mismatches, and node group issues before you upgrade.
k10s is a terminal dashboard for watching multiple Kubernetes clusters at once, with side-by-side views, health signals, warnings, and recent logs in one screen.
Valkey Operator is a Kubernetes operator that automates deployment and lifecycle management of Valkey clusters and instances with features like automated installation and configuration management.
Teleskopio is a small, open-source Kubernetes web client that provides a clean browser interface for viewing and managing cluster resources without the weight of a full platform dashboard.
Mocker is a Docker-compatible CLI and Compose tool for macOS that runs on Apple’s Containerization framework, so existing Docker commands and compose files work without Docker Desktop.
Events starting soon
May 18, 2026
Location: London, UK
This event requires an entrance fee
May 18, 2026
Location: Boston, MA, USA
This event requires an entrance fee
May 18, 2026
Location: San Francisco, CA, USA
This event requires an entrance fee
May 19, 2026
Location: Soltau, DE
This event requires an entrance fee
May 19, 2026
Location: Aarhus, DK
This is a free event.
May 19, 2026
This is a virtual event
This is a free event.
Learn from production
Aditya Suryawanshi
This is a war story about a 3-person startup that replaced a $14,850/month over-engineered Kubernetes setup on AWS with Fly.io for $680, cutting P99 latency from 320ms to 180ms and deploy time from 8 minutes to 45 seconds.
DV Engineering
This case study shows how DoubleVerify built a Kubernetes and Ray serving platform to deploy and scale ML models in production.
It also covers RayService wrapped with Helm, fault tolerance with external Redis, and platform gains like 30% lower GPU cost.
Danny Steenman
This case study shows how NetworkLessons migrated from Kubernetes to ECS Fargate using AWS CDK, reducing operational complexity while implementing multi-account architecture, automated cost controls, and infrastructure as code.
Sphoorthi Charan Nayakudugari
This case study explains how the authors used dynamic MIG partitioning to split large GPUs like NVIDIA A100/H100 into multiple isolated slices, letting many small jobs share GPU efficiently.
Matching jobs
DevOps Engineer with Epic Kids Inc.
Salary: $90 to $484K a year
Location: remote from
Tech stack: Kubernetes, GCP, Helm, ArgoCD, Docker, Python, Airflow, Terraform, GitHub Actions, Jenkins
DevOps Engineer with Prime Intellect
Salary: $150K to $300K a year
Location: based in the office (and remote from home) in San Francisco, CA, USA
Tech stack: Kubernetes, GCP, Go, Python, Rust, Typescript, Terraform, Ansible, Grafana, Prometheus
DevOps Engineer with TMS LLC
Salary: $85 a day
Location: remote from
Tech stack: Kubernetes, Azure, Terraform, Azure DevOps
DevSecOps Engineer with CHAOS Industries
Salary: $140K to $220K a year
Location: based in the office in Hawthorne, CA, USA
Tech stack: Kubernetes, Azure, On-premise, Docker, C++, Go, Java, Python, Rust, Azure DevOps
DevSecOps Engineer with PactFi
Salary: $12.6K to $445.5K a year
Location: based in the office in New York, NY, USA
Tech stack: Kubernetes, AWS, Docker, Python, Redis, Pulumi, Terraform, Jenkins
Build something
augusthottie
This tutorial shows how to add Prometheus, Grafana, Alertmanager, custom metrics, ServiceMonitors, dashboards, and alert rules to an EKS cluster through GitOps.
Andrew Pitt
This tutorial shows how to run an open source LLM on OpenShift with Red Hat AI Inference Server based on vLLM, using a PVC, GPU-backed deployment, OpenAI-compatible endpoint, model switching, and an optional AnythingLLM UI.
Franck Pachot
This tutorial shows how to install CloudNativePG 1.28 operator and deploy a three-node PostgreSQL cluster with synchronous replication and quorum-based failover, then tests transient failure recovery by pausing the primary container.
Joseignacio Carretero
This tutorial teaches how to set up a local DNS server specifically for demo environments using dnsmasq and Docker containers.
More articles
Netflix Technology Blog
This article explains how Netflix traced severe container launch slowdowns to Linux mount lock contention, image layer mount storms, and CPU architecture differences while scaling containers on modern Kubernetes infrastructure.
This article shows how to build a self-healing registry mirror on GKE with zot and automation that copies remote images locally and rewrites deployments to avoid Docker Hub rate limits and ImagePullBackOff failures.
This article explains how the Kubernetes Image Promoter was rewritten to improve rate limiting, observability, and resilience in the pipeline that publishes and signs images for registry·k8s·io.
Vadim Alekseev
This article compares major Kubernetes log collectors with a reproducible benchmark focused on: