Spotlight
Vedran Lebo
This article introduces ctx_, a CLI tool that switches an entire DevOps working context at once, including Kubernetes context, cloud credentials, environment variables, VPN, SSH tunnels, secrets, and browser profile.
This article explains five Ingress-NGINX behaviors that can break migrations, including path-matching differences, regex quirks, rewrite behavior, and annotation mismatches when migrating to another ingress solution.
Kalyan Josyula
This case study shows how a team traced repeated pod OOM kills in ASP.NET Core to native memory growth from zombie SignalR connections, glibc fragmentation, and kernel socket buffers.
This article explains that BuildKit is not just Docker’s build engine but a general-purpose framework that can turn custom frontend definitions into images, tarballs, local artifacts, and package outputs.
Tools and utilities
Kubeconform is a Kubernetes manifests validation tool.
Context Builder is a CLI tool that extracts metadata from Kubernetes, Grafana, Datadog and other systems to generate structured context files for AI agents, improving debugging accuracy and reducing guesswork.
Cluster Agent Swarm Skills is a collection of specialized AI agent skills for Kubernetes and OpenShift operations, covering cluster management, GitOps, security, observability, incident response, and platform workflow orchestration.
eksup analyzes your EKS cluster and generates a step-by-step upgrade playbook, flagging deprecated APIs, add-on version mismatches, and node group issues before you upgrade.
k10s is a terminal dashboard for watching multiple Kubernetes clusters at once, with side-by-side views, health signals, warnings, and recent logs in one screen.
Events starting soon
May 21, 2026
Location: Barton, AU
This is a free event.
May 21, 2026
Location: Geneva, CH
This event requires an entrance fee
May 21, 2026
Location: Madrid, ES
This event requires an entrance fee
Use PARTNER-DISC-20-KUBE to get 20% off
May 21, 2026
Location: Tokyo, JP and virtual
This is a free event.
May 21, 2026
Location: Zürich, CH
This is a free event.
May 21, 2026
Location: Groningen, NL
This is a free event.
Learn from production
Dat Ton
This case study explains how cURL 65 errors and DNS resolution failures on AWS EKS were caused by Linux kernel network limits being exceeded, resolved by increasing netdev_budget, netdev_budget_usecs, and netdev_max_backlog parameters.
Aditya Suryawanshi
This is a war story about a 3-person startup that replaced a $14,850/month over-engineered Kubernetes setup on AWS with Fly.io for $680, cutting P99 latency from 320ms to 180ms and deploy time from 8 minutes to 45 seconds.
DV Engineering
This case study shows how DoubleVerify built a Kubernetes and Ray serving platform to deploy and scale ML models in production.
It also covers RayService wrapped with Helm, fault tolerance with external Redis, and platform gains like 30% lower GPU cost.
Danny Steenman
This case study shows how NetworkLessons migrated from Kubernetes to ECS Fargate using AWS CDK, reducing operational complexity while implementing multi-account architecture, automated cost controls, and infrastructure as code.
Matching jobs
Data Engineer with Capital Technology Group
Salary: $26 to $297.88K a year
Location: remote from
Tech stack: Kubernetes, AWS, Docker, Java, Python, SQL, PostgreSQL, Kafka, Terraform, Splunk
Data Engineer with Rivian and Volkswagen Group Technologies
Salary: US$97.2K to US$220.22K a year
Location: based in the office in Vancouver, CA
Tech stack: Kubernetes, Go, Spark
DevOps Engineer with Blink Health
Salary: $90 to $484K a year
Location: remote from
Tech stack: Kubernetes, AWS, Grafana, Nagios, NewRelic
DevOps Engineer with Bluestaq
Salary: $95K to $140K a year
Location: based in the office in Colorado Springs, CO, USA
Tech stack: Kubernetes, AWS, Azure, GCP, On-premise, Powershell, Python, Shell, Terraform, Ansible
DevOps Engineer with Box
Salary: $90 to $484K a year
Location: based in the office in Warsaw, PL
Tech stack: Kubernetes, AWS, Azure, GCP, Docker, Go, Python, Ruby, Typescript, GitHub Actions
Build something
augusthottie
This tutorial shows how to add Prometheus, Grafana, Alertmanager, custom metrics, ServiceMonitors, dashboards, and alert rules to an EKS cluster through GitOps.
Andrew Pitt
This tutorial shows how to run an open source LLM on OpenShift with Red Hat AI Inference Server based on vLLM, using a PVC, GPU-backed deployment, OpenAI-compatible endpoint, model switching, and an optional AnythingLLM UI.
Franck Pachot
This tutorial shows how to install CloudNativePG 1.28 operator and deploy a three-node PostgreSQL cluster with synchronous replication and quorum-based failover, then tests transient failure recovery by pausing the primary container.
Joseignacio Carretero
This tutorial teaches how to set up a local DNS server specifically for demo environments using dnsmasq and Docker containers.
More articles
Netflix Technology Blog
This article explains how Netflix traced severe container launch slowdowns to Linux mount lock contention, image layer mount storms, and CPU architecture differences while scaling containers on modern Kubernetes infrastructure.
This article shows how to build a self-healing registry mirror on GKE with zot and automation that copies remote images locally and rewrites deployments to avoid Docker Hub rate limits and ImagePullBackOff failures.
This article explains how the Kubernetes Image Promoter was rewritten to improve rate limiting, observability, and resilience in the pipeline that publishes and signs images for registry·k8s·io.
Vadim Alekseev
This article compares major Kubernetes log collectors with a reproducible benchmark focused on: