Spotlight
Jack Lindamood
This case study shows how OOM Killer terminated a critical network daemon on Kubernetes nodes, causing a network outage.
It covers debugging via serial console and implementing memory reservations to prevent system-critical process termination.
Ojas Singh
This tutorial teaches how to run a production-ready vLLM inference server on Kubernetes with AMD Instinct GPUs using containerd, the AMD GPU Operator, persistent storage, and MetalLB.
hybrid2k3
This article explains why vanilla Kubernetes has no real login event and shows a practical session-tracking workaround using credential-id fingerprints from audit logs, with a side-by-side comparison against OpenShift OAuth behavior.
Dylan Da Costa
This tutorial explains how to design CloudNativePG for production failure by using plugin-based backups, WAL archiving, point-in-time recovery, snapshots, and PgBouncer so recovery is treated as the real operational priority.
Tools and utilities
Kappal runs your existing docker-compose.yaml on Kubernetes using familiar commands like up, down, logs, exec.
Kubie is a tool that provides an alternative to kubectx, kubens, and the k on prompt modification script, offering context switching, namespace switching, and prompt customization.
Harbor is a CNCF-graduated open source container registry that stores, signs, and scans images, with built-in RBAC, LDAP/OIDC support, vulnerability scanning, policy-based replication, and a full REST API.
zeropod is a tool that automatically checkpoints containers to disk after a certain amount of time of the last TCP connection, allowing for fast and seamless scaling down to zero.
Yoke is an IaC tool inspired by Helm that leverages WebAssembly and Go to dynamically deploy Kubernetes packages with executable runtime capabilities.
Events starting soon
May 25, 2026
This is a virtual event
This is a free event.
May 26, 2026
This is a virtual event
This is a free event.
May 26, 2026
This is a virtual event
This is a free event.
May 26, 2026
This is a virtual event
This is a free event.
May 26, 2026
This is a virtual event
This is a free event.
May 27, 2026
Location: Sibiu, RO
This event requires an entrance fee
Forced platform migrations are usually treated as something to survive. At Scout24, a mandatory OS migration became an opportunity to rethink Kubernetes autoscaling, node provisioning, and infrastructure efficiency.
John Ford explains how Scout24 moved its EKS-based Infinity platform from a polling autoscaler and over-provisioned capacity to Karpenter and Bottlerocket. The result was faster node startup, a safer migration path, and about a 30% infrastructure reduction without major downtime.
In this interview:
Learn from production
Kalyan Josyula
This case study shows how a team traced repeated pod OOM kills in ASP.NET Core to native memory growth from zombie SignalR connections, glibc fragmentation, and kernel socket buffers.
Nick Roan
This case study shows how a single RAG chunk size change collapsed vLLM prefix-cache hit rate from 85% to 4%, triggering an 80% GPU replica increase while latency stayed flat.
It also includes the fix: adding a two-phase cache replay gate in CI.
Dat Ton
This case study explains how cURL 65 errors and DNS resolution failures on AWS EKS were caused by Linux kernel network limits being exceeded, resolved by increasing netdev_budget, netdev_budget_usecs, and netdev_max_backlog parameters.
Matt Camp
This case study shows how Unitary built Osmia, an open-source orchestration layer on EKS to run autonomous AI coding agents safely at scale using pod isolation, Karpenter, IRSA-based secrets, and real-time trajectory scoring.
Matching jobs
DevOps Engineer with Dragos, Inc.
Salary: $165K a year
Location: remote from
Tech stack: Kubernetes, AWS, GCP, Docker, Go, Python, Terraform, Ansible
DevOps Engineer with Iambic Therapeutics, Inc
Salary: $27 to $484K a year
Location: remote from
Tech stack: Kubernetes, AWS, Docker, Python, Shell, Terraform, Ansible
DevOps Engineer with PracticeTek
Salary: $100K to $139K a year
Location: remote from
Tech stack: Kubernetes, AWS, Helm, Python, Shell, Redis, PostgreSQL, MySQL, Terraform
Engineering Manager with Dave
Salary: $58.5K to $473K a year
Location: remote from
Tech stack: Kubernetes, Docker, GraphQL, Typescript, MySQL
Machine Learning Engineer with Dragos, Inc.
Salary: $165K a year
Location: remote from
Tech stack: Kubernetes, AWS, Azure, Docker
Build something
Dilan Dashintha
This tutorial shows how to use Cilium and Hubble to enforce HTTP path based network policies in Kubernetes with eBPF, so you can allow or block specific endpoints without sidecars.
Amolpratap Singh
This tutorial explains TLS and certificate debugging from root CA basics to Kubernetes secrets, with OpenSSL and curl commands for inspecting certs, validating handshakes, and fixing common production errors.
Shanaka Jayasundera
This tutorial shows how to set up Kubernetes Gateway API on EKS using Istio Ambient Mesh exposed through AWS ALB, with Terraform, ArgoCD, and a layered architecture separating infra from app deployment.
Gaurang Malvankar
This tutorial explains how to prevent, detect, and clean up leaked secrets in Git repositories using .env files, Kubernetes Secrets, Gitleaks, GitGuardian, and git-filter-repo.
Call for Papers closing soon
6
days
Location: Istanbul, TR
In-person conference organized by DevOps Turkey.
The conference starts on the 29 September 2026.
6
days
Location: St. Louis, MO, USA
In-person conference organized by Sketch.
The conference starts on the 22 August 2026.
7
days
KubeCon + CloudNativeCon North America 2026
Location: Los Angeles, CA, USA
In-person conference organized by Linux Foundation.
The conference starts on the 26 October 2026.
7
days
Location: Belgrade, RS
In-person conference organized by heapspace.
The conference starts on the 6 November 2026.
7
days
Location: Bergen, NO
In-person conference organized by CND Norway.
The conference starts on the 27 October 2026.
7
days
Location: Istanbul, TR
In-person conference organized by Devopsdays.
The conference starts on the 24 October 2026.
7
days
Kubernetes Community Days Melbourne 2026
Location: Melbourne, AU
In-person conference organized by KCD Australia.
The conference starts on the 4 August 2026.
More articles
Joe Sasson
This article explains how a single-node OpenShift cluster can be turned into a multi-tenant on-premises GPU platform with reservation-based scheduling, MIG partitioning, time slicing, isolated namespaces, and controller-driven self-healing.
Vedran Lebo
This article introduces ctx_, a CLI tool that switches an entire DevOps working context at once, including Kubernetes context, cloud credentials, environment variables, VPN, SSH tunnels, secrets, and browser profile.
Philip McClarence
This article covers running PostgreSQL on Kubernetes in production — comparing Zalando, Crunchy and CloudNativePG operators, storage class decisions, backup strategies, connection pooling, and a take on when Kubernetes is overkill for databases.
Bharat Kalra
This article covers an ingress-nginx to Istio migration, architectural decisions, the RE2 vs PCRE regex incompatibility gotcha, URL rewrite differences, resource overhead comparison between sidecar and ambient mode, and a phased migration strategy.