Spotlight
Scott Cotton
This article introduces Tony format, a tool that unifies matching, patching, and diffing operations on YAML and JSON using a single typed tree representation with tag-based extensions like !dive, !key, and !if for structural transformations.
Dilip Kola
This article explains how to build cost-efficient microservices on AKS by classifying state as irreplaceable or regenerable, using managed PostgreSQL for critical data while self-hosting Redis, RabbitMQ, and observability tools in Kubernetes.
Borwornpob
This article describes OpenMirai's deployment pipeline using GitHub Actions for CI, Argo CD for GitOps, and a separate deployment repository, with staging-first testing and scheduled production releases during off-peak hours.
Michael Preston
This article explains the challenges of running Java at scale on Kubernetes, covering JVM memory management with container limits, heap sizing with MaxRAMPercentage, CPU throttling, and garbage collector selection for containers.
Tools and utilities
flux-operator simplifies the configuration of Flux multi-tenancy lockdown, sharding, horizontal and vertical scaling, persistent storage, and allows fine-tuning the Flux controllers with Kustomize patches.
KCL allows developers to create modular, scalable, and stable configurations.
CronJob Guardian monitors Kubernetes CronJobs with dead-man's switch detection, SLA tracking for success rates and duration regressions, intelligent alerting via Slack/PagerDuty/webhook/email, and a built-in web dashboard with charts and metrics export.
Forecastle is a control panel which dynamically discovers and provides a launchpad to access applications deployed on Kubernetes.
This tool provides a comprehensive test suite to evaluate real-world behavior (latency, scale, route propagation, traffic) of Kubernetes Gateway API implementations, beyond basic conformance.
Events starting soon
April 2, 2026
Location: Cluj-Napoca, RO
This is a free event.
April 2, 2026
Location: London, GB
This is a free event.
April 3, 2026
This is a virtual event
This is a free event.
April 4, 2026
Location: Bengaluru, IN
This is a free event.
April 4, 2026
Location: Ibadan, NG
This is a free event.
April 4, 2026
This is a virtual event
This event requires an entrance fee
Learn from production
Farid Guluzade
This case study shows how reducing JVM MaxRAMPercentage, cutting the Hikari connection pool from 50 to 20, and implementing aggressive HPA scale-up (0s stabilization, 4 pods/min) doubled traffic capacity while cutting baseline pods from 26 to 10.
Ron Matsliah
This article describes how the team at Next Insurance built an AI-powered microservice that watches build failures via Jenkins, analyzes logs automatically and posts clear, helpful feedback to Slack.
Scout24
This case study shows how Scout24 turned an Amazon Linux 2 end-of-life deadline into a 30% reduction in nodes across their EKS clusters by combining OS migration with Karpenter adoption.
Tamer Abdulghani
This tutorial teaches how to build and deploy production-ready micro frontends in Kubernetes.
It also covers configuring ingress routing for module federation, and implementing CI/CD pipelines for isolated micro frontend teams.
Matching jobs
Data Engineer with Astranis
Salary: $29 a year
Location: based in the office in San Francisco, CA, USA
Tech stack: Kubernetes, Python, SQL
Data Engineer with Coface
Salary: $64.8K to $2.48L a year
Location: based in the office in Mumbai, IN
Tech stack: Kubernetes, Docker, SQL, Python
DevOps Engineer with AssureSoft
Salary: $90K to $198K a year
Location: remote from
Tech stack: Kubernetes, AWS, GCP, ArgoCD, Terraform
DevOps Engineer with Avride
Salary: $54K to $325.49K a year
Location: based in the office in Austin, TX, USA
Tech stack: Kubernetes, AWS, Docker, C++, Python, GitHub Actions
DevOps Engineer with Candidly
Salary: $70.74K to $539K a year
Location: remote from
Tech stack: Kubernetes, AWS, Azure, On-premise, Docker, Python, SQL, MySQL, Cloudformation, CDK
Build something
Tobby Kuo
This tutorial teaches how to build an end-to-end real-time baggage tracking system using Kafka for event streaming, Flink for state processing, ClickHouse for analytics, and Grafana for visualization on Kubernetes.
Shanaka Jayasundera
This tutorial shows how to expose Kubernetes Gateway API from AKS through Azure Application Gateway by fixing health probe failures with a dedicated HTTPRoute and connection timeouts using externalTrafficPolicy Local for Azure DSR.
Ahmad Asmar
This tutorial shows how to use Kyverno policy engine to generate Pod Disruption Budgets for Kubernetes deployments with multiple replicas, preventing downtime during Karpenter node consolidation through intelligent API lookups and label matching.
piotr.minkowski
This tutorial teaches how to use the In-Place Pod Resize feature in Kubernetes version 1.35 combined with Kube Startup CPU Boost controller to speed up Java application startup by temporarily increasing CPU resources during the startup phase.
More articles
Ar Hakboian
This article describes an experiment using three autonomous AI agents to conduct multi-agent SRE incident investigations in a sandboxed Kubernetes environment with real tooling access.
Vasily Pilitsyn
This article shows how KRO manages ephemeral test environments as single Kubernetes API objects by orchestrating resource deployment in dependency order, with readiness conditions and a unified status across namespace, frontend, backend, and database.
Rodrigo Caldas
This article shows how to use tofu-controller to manage Terraform resources with GitOps for external systems like Grafana dashboards and HashiCorp Vault policies with continuous reconciliation and automatic drift detection.
Velorum
This article covers 10 Kubernetes HPA scaling signals beyond CPU — RPS, queue lag, p95 latency, in-flight requests, error rate, and saturation metrics — with YAML examples and pitfalls for each.