Kalyan Josyula
This case study shows how a team traced repeated pod OOM kills in ASP.NET Core to native memory growth from zombie SignalR connections, glibc fragmentation, and kernel socket buffers.
Nick Roan
This case study shows how a single RAG chunk size change collapsed vLLM prefix-cache hit rate from 85% to 4%, triggering an 80% GPU replica increase while latency stayed flat.
It also includes the fix: adding a two-phase cache replay gate in CI.
Dat Ton
This case study explains how cURL 65 errors and DNS resolution failures on AWS EKS were traced to exceeded Linux kernel network limits and resolved by increasing the netdev_budget, netdev_budget_usecs, and netdev_max_backlog parameters.
Matt Camp
This case study shows how Unitary built Osmia, an open-source orchestration layer on EKS that runs autonomous AI coding agents safely at scale using pod isolation, Karpenter, IRSA-based secrets, and real-time trajectory scoring.
Aditya Suryawanshi
This is a war story about a 3-person startup that replaced a $14,850/month over-engineered Kubernetes setup on AWS with Fly.io for $680, cutting P99 latency from 320ms to 180ms and deploy time from 8 minutes to 45 seconds.
Events starting soon
May 21, 2026
Location: Barton, AU
This is a free event.
May 21, 2026
Location: Geneva, CH
This event requires an entrance fee.
May 21, 2026
Location: Madrid, ES
This event requires an entrance fee.
Use PARTNER-DISC-20-KUBE to get 20% off
May 21, 2026
Location: Tokyo, JP and virtual
This is a free event.
May 21, 2026
Location: Zürich, CH
This is a free event.
May 21, 2026
Location: Groningen, NL
This is a free event.
More Case Studies
Ejiroghene Laurel Dafe
This case study shows how one engineer resolved two real Kubernetes production incidents involving an overly aggressive Ingress rate limit and Istio breaking non-HTTP socket traffic.
Maxim Nazarenko
This case study explains how to migrate bound Kubernetes volumes from deprecated in-tree Azure Disk provisioning to CSI with in-place PVC re-binding, minimal restarts, and no data loss across production disks.
DV Engineering
This case study shows how DoubleVerify built a Kubernetes and Ray serving platform to deploy and scale ML models in production.
It also covers RayService wrapped with Helm, fault tolerance with external Redis, and platform gains like 30% lower GPU cost.
Varun Arora
This case study shows how to build a centralized monitoring platform for 25+ AWS accounts, using Python Boto3 to fetch resource configurations into MongoDB, with a Flask API and a Next.js frontend, achieving $30k in annual savings.
Firas Sboui
This case study shows how to run SQL Server on Azure Kubernetes Service using StatefulSets, persistent volumes, and GitOps for multi-tenant database deployments.
Matching jobs
Data Engineer with Capital Technology Group
Salary: $26 to $297.88K a year
Location: remote from
Tech stack: Kubernetes, AWS, Docker, Java, Python, SQL, PostgreSQL, Kafka, Terraform, Splunk
Data Engineer with Rivian and Volkswagen Group Technologies
Salary: US$97.2K to US$220.22K a year
Location: based in the office in Vancouver, CA
Tech stack: Kubernetes, Go, Spark
DevOps Engineer with Blink Health
Salary: $90 to $484K a year
Location: remote from
Tech stack: Kubernetes, AWS, Grafana, Nagios, NewRelic
DevOps Engineer with Bluestaq
Salary: $95K to $140K a year
Location: based in the office in Colorado Springs, CO, USA
Tech stack: Kubernetes, AWS, Azure, GCP, On-premise, Powershell, Python, Shell, Terraform, Ansible
DevOps Engineer with Box
Salary: $90 to $484K a year
Location: based in the office in Warsaw, PL
Tech stack: Kubernetes, AWS, Azure, GCP, Docker, Go, Python, Ruby, Typescript, GitHub Actions