Karpenter v1 vs Cluster Autoscaler: A Production Migration Story
I’d been running Cluster Autoscaler on our production EKS cluster for years. It worked. It wasn’t exciting, it wasn’t cheap, but it …
Read Article →166 articles about aws development, tools, and best practices
I’d been running Cluster Autoscaler on our production EKS cluster for years. It worked. It wasn’t exciting, it wasn’t cheap, but it …
Read Article →Single-account AWS is a ticking time bomb. I don’t say that lightly. I’ve watched it blow up firsthand, and I’ve spent more hours …
Read Article →You don’t know your system is resilient until you’ve broken it on purpose.
I believed our payment processing service was fault tolerant. …
Read Article →Bedrock is AWS finally getting AI right. I don’t say that lightly. I’ve watched AWS stumble through SageMaker’s complexity, watched …
Read Article →If you’re not running scheduled terraform plan, you have drift. You just don’t know it yet.
I learned this the hard way. A colleague made …
Read Article →Everything I’ve learned building on AWS since 2012, organized by domain.
Aurora Serverless v2 is what v1 should have been. I don’t say that lightly — I ran v1 in production for two years and spent more time fighting …
Read Article →I deleted roughly 2,000 lines of orchestration code from our payment processing service last year. Replaced it with about 200 lines of Amazon States …
Read Article →EventBridge is the most underused AWS service. I’ll die on that hill. Teams will build these elaborate Rube Goldberg machines out of SNS topics, …
Read Article →I use both. Terraform for multi-cloud, CDK when it’s pure AWS and the team knows TypeScript. That’s the short answer. But the long answer …
Read Article →VPNs are not zero trust. Stop calling them that.
I can’t count how many times I’ve sat in architecture reviews where someone points at a …
Read Article →I got a call from a startup founder last year. “Our AWS bill just hit $47,000 and we have twelve engineers.” They’d been running for …
Read Article →I once inherited a project with a single main.tf that was over 3,000 lines long. No modules. No abstractions. Just one enormous file that deployed an …
I started learning Rust as someone who’d spent years writing Python scripts and Go services for cloud infrastructure. My first reaction was …
Read Article →ECS is underrated. Most teams picking EKS don’t need it. I’ve been saying this for years, and I’ll keep saying it until the industry …
Read Article →I’ve lost count of how many times someone’s told me “Lambda has cold start problems” like it’s some fatal flaw. It …
Read Article →I’ve been managing Terraform state across production environments for years now, and if there’s one thing I’m certain of, it’s …
Read Article →Cloud security misconfigurations have become one of the leading causes of data breaches and security incidents. As organizations rapidly adopt cloud …
Read Article →In today’s cloud-native world, applications require access to numerous secrets—API keys, database credentials, encryption keys, and other …
Read Article →If you need to assume a role in AWS and then revert back to the previous role once complete, you can use the following snippet of code:
# Save …In today’s cloud-centric world, managing infrastructure manually is no longer feasible. As organizations scale their cloud presence, the …
Read Article →So, what policies does my role need to create an AWS EKS cluster and be the admin?
AmazonEKSClusterPolicy: This policy grants permissions to create …
As organizations continue to migrate workloads to the cloud, many are experiencing the phenomenon known as “cloud shock”—the realization …
Read Article →Managing Docker images in Amazon Elastic Container Registry (ECR) is a common task for many developers and DevOps practitioners. Often, you need to …
Read Article →