Senior Platform & Site Reliability Engineer — Chicago, IL
10 years bridging production networking and cloud infrastructure at scale. Currently architecting Instructure's multi-region EKS platform that powers Canvas LMS for tens of millions of learners worldwide. I combine deep networking foundations with modern platform engineering — and I own migrations end-to-end.
Built and scaled "Trigger" — the platform powering Canvas LMS deployments. Helm/Kustomize manifests, Kyverno policy-as-code, Akuity-managed ArgoCD GitOps. Led migration of production workloads off legacy Cloudgate/Condor PaaS onto a Node.js/Express + MongoDB deployment API.
Phased AppGate ZTNA rollout replacing Teleport and standalone SSH keys for the entire engineering org. Wired ArgoCD prod Okta SSO via SCIM and rotated IAM Identity Center SCIM tokens across multi-account AWS Organizations.
Migrated two parallel TGW meshes off VyOS-on-EC2, then led a fleet-wide security-group referencing rollout across 34 TGWs in 9 regions — eliminating thousands of CIDR-based rules. Authored reusable Terraform modules and orchestrated Terraform Cloud workspace applies at scale.
CrowdStrike CSPM across 9 AWS accounts, AWS Config continuous-mode StackSets org-wide, CloudZero cost-attribution labels across the EKS fleet, and WAF tuning during an active post-compromise investigation.