Summary
Senior DevOps engineer with 8 years designing and operating large-scale cloud infrastructure across AWS and GCP. Proven record of building self-healing systems, accelerating release velocity, and embedding reliability engineering practices that reduce incidents without slowing teams down. Brings equal fluency in platform architecture and the human side of DevOps — documentation, mentorship, and cross-functional trust.
Skills
Cloud: AWS, GCP, Azure · IaC: Terraform, Pulumi, CloudFormation · Containers: Kubernetes, Docker, Helm · CI/CD: GitHub Actions, ArgoCD, Jenkins, CircleCI · Observability: Prometheus, Grafana, Datadog, PagerDuty · Languages: Python, Bash, Go · Security: Vault, SOPS, IAM, SOC 2
Experience
Staff DevOps Engineer
Helix Infrastructure · Portland, OR | Jan 2021 – Present
- Architected multi-region Kubernetes platform on AWS serving 300+ microservices, achieving 99.995% uptime across 24 months of continuous operation.
- Rebuilt CI/CD platform using GitHub Actions and ArgoCD, cutting average deployment time from 47 minutes to 6 minutes and increasing deploy frequency by 5×.
- Led cloud cost optimisation initiative across 14 engineering teams, reducing AWS spend by $380K/yr through right-sizing, spot instance adoption, and reserved capacity planning.
- Established SRE practice from scratch including SLO frameworks, runbook standards, and on-call rotations, reducing mean time to recovery by 71% over 18 months.
- Mentored 5 mid-level engineers toward senior promotion; 3 now independently own platform domains.
Senior DevOps Engineer
Lattice Systems · Seattle, WA | Mar 2018 – Dec 2020
- Migrated 60+ legacy services from bare-metal to GCP using Terraform and Kubernetes, reducing provisioning time from 3 days to 22 minutes with zero production data loss.
- Designed secrets management architecture using HashiCorp Vault, eliminating all hardcoded credentials across the codebase and achieving SOC 2 Type II compliance.
- Built centralised observability stack with Prometheus and Grafana covering 2,400+ metrics, reducing alert noise by 58% through intelligent threshold tuning.
DevOps Engineer
Kova Digital · San Francisco, CA | Aug 2015 – Feb 2018
- Introduced infrastructure-as-code practices using Terraform across a 40-person engineering org, reducing environment drift incidents by 83%.
- Automated nightly database backup and restore pipeline covering 12 TB of production data, meeting a recovery time objective (RTO) of under 4 hours for disaster recovery scenarios.
Open Source & Projects
KubeGuard: Kubernetes RBAC Auditor | Go · Kubernetes API · github.com/tariqosman/kubeguard
- Built open-source RBAC auditing tool for Kubernetes clusters; adopted by 3 enterprise teams, with 980 GitHub stars within 6 months of release.
Contributor: hashicorp/terraform | Go · github.com/hashicorp/terraform
- Merged 4 pull requests improving provider error handling and state locking reliability across high-concurrency environments.
Education
B.S. in Computer Science · Oregon State University | May 2015
Relevant Coursework: Distributed Systems, Operating Systems, Networks, Cloud Computing
Certifications
Certified Kubernetes Administrator (CKA) | CNCF · 2022
AWS Certified DevOps Engineer - Professional | Amazon Web Services · 2021
HashiCorp Certified Terraform Associate | HashiCorp · 2020