Greater Seattle Area
• I engineer production systems where failure is assumed and risk is modeled. ;• 15+ years building high-availability infrastructure across multi-region and regulated cloud environments. ;• Focused on SLO-driven design, error budgets, failure domain isolation, and throughput-backed capacity planning. ;• Disaster recovery validated through restore math, not documentation. ;• Reliability is a constraint problem, not a redundancy feature.
𝗔𝗺𝗮𝘇𝗼𝗻 𝗟𝗘𝗢[𝗳𝗼𝗿𝗺𝗲𝗿𝗹𝘆 𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝗞𝘂𝗶𝗽𝗲𝗿] Designed and managed Infrastructure as Code using AWS CDK. Architected and developed Python-based APIs running on EC2, Lambda, and API Gateway. Implemented secure, cross-partition cloud architectures leveraging IAM, x.509/mTLS, and signature-based access control to support manufacturing and ERP workloads across AWS GovCloud and commercial partition.
• Implemented Kubernetes networking enhancements, including Cluster Mesh and eBPF, to support distributed AI workloads and reduce cross-cluster latency. • Integrated Agentic AI architectures leveraging MCP and RAG to deliver intelligent, context-aware cloud automation and observability workflows, improving incident response and deployment precision.
• Architected and operated a highly available Splunk platform on AWS, implementing monitoring and alerting integrations across Splunk, Wavefront, PagerDuty, and Slack APIs/Bots to improve reliability and MTTR. • Automated infrastructure provisioning using AWS CloudFormation and Terraform, and applied GitOps principles with Jenkins and ArgoCD to manage infrastructure and Kubernetes-based deployments consistently across environments. • Implemented configuration management using Ansible and Chef to ensure repeatable and compliant EC2 configurations. • Developed automation for operational and compliance workflows using Python, Bash, Go, and JavaScript, leveraging AWS Lambda and SSM. • Built and maintained Docker and Kubernetes sidecar images for Splunk components, supporting scalable, containerized observability.
• Led architecture for mission-critical SaaS platforms (including TurboTax), designing highly available and horizontally scalable systems capable of handling tax-season traffic spikes, supporting millions of concurrent users and billions of transactions with zero downtime. • Engineered resilient production architectures using Veritas clustering, F5 load balancers, and storage replication, and automated configuration management with Bash, Puppet, Chef, and SaltStack to improve reliability, consistency, and operational efficiency at scale.
• Implemented Veritas High Availability and Storage Foundation for Oracle RAC, ensuring seamless operations. • Established SAN storage availability with active-active synchronous replication, enhancing data redundancy and disaster recovery capabilities.
• Provided consultation for plan and implementation of Symantec's Enterprise Storage Foundation for Oracle RAC, and Business Continuity products Veritas Volume Replicator at a client location. • Collaborated with cross-functional teams to optimize storage solutions • Implemented innovative strategies to enhance data storage efficiency and ensure seamless business continuity for the client.
• Provided expertise for Symantec’s Storage Foundation SFHA Oracle RAC and Veritas Volume Replicator on Solaris servers. • Supported mission-critical Solaris business servers. • Ensured seamless operation of Sparc and x86 servers for strategic initiatives. • Collaborated with the team to optimize server performance and reliability.
• Architected and designed complex server virtualization solution using Solaris zones for GAP Inc. financial systems on Oracle R12 refresh project. • Implemented innovative solutions to enhance system performance and scalability for remote work environments. • Collaborated with cross-functional teams to ensure seamless integration and successful project delivery.
• Architect, design, and code Solaris infrastructure deployment program for mission critical business.