Phakhruddin Abdullah

Senior SRE / DevOps | Distributed Systems | Multi-Partition AWS | Platform Engineering

Greater Seattle Area

About

• I engineer production systems where failure is assumed and risk is modeled. ;• 15+ years building high-availability infrastructure across multi-region and regulated cloud environments. ;• Focused on SLO-driven design, error budgets, failure domain isolation, and throughput-backed capacity planning. ;• Disaster recovery validated through restore math, not documentation. ;• Reliability is a constraint problem, not a redundancy feature.

Experience

  • Systems Development Engineer at Amazon
    Jun 2025 - Present · 1 yr 2 mos

    𝗔𝗺𝗮𝘇𝗼𝗻 𝗟𝗘𝗢[𝗳𝗼𝗿𝗺𝗲𝗿𝗹𝘆 𝗣𝗿𝗼𝗷𝗲𝗰𝘁 𝗞𝘂𝗶𝗽𝗲𝗿] Designed and managed Infrastructure as Code using AWS CDK. Architected and developed Python-based APIs running on EC2, Lambda, and API Gateway. Implemented secure, cross-partition cloud architectures leveraging IAM, x.509/mTLS, and signature-based access control to support manufacturing and ERP workloads across AWS GovCloud and commercial partition.

  • DevSecOps | SRE Engineer at OpsnexAI
    Oct 2024 - Present · 1 yr 10 mos

    • Implemented Kubernetes networking enhancements, including Cluster Mesh and eBPF, to support distributed AI workloads and reduce cross-cluster latency. • Integrated Agentic AI architectures leveraging MCP and RAG to deliver intelligent, context-aware cloud automation and observability workflows, improving incident response and deployment precision.

  • Intuit (San Diego, California, United States)
    • Staff Dev Ops Engineer - Observability
      Sep 2018 - Sep 2024 · 6 yrs 1 mo

      • Architected and operated a highly available Splunk platform on AWS, implementing monitoring and alerting integrations across Splunk, Wavefront, PagerDuty, and Slack APIs/Bots to improve reliability and MTTR. • Automated infrastructure provisioning using AWS CloudFormation and Terraform, and applied GitOps principles with Jenkins and ArgoCD to manage infrastructure and Kubernetes-based deployments consistently across environments. • Implemented configuration management using Ansible and Chef to ensure repeatable and compliant EC2 configurations. • Developed automation for operational and compliance workflows using Python, Bash, Go, and JavaScript, leveraging AWS Lambda and SSM. • Built and maintained Docker and Kubernetes sidecar images for Splunk components, supporting scalable, containerized observability.

    • Senior System Engineer
      Apr 2014 - Sep 2018 · 4 yrs 6 mos

      • Led architecture for mission-critical SaaS platforms (including TurboTax), designing highly available and horizontally scalable systems capable of handling tax-season traffic spikes, supporting millions of concurrent users and billions of transactions with zero downtime. • Engineered resilient production architectures using Veritas clustering, F5 load balancers, and storage replication, and automated configuration management with Bash, Puppet, Chef, and SaltStack to improve reliability, consistency, and operational efficiency at scale.

    • Linux Consultant
      Oct 2010 - Apr 2014 · 3 yrs 7 mos

      • Implemented Veritas High Availability and Storage Foundation for Oracle RAC, ensuring seamless operations. • Established SAN storage availability with active-active synchronous replication, enhancing data redundancy and disaster recovery capabilities.

  • Veritas Storage Consultant at Symantec
    Apr 2009 - Nov 2010 · 1 yr 8 mos

    • Provided consultation for plan and implementation of Symantec's Enterprise Storage Foundation for Oracle RAC, and Business Continuity products Veritas Volume Replicator at a client location. • Collaborated with cross-functional teams to optimize storage solutions • Implemented innovative strategies to enhance data storage efficiency and ensure seamless business continuity for the client.

  • Sun Microsystems (Contract · 3 yrs 8 mos)
    • Solaris/Linux Consultant (Professsional Service)
      Nov 2008 - Apr 2009 · 6 mos

      • Provided expertise for Symantec’s Storage Foundation SFHA Oracle RAC and Veritas Volume Replicator on Solaris servers. • Supported mission-critical Solaris business servers. • Ensured seamless operation of Sparc and x86 servers for strategic initiatives. • Collaborated with the team to optimize server performance and reliability.

    • Solaris Consultant (Professsional Service)
      Apr 2008 - Oct 2008 · 7 mos

      • Architected and designed complex server virtualization solution using Solaris zones for GAP Inc. financial systems on Oracle R12 refresh project. • Implemented innovative solutions to enhance system performance and scalability for remote work environments. • Collaborated with cross-functional teams to ensure seamless integration and successful project delivery.

    • System Engineer (Professsional Service)
      Jan 2008 - Apr 2008 · 4 mos

      • Architect, design, and code Solaris infrastructure deployment program for mission critical business.