Leander, Texas, United States
Engineering leader with 20+ years in software engineering and 18+ years leading backend distributed systems teams. I specialize in building high-throughput, low-latency platform systems that power real-time user experiences at global scale.
I’ve led teams at MongoDB, Google, and Amazon building mission-critical backend platforms, including ML- and AI-powered systems, large-scale data pipelines, and developer-facing infrastructure. My work consistently sits at the intersection of product, platform, data, and ML/AI, enabling rapid innovation while maintaining Tier-1 reliability.
I focus on:
- Designing and evolving scalable platform architectures that unlock new product capabilities
- Driving operational excellence (SLOs, SLIs, observability, incident response) in high-availability systems
- Building and mentoring high-performing engineering teams, including developing senior engineers and future leaders
- Defining clear roadmaps and success metrics aligned with business impact
- Partnering cross-functionally with Product, ML, AI, Data, and Analytics teams
I’ve built teams from scratch, stabilized and scaled existing organizations, and led systems handling real-time, high-volume workloads across cloud-native environments.
AWS AI
Working on:
- Open source debugger for SageMaker deep learning training (https://aws.amazon.com/blogs/aws/amazon-sagemaker-debugger-debug-your-machine-learning-models/)
- Open source Deep Java Library (https://djl.ai/)
- Open source deep learning engine MXNet (https://mxnet.apache.org/)
- CI/CD, training, and performance benchmarking
Performance advertising, the fastest-growing business at Amazon. Ran many experiments and metric-driven projects, and built services heavily optimized for latency.