Honolulu, Hawaii, United States
When working on a problem, I never think about beauty; I think only of how to solve the problem. But when I have finished, if the solution is not beautiful, I know that it is wrong. - R Buckminster Fuller
I lead Site Reliability Engineering at Onebrief, focused on reliability, observability, incident response, and deployments across both standard and air-gapped environments. My role has been a mix of people leadership and hands-on infrastructure work, helping the company scale operationally while supporting demanding customer environments. Built and expanded SRE practices around observability, incident response, and production operations Supported high-touch deployments and troubleshooting in secure and air-gapped environments Improved system visibility, operational readiness, and debugging workflows across engineering Helped deliver infrastructure support for new deployment targets. Worked closely with platform and engineering leadership on reliability strategy, team structure, and operational ownership Continued hands-on work across Kubernetes, Helm, Linux, monitoring, and production support while managing engineers
Designed and maintained Kubernetes-based infrastructure and observability systems supporting production and restricted deployments Built telemetry pipelines using OpenTelemetry, Grafana Alloy, Prometheus, Loki, Tempo, and Grafana OSS Implemented frontend and backend tracing to improve performance analysis, debugging, and customer-impact visibility Supported air-gapped deployment packaging, infrastructure troubleshooting, monitoring, and operational readiness Worked across Helm, Kubernetes, Linux, AWS, secure networking, and deployment tooling to improve reliability and repeatability Helped evaluate and support multiple hosting and deployment patterns for government and defense use cases Contributed to deployment tooling and infrastructure strategies for resilient fielded environments
Led the platform team as a key architect and decision-maker. contributed meaningful technical and non-technical progress towards achieving a Continuous authority to operate (cATO) for the program. led to cutting costs for our customers by 70% through the automation of infrastructure, using Packer and Terraform. managed hundreds of cloud servers on a small high-performing team. managed over 30 Kubernetes clusters. deployed several Kubernetes apps in a gitops fashion. worked extensively on gitlab-ci pipeline templates for developer workflows and common pipeline tools. integrated several new products(cilium, artifactory, ansible automation platform,fleet) into the platform to better assist and ease the burden on software developers. architected and implemented applications that achieved 5 9's of availability. mentored and reviewed code for junior engineers. Maintained platform that supported 500,000+ end users. Scrum master and key contributor to sprint planning and platform direction. worked on the company side to expand our contracts by writing detailed capability assements to allow procurement of other government contracts.
Managed and Developed a Platform/environment for developers to make deployments and software development faster and more secure. utilized gitlab, rancher, keycloak, harbor, and other tools as part of the platform. integrated numerous tools with our single sign-on platform keycloak which was able to achieve client cert authentication. deployed and developed internal documentation for harbor container registry including setting up the trivy container scanning so that the platform had updated security information.
DevOps engineering work in a small agile team mostly focusing on AWS and OpenShift environments running dockized applications. Deployed through Jenkins pipelines from the application git repository. Developed and contributed to the team groovy library. Instituted an agile release plan and scrum process.
devops engineering work on the canyon program, mostly focusing on git,Jenkins, and python. worked with limited supervision to implement improvements across the program for both developers and testers. collaborated with my fellow systems engineers to come up with unit test frameworks and implement more automation through deployment and testing.
I served as a Systems Administrator supporting the GOES Program. I was responsible for a variety of tasks including daily system checks, system patches, process improvements, scripting, and new COTS integration.
Responsible for providing technical support to engineers on a variety of tasks. Conducted tests and recorded data to assist with engineering evaluation or analysis. Maintained and edited engineering documentation, reports and drawings. Successfully implemented AutoCad LT into design projects. Used standard test equipment to troubleshoot circuits. Assisted in the development of High Voltage test equipment