Bucharest, Bucharest, Romania
✔ Senior engineering leader with 15+ years of progressive experience in IT and Cloud Infrastructure, specializing in large-scale Kubernetes platform operations and Site Reliability Engineering ✔ Managing multi-functional teams across EMEA, scaling from individual contributors to manager-of-managers structure supporting 430+ production clusters serving Adobe's global infrastructure ✔ Proven track record in strategic project delivery: leading $15M+ cloud migrations, building automated fleet management systems, and establishing E2E testing frameworks for platform reliability ✔ Champion of DevOps culture, SRE best practices, and modern operational excellence - reducing manual overhead by 50% through intelligent automation and AI-powered tooling ✔ Technical leadership in cutting-edge technologies: Kubernetes (EKS, ROSA, AKS), Cloud Computing (AWS, Azure), Infrastructure as Code, GitOps, and AI-assisted development workflows ✔ Certified Product Owner with expertise in Agile methodologies, OKR-driven planning, and building high-performing engineering teams through mentorship and career progression ✔ Executive MBA graduate (Tiffin University, 2023) combining technical depth with business acumen for strategic decision-making
Leadership & Team Management: - Leading 23 engineers across 3 functional groups with a phased transition to manager-of-managers structure - Developing next-generation leaders through targeted mentorship, promoting 2 engineers to management roles in FY26 - Managing global matrix teams and EMEA operational support for Adobe's Kubernetes platform Strategic Project Delivery: - Leading $15-18M/year migration (24+ production clusters, 150M+ inference requests/day) - Architecting automated Kubernetes release rollout system targeting 50% reduction in engineering resources and 8-10 week fleet upgrade cycles - Building E2E Testing Framework for 430+ production clusters to establish 99.9% availability baselines and continuous reliability validation Platform & Product Ownership: - Product Owner for tools and automation managing a fleet of 430+ Kubernetes clusters across EKS, ROSA, and AKS - Driving Ethos Kubernetes platform roadmap with focus on reliability, automation, and operational excellence - Implementing AI-powered agile project management workflows for quarterly planning, capacity modeling, and OKR tracking Operational Excellence: - Overseeing 40% operational allocation while maintaining 60% project delivery capacity - Reducing manual cluster upgrade overhead from 55% (860 operations) to <10% through intelligent automation - Establishing SRE best practices, SLO/SLI frameworks, and data-driven reliability improvements across the platform
• Operationalize and contribute to the next-generation platform: Mesos, Marathon and Docker • Create and support tools and applications that help automate and sustain large scale infrastructure • Support and maintain global application production environments • Automate common, repeatable tasks at large scale • Work with a wide variety of AWS cloud services • Design and maintain production monitoring systems • Evaluate and manage application and environment security • Work in a diverse and global team environment • Promoter of the DevOps mindset
• Determined and provided a necessary level of technical documentation during requirements gathering, based on technical services group standards, code, functional designs and discussions with potential staff, industry and vendors • Proactively collaborated with producers to ensure that appropriate staffing resources were assigned and effectively utilized • Worked closely with quality assurance resources to create test plans and ensured that issues were properly fixed and regressed • Provided technical consultancy to department staff and participated in business development efforts including proposals, development plans and presentations as directed • Assisted technical staff with scoping identified project deliverables and created project specific documentation such as functional and technical specifications • Acted as point of technical solution throughout the lifecycle of projects as they related to relevant decisions • Collaborated with external technology vendors, internal staff members and third party consultants • Managed incidents, designed technical solutions, supported software testing and releases
• Conducted training sessions and delivered presentations for the online team from QCAA • Ensured client’s support tickets were resolved in a timely and professional manner • Build strong relationships team members and customers at various levels • Collected client and document knowledge about businesses and technical setup • Performed feasibility studies, technical solutions and proof of concept development
• Mentored new software development employees • Performed software deployment and system administration • Provided accurate requirement estimations and impact assessments
• Operated several important projects of various sizes using main platforms such as: PHP, MySQL and jQuery • Prototyped software applications for new projects and provided accurate cost estimates