Tomasz Michalak

Software Architect | AI reliability engineering at scale | ex-Atlassian Principal SRE | Co-founder, Hands-on Architects

Poznań, Wielkopolskie, Poland

About

Teams don't stall because their AI model is wrong. They stall because the harness around it isn't built yet - no verification layer, no constraints, no way to know when the agent is succeeding quietly at the wrong thing.

I've been close to this problem twice. First at Atlassian, in the engine room of Jira Cloud - 400+ shards, 50k+ tenants, top-tier customers - where the hard part was rarely the technology. It was getting the team clear on what "right" even looked like - before the code, before the sprint, before the slide deck.

"Tomasz brought clarity to the problem space, aligned the engineering team and stakeholders around a path forward, and delivered a clear and concise plan that was later executed on successfully."
- Sam Jebeile, Head of Engineering, Atlassian

Now I'm watching the same pattern unfold while building AI agent infrastructure at a US-headquartered SaaS company - as the founding member of a Sr. Director-sponsored AI Guild, co-drafting the AI adoption strategy for our Poland engineering offices.

Alongside that, I co-founded Hands-on Architects with Maciej Laskowski. We write and advise on reliability and architecture at the point where theory meets the production incident.

If that's your team's problem space, DMs are open.

Experience

  • Senior Software Engineer at Egnyte
    Dec 2025 - Present · 8 mos

    Operating beyond the Senior Software Engineer title - within five months at Egnyte, my scope expanded from feature engineering to architecture contributions, AI strategy co-design with senior leadership, and internal thought leadership reaching VP-level stakeholders.

    Joined the MCP (Model Context Protocol) initiative, closely cooperating with a Principal Engineer while maintaining full autonomy. Contributed to Architecture Decision Records, initiated harness engineering practices for AI-assisted development, improved developer experience, organized DDD workshops bridging product and engineering, and created a DevOps training program for the team. Described our project learnings through two internal blog posts (93 combined views across ~1,000 employees).

    Key initiative: introduced AI-assisted e2e testing into the CI pipeline using a 4-agent harness (Planner, Implementer, Reviewer, Runner). The metric - commits per step (revision cycles per test step) - showed a 40x improvement: from 2.00 in Suite 01 to 0.05 in Suite 06, where the most complex 20-step scenario completed in a single commit.

    Co-founded Egnyte Poland's AI Guild with the Sr. Director - a cross-functional group assessing AI adoption maturity across teams and defining AI strategy for the engineering organization. Co-designed member acceptance criteria, assessment methodology (adapted from the Harness Model), governance, and pilot programs. Presented AI vision to SVP Products.

    Technologies: Python, Java, Spring (Boot, AI, Cloud, JPA), Docker, Kubernetes, GCP, Azure, MySQL, Redis, MCP

  • Principal Site Reliability Engineer at Atlassian
    Aug 2024 - Nov 2025 · 1 yr 4 mos

    As a Principal Site Reliability Engineer (SRE, P60) at Atlassian, I played both a strategic and hands-on role in shaping the reliability and performance of Atlassian's products (Monoliths: Jira & Confluence). My responsibilities included defining and executing the SRE strategy & roadmap, leading major architectural changes (shard provisioning & deployment, tenant allocation), and enhancing team practices to ensure high availability and reliability of services.

    I partnered with SRE leadership to define the strategic direction for all SRE teams (in EMEA, US, and APAC), driving initiatives like load prioritization, rate limiting, load-shedding, and sharding standardisation. Beyond strategic initiatives, I was also deeply involved in hands-on coding, mentoring engineers, and participating in incident response and post-incident reviews to continuously improve our reliability practices.

    My role also involved:
    - Advising on defining SLIs and SLOs for Jira (across 400+ shards), contributing to reliability metrics and service health.
    - Investigating and resolving performance issues for high-impact customers, such as Revolut and General Motors, which was recognized as a critical success for the customer support team.
    - Serving in the EMEA on-call rotation, providing support during critical incidents and outages.
    - Participating in company-wide initiatives such as Hackathons (ShipIt) and contributing to the Atlassian community through mentorship and knowledge sharing (e.g., pair programming, code reviews, blog posts, and technical talks).
    - Improving team practices by introducing structured planning, prioritization, and collaborative practices (e.g., pair programming, backlog grooming, retrospectives).

    Technologies: AWS (EC2, EBS, ELB, ASG, RDS/Aurora, SQS, SNS, S3, CloudWatch, CloudFormation, CDK), Docker, Spinnaker, Bitbucket, Go, Java, Spring, NGINX, Splunk, SignalFX, Memcached, Statsig, LaunchDarkly, Terraform

  • Software Architect at Dynamic Solutions
    Mar 2022 - Jul 2024 · 2 yrs 5 mos

    As a Software Architect at Dynamic Solutions, I led the transformation of the company's products into cloud-native applications and established modern software engineering practices, including CI/CD, automated testing, Architecture Decision Records (ADRs), and knowledge sharing. I architected and implemented two flagship products while providing strategic technical consultation across multiple client engagements.

    Projects:
    - WebSight CMS (websight.io): Transformed a traditional monolithic OSGi-based CMS into a modern, cloud-native application (adopting Sling Models) using containerisation and Kubernetes orchestration for scalable, multi-channel digital experience delivery.
    - StreamX (streamx.dev): Designed and implemented a real-time event-driven content distribution platform enabling seamless data integration and personalisation at scale.

    Client Consulting:
    - Provided expertise on Adobe Experience Manager as a Service (AEMaaS) and Adobe Edge Delivery Services (EDS) for pharmaceutical and other industries.
    - Analysed and optimised frontend performance for aviation clients, focusing on user experience enhancements and backend integration.

    Key achievements:
    - Cloud-Native Transformation: Architected containerized applications on Kubernetes, implementing Infrastructure as Code (Terraform, Helm) across AWS and GCP to ensure scalability, reliability, and portability.
    - Event-Driven Architecture: Implemented MVP event-streaming solution using Apache Pulsar, enabling ahead-of-time processing and seamless composition of CMS, Commerce, PIM, and CRM systems across web, search, and mobile channels.
    - CI/CD Leadership: Developed robust CI/CD pipelines (GitHub Actions) automating build, test, and deployment processes for rapid, reliable software delivery.
    - Architecture Governance: Documented architectural decisions using ADRs, capturing rationale, trade-offs, context, and alignment with project goals and constraints.
    - Presented products at Aviation Festival in Lisbon.

  • Principal Software Engineer | Software Engineering House Lead at Wunderman Thompson Commerce & Technology
    Jan 2021 - Feb 2022 · 1 yr 2 mos

    As Principal Software Engineer and Developer Advocate at Wunderman Thompson Technology, I led the Software Engineering House (Java Practice Lead) during the company's transformation from Cognifide. I was part of the leadership team, championing modern software engineering practices, DevOps culture, and cloud-native architecture across the organisation.

    Key Responsibilities & Achievements:
    - Engineering Leadership: Established and led the Software Engineering practice, introducing Architecture Decision Records (ADRs) and Technology Radar for better decision-making. Implemented O'Reilly Learning platform to foster continuous learning and professional development.
    - Multi-Project Technical Lead: Delivered 10+ projects simultaneously, from small cloud-native serverless applications to large-scale digital platforms for global brands (500+ sites). Provided critical technical leadership during crucial delivery phases.
    - Cloud-Native Architecture: Architected and implemented solutions on AWS and GCP, guiding teams in microservices, Kubernetes, and serverless computing (AWS Lambda). Championed Infrastructure as Code (Terraform, AWS CloudFormation, AWS CDK) for consistent, repeatable deployments.
    - Technical Consulting: Provided expertise across frontend development (React, TypeScript), end-to-end testing (Cypress), CI/CD (GitHub Actions), and cloud architecture. Led competency matrix initiative using Neo4j to identify skill gaps and training needs.
    - DevOps Advocate: Fostered DevOps culture, promoting collaboration between development and operations. Implemented CI/CD pipelines, automated testing, and IaC practices to streamline workflows and improve deployment efficiency.

  • Cognifide (On-site)
    • Principal Software Engineer
      Jul 2017 - Jan 2021 · 3 yrs 7 mos

      As a Principal Software Engineer at Cognifide, I led the development and strategic direction of advanced integration solutions for Adobe Experience Manager (AEM), e-commerce platforms, search engines, and marketing tools. I pioneered an innovative CMS integration approach that I extended into Knot.x, an open-source reactive integration layer built on Vert.x technology. This project became a cornerstone of the company's integration capabilities and demonstrated my ability to drive innovation at scale. I remained actively involved as a hands-on contributor in complex integration projects, providing technical expertise on the most challenging initiatives.

      Key Responsibilities & Achievements:
      - Integration Architecture: Revolutionized AEM integration processes by designing and implementing Knot.x using modern microservices architecture and event-driven design principles. The solution improved integration flexibility, performance, and maintainability across multiple client projects.
      - Technical Leadership: Collaborated with cross-functional teams including architects, developers, business stakeholders, and system engineers to ensure seamless implementation of complex integration scenarios.
      - Community Building: Established and nurtured the Knot.x open-source community, promoting collaboration and knowledge sharing among internal and external stakeholders. Organised knowledge-sharing sessions and training programs that fostered a culture of continuous learning.
      - Conference Speaker: Presented at renowned conferences (Devoxx, GeeCon) and actively participated in Java User Group (JUG) meetings in Szczecin and Poznań, showcasing expertise in event-driven integration patterns and reactive systems.
      - Infrastructure Excellence: Strengthened system engineering capabilities through extensive work with Chef, Vagrant, and Docker, ensuring consistent and automated deployment processes.

    • Senior Software Engineer
      Jan 2014 - Jun 2017 · 3 yrs 6 mos

      As a Senior Backend Software Engineer at Cognifide, I specialised in Adobe Experience Manager (AEM) and its integration with e-commerce platforms, search engines, and marketing tools. I designed and implemented AEM solutions for clients across pharmaceuticals, finance, banking, automotive, and beauty industries, delivering high-quality digital experiences through close collaboration with cross-functional teams. I was a key contributor on 7+ projects, leading integrations with Apache Solr, Magento, and Shopify. I actively participated in architecture discussions, knowledge-sharing sessions, hackathons, and code reviews to maintain best practices and coding standards.

      Key Technical Expertise:
      - AEM Development: Deep expertise in AEM architecture (CQ 5.6.x, AEM 6.x), OSGi, JCR, Sling, and AEM APIs. Developed components, templates, workflows, and services with seamless third-party integrations.
      - Integration Specialist: Integrated external systems via RESTful APIs, SOAP web services, Apache Solr, and e-commerce platforms (Magento, Shopify).
      - Marketing Automation: Implemented personalised campaigns and A/B testing with Adobe Campaign and Adobe Target.
      - Infrastructure Automation: Automated deployment and configuration using Chef and Vagrant, ensuring environment consistency across development, testing, and production.
      - CI/CD: Built Jenkins pipelines integrated with Git, Jira, and SonarQube for automated build, test, and deployment processes.