Marcelio Jr Di Marco

SRE/DEVOPS ENGINEER/AZURE/AWS/KUBERNETES/CI/CD AUTOMATION/CLOUD INFRASTRUCTURE & RELIABILITY

Johannesburg Metropolitan Area

About

I am a DevOps Engineer / Site Reliability Engineer with hands-on experience in cloud platforms, Kubernetes, monitoring, CI/CD automation, and observability. I’m passionate about building reliable, scalable systems and supporting teams with tools that make delivery faster and more efficient. I began my career as a software developer, working with Python, Java, HTML, CSS, JavaScript, Git, GitLab, and object-oriented programming. This foundation has given me a strong understanding of application development lifecycles, which I leverage to bridge the gap between development and operations. I enjoy working in collaborative environments, solving complex challenges, and learning new technologies as the industry evolves. My focus areas include Grafana, Prometheus, Loki, Thanos, Kubernetes, Azure (AKS, ARO, Velero), Automation (GitOps), CI/CD pipelines (Jenkins, GitHub Actions, ArgoCD), Kafka and Databases (MariaDB, PostgreSQL, Debezium). I believe in creating technology that integrates seamlessly into our daily lives and drives meaningful impact. My goal is to continue growing as a DevOps/SRE professional while contributing to projects that make systems more reliable, resilient, and valuable for both users and businesses. I'm determined to make a difference in the world of technology that will benefit humanity.

Experience

  • Cloud LYDR (Full-time · 2 yrs 2 mos)
    • DevOps Engineer / SRE
      Nov 2023 - Dec 2025 · 2 yrs 2 mos

      Engaged in on-the-job training and professional development in DevOps and SRE practices. Building skills in monitoring, networking, cloud infrastructure, Linux administration, scripting (Bash/Python), Kubernetes, Docker, Git tools, and CI/CD pipelines. Gaining exposure to multi-cloud environments (Azure, AWS) and modern infrastructure architecture. Contracted as an SRE to BancX, applying skills directly in production environments while continuing to expand technical expertise.

    • Junior SRE/DevOps Engineer
      Nov 2023 - Dec 2025 · 2 yrs 2 mos

      Leading the development and optimization of monitoring systems across multiple Kubernetes clusters and environments. Designing, implementing and maintaining Grafana dashboards to visualize critical metrics (PVCs, URLs, SSL certificates, logs), providing real-time visibility for engineers and stakeholders. Enhancing alerting systems to proactively detect issues in uat and production, improving uptime and customer satisfaction. Contributing to observability stack improvements leveraging Grafana, Prometheus, Thanos, Loki, Node Exporter, and Blackbox Exporter. Automating and streamlining deployments using Helm, ArgoCD, Jenkins, and GitHub. Supporting hybrid cloud environments with Microsoft Azure and Red Hat OpenShift, ensuring resilient infrastructure for mission-critical applications. Collaborating closely with cross-functional teams using Jira, Slack, and Postman, fostering efficient communication and problem-solving. Continuously learning, adapting, and applying new DevOps/SRE skills to improve system reliability and reduce operational risk.

    • Junior SRE/DevOps Engineer
      Nov 2023 - Dec 2025 · 2 yrs 2 mos

  • Site Reliability Engineer at BancX
    Jan 2024 - Nov 2025 · 1 yr 11 mos

    Leading the development and optimization of monitoring systems across multiple Kubernetes clusters and environments. Designing, implementing and maintaining Grafana dashboards to visualize critical metrics (PVCs, URLs, SSL certificates, logs), providing real-time visibility for engineers and stakeholders. Enhancing alerting systems to proactively detect issues in uat and production, improving uptime and customer satisfaction. Contributing to observability stack improvements leveraging Grafana, Prometheus, Thanos, Loki, Node Exporter, and Blackbox Exporter. Automating and streamlining deployments using Helm, ArgoCD, Jenkins, and GitHub. Supporting hybrid cloud environments with Microsoft Azure and Red Hat OpenShift, ensuring resilient infrastructure for mission-critical applications. Collaborating closely with cross-functional teams using Jira, Slack, and Postman, fostering efficient communication and problem-solving. Continuously learning, adapting, and applying new DevOps/SRE skills to improve system reliability and reduce operational risk.

  • Software Developer at GK Africa
    Jan 2023 - Feb 2023 · 2 mos

    Contributed to POS (Point of Sale) software development. Designed, implemented, and tested features using Promo (in-house language) and Java. Gained practical experience in software development lifecycle within a professional team environment.