Málaga, Andalusia, Spain
- Proficient in Data Engineering, Data Platforms, Data Warehousing, DevOps, and MLOps
- Skilled in Microsoft Azure and AWS (AWS Data Engineer Certified)
- Experience with Databricks, Spark, ClickHouse, Greenplum MPP DB, SQL, Python, Scala, Airflow, dbt, Kafka + Kafka Connect/Debezium, NiFi, StreamSets, SAP HANA DB
- Strong knowledge of Azure Cloud, AWS, MongoDB, Cassandra, Hive, Oracle DB, Oracle GoldenGate, Postgres, MS SQL, SAP BusinessObjects, MicroStrategy, Superset, Metabase, Docker, Git, Terraform, Kubernetes, Grafana
- Experience working remotely with distributed international teams
Big Data projects.
Stack: Spark, Trino/Iceberg, GCP, Airflow
Architected and implemented a scalable Data Platform.
Stack: AWS, Spark, ClickHouse, Greenplum, Airflow, dbt, Kafka, Debezium, Oracle DB/Oracle GoldenGate, Python, SQL, Kubernetes
Responsible for cloud-related projects.
Stack: Azure Cloud, Databricks, ClickHouse, Greenplum, Airflow, SQL, Python, NiFi, Kafka, Terraform, Kubernetes
As a lead engineer, I was responsible for:
- Building the Data Platform from scratch (~100 TB of compressed data excluding replication, ~1,000 data loads) to collect data from different sources and deliver it to consumers
- Migrating the core of our Data Platform, Greenplum DB, from bare metal to Yandex Cloud, including functional and performance testing
- Team management: technical interviewing, onboarding, performance development, and growing the data engineering team from 2 to 20
Stack: AWS, Spark, ClickHouse, Greenplum, SAP Data Services ETL, Airflow, NiFi, Kafka, SQL, Python, Kubernetes
- Engineering of DWH/BI, including the merger of the two largest retailers
- Support of data warehouses based on the SAP and Hadoop stacks
Stack: SAP BW, SAP HANA DB, SAP Data Services, SAP BusinessObjects, Hadoop (Hive + Python), Oracle DB