Bucharest, Romania
• Worked as a lead developer in the big data team which manages datasets for financial optimization projects. From May 2019 to October 2020 I was a contractor employed by Extia Romania. • Collaborated with data analysts in defining Spark batch jobs using JSON configs. • Increased tool friendliness by customizing GitLab pipelines and applying a Git branching model based on cluster environments. • Assisted the materialization of ideas into internal tools: data lineage graphs with Graphviz, monitoring dashboards in DataStudio, documentation template generator. • Migrated existing data pipelines from the on-premise cluster to Google Cloud Platform using a standardized stack created by Renault Digital. Spark, Scala, Maven, Python, Hive, BigQuery, Oozie, Airflow, Docker, GitLab
• Worked on the data analytics and insights team, focused on understanding the customer lifecycle. • Maintained an internal ETL application, added integration tests using Docker containers and configured Bamboo to deploy artifacts in Artifactory. • Defined Spark streaming jobs. Spark, Scala, sbt, Python, Kafka, Docker, Bitbucket
• Worked on a big data project which handled cellular network data. • Developed Spark applications with generic connectors, reusable Scala libraries, custom Flume sources and configured Jenkins. • Defined Spark batch jobs and used Oozie for defining recurrent and recovery flows. • Configured the Cloudera Hadoop distribution and secured the services using TLS and Kerberos. Spark, Scala, Java, Maven, Flume, Kafka, Hive, Kudu, Bitbucket
• Worked on a team specializing in building fault tolerant services for betting applications. • Developed a reporting service and expanded the NuGet library used by projects. Windows Service, WebApi, C#, SQL Server, Fiddler, TFS