New York, New York, United States
Developed a framework to use AWS Glue and AWS Redshift Spectrum to allow scalable access company data directly from our S3 Data Lake, helping lighten AWS Redshift cluster storage loads and allowing improved performance to replace existing data pipelines with a more robust data flow, as well as alleviating cost for excess redundant storage on both S3 and Redshift clusters. Worked with Systems Engineers to deploy SAS and Jenkins infrastructure for our data analytics and marketing teams. Performed a POC on an in-house data lineage system that would used by data engineers to describe data flow for business units. Hands-on experience with AWS services and creating ETL processes from various sources by using S3, Amazon Redshift, Redshift Spectrum, Lambda, Step Functions, and Glue as well as the Hadoop ecosystem. Understanding business units and consumers’ needs to provide them with clean and useful data to be used in analytical modeling and reporting as well as providing solutions to data issues that arise in the downstream data they use.
Full Stack Data Engineer. Developed a multi-processing python script to expedite full load API calls. Used AWS services S3, Amazon Redshift, Amazon Redshift Spectrum to document Adobe Analytics data. Developed a cohort analysis using tableau.
Developed client-facing API documentation. Participated in an intern project with teams across Sprinklr to build an editable info-graphic sales presentation for marketing strategy. Learned and practiced the importance of writing clean and readable code that would be used for future development projects.
Responded to emergencies and provision of emergency care and treatment required until Professional Emergency Services arrive.