About the Company
This position requires candidates to obtain a Public Trust Clearance. This is a 6 month contract with high likelihood of becoming permanent.
About the Role
We are seeking a skilled and driven Data Engineer to lead our data modernization initiatives. In this role, you will be instrumental in migrating, transforming, and optimizing our legacy SAS data infrastructure into a modern, scalable, and cloud-native data ecosystem.
Responsibilities
- Legacy Migration & Modernization: Analyze, reverse-engineer, and migrate legacy SAS datasets, macros, and ETL processes into modern Python-based applications and SQL workflows.
- Pipeline Development: Design, build, and maintain scalable, efficient, and automated ETL/ELT pipelines using Python (Pandas, PySpark, etc.) and orchestrators (e.g., Apache Airflow).
- Database Engineering: Design and optimize PostgreSQL database schemas, write complex and high-performing SQL queries, and manage data indexing and partitioning.
- Cloud Infrastructure: Deploy, manage, and optimize data workflows within AWS utilizing services such as S3, EC2, RDS, Lambda, and Redshift/Glue.
- Data Quality & Governance: Implement rigorous data validation, cleansing, and monitoring frameworks to guarantee data integrity during and after the migration.
- Collaboration & Documentation: Partner with business analysts and data scientists to understand legacy business logic and document the new cloud architecture.
Qualifications
- Experience: 5+ years of experience in Data Engineering, with a proven track record in data modernization or legacy migration projects.
- SAS Expertise: Strong reading proficiency in SAS (Base SAS, SAS Macros, PROC SQL) to understand and extract legacy business logic.
- Python Proficiency: Advanced programming skills in Python for data manipulation, scripting, and ETL development.
- SQL & PostgreSQL: Expert-level SQL skills and deep experience working with PostgreSQL (performance tuning, schema design, optimization).
- AWS Cloud: Hands-on experience with AWS data ecosystems (S3, RDS, IAM, Lambda, Glue, or Airflow/MWAA).
- ETL/Orchestration: Experience with modern data orchestration tools like Apache Airflow, Prefect, or AWS Step Functions.
Pay range and compensation package
$60-65/hr
Equal Opportunity Statement
We are committed to diversity and inclusivity.