Data Engineer | High-growth spatial data InsurTech | Remote (UK) | Up to £65K + bonus
About the Company
As our Data Engineer, you won’t just work with data - you’ll be the one who brings it to life. We are a rapidly growing InsurTech firm specialising in high-fidelity geospatial data services, and we’re looking for someone to help us turn our data strategy into reality.
We work where we’re happiest, offering a remote-first environment (UK-based) with occasional travel for team collaboration. You’ll be at the heart of our mission, shaping the next generation of our data lakehouse and helping us provide actionable insights that make property underwriting smarter.
About the Role
You will help design, build, and maintain powerful, scalable data pipelines and platforms that keep our data easy to find, trustworthy, and ready for real-world use. This is a hands-on role where you’ll utilise cutting-edge tools like Databricks and modern cloud technologies to automate ETL processes and uncover insights from rich geospatial data.
We are looking for someone who champions best practices in metadata, testing, and governance. You will play a key role in delivering our medallion architecture, ensuring well-structured bronze, silver, and gold layers that power our suite of data products.
Responsibilities
- Design and run robust data pipelines using Databricks and AWS, transforming large-scale geospatial datasets into high-quality, usable data.
- Automate core ETL processes to enable smooth, reliable, and scalable data flows across the organisation.
- Support the delivery of our medallion architecture and develop metadata frameworks to make data easier to understand and trust.
- Partner with senior engineers and product teams to deliver genuinely useful data products.
- Advocate for best practices in testing and documentation, contributing to automated frameworks that keep data geospatially sound.
- Spot opportunities to make things faster by optimising pipelines and queries with techniques like spatial indexing.
- Keep everything aligned with security and licensing standards—no shortcuts.
Required Skills
- Proven ability to design, build, and maintain pipelines that just work, keeping workflows running smoothly behind the scenes.
- Comfortable working in the cloud, specifically with Databricks, and familiar with how AWS services power scalable platforms.
- Python is second nature to you, whether for building pipelines, automating processes, or writing tests.
- Strong SQL skills with an interest in (or experience with) PostgreSQL/PostGIS for spatial data.
- Experience structuring data through Bronze, Silver, and Gold layers using Python and Spark SQL.
- You should know, and have experience with, every keyword in the second half of this sentence: Lakeflow Spark Declarative Pipelines, ETL, Delta, Unity Catalog, Data Quality, Medallion Architecture, Change Data Capture, SCD Types, Python, Git, and GitHub.
Preferred Skills
- Curious about geospatial data and excited to learn tools like GDAL and DuckDB.
- Experience using BI tools (like Power BI) to tell clear, compelling stories with data.
- Fluent in friendly, informative chats and comfortable with change in a fast-moving environment.
What's in it for you
- A genuinely lovely, people-first team that supports growth and learning.
- Base comp of up to £65K plus a bonus.
- Health insurance, great pension, parental leave, and other benefits.
- The opportunity to learn and build your career in a PE-backed firm.