Cairo, Egypt
With over 7 years of experience as a Data Engineer at SWVL, TRM Labs, and 0x Labs. I specialize in building and maintaining the foundational data systems that drive business success. I'm passionate about architecting solutions that convert complex data challenges into tangible value. I consistently deliver by designing, developing, and optimizing high-performance data pipelines and infrastructure. My toolkit includes Python, SQL, and Java, alongside deep knowledge of data modeling, data warehousing, and advanced ETL strategies (both batch and streaming). I'm committed to enabling organizations to leverage their data for impactful analysis and informed decision-making.
Solana Ecosystem Integration: - Led the engineering effort to integrate Solana into existing data pipelines, successfully adapting systems originally built for EVM concepts to Solana’s account-based model. - Developed custom indexing and parsing for 0x Protocol trades, enabling near-realtime tracking of trading volume and protocol revenue on Solana. Infrastructure Optimization & Migrations: - Engineered a 48x performance improvement on critical data workloads, reducing query execution times from 4 hours to 5 minutes. Data Quality & Observability Framework: - Architected and deployed a comprehensive data quality infrastructure using Soda and custom observability tools to establish a high-trust environment for protocol data. - Standardized monitoring practices to ensure data regressions were identified and mitigated.
- Scaled the Blockchain Data Engineering team from a solo initiative to a squad of 6 specialized engineers, establishing the strategic roadmap and owning the end-to-end lifecycle of blockchain data ingestion and indexing across the organization. - Architected a robust ingestion framework interacting with diverse RPC endpoints for real-time blockchain data coverage, including sophisticated chain re-org handling to ensure data integrity for real-time risk-scoring and monitoring products. - Fully owned the deployment and scaling of Solana and Near protocols — the platform's most complex, high-throughput data challenges — managing indexing that accounted for ~60% of total platform traffic. - Executed the industry's first comprehensive Solana data support milestone, collaborating with 4 distinct teams to ensure seamless integration across the entire product suite, positioning TRM as a leader in high-performance blockchain forensics. - Led stress-testing initiatives to validate infrastructure resilience at unprecedented data scales, resolving bottlenecks pre-production and achieving a zero-downtime rollout while maintaining continuous availability for global financial and regulatory stakeholders. - Mentored engineers and standardized development practices to build scalable data products serving both the core TRM API and internal forensic teams.
- Joined as an early engineer to architect the core foundations of a data platform currently processing Petabytes of blockchain data. - Designed and implemented scalable data models that successfully expanded the platform's support from 2 initial chains to over 50+ diverse blockchain networks. - Authored the first iteration of the TRM API, establishing the primary data delivery layer for downstream forensics and risk products. - Developed a high-impact Proof of Concept (POC) for real-time data pipelines, transitioning the infrastructure toward low-latency stream processing. - Established standardized ETL patterns and data modeling best practices that enabled rapid multi-chain expansion without architectural regressions.
- Engineered and maintained high-throughput ELT pipelines to orchestrate seamless data flow from production MongoDB clusters into BigQuery, supporting millions of ride transactions. - Architected scalable, event-driven systems using Apache Kafka and Apache Flink, enabling real-time streaming analytics that drove critical operational decisions. - Deployed and managed complex data workflows via Apache Airflow on Kubernetes (K8s) within GCP, ensuring 99.9% availability of the data platform during 10x growth periods. - Pioneered a real-time compliance system from scratch, which served as a primary technical blocker for market entry; its successful deployment directly enabled Swvl’s expansion into multiple new international markets. - Optimized large-scale data processing jobs using Apache Spark on AWS EMR, leveraging a multi-language stack (Python, Scala, Java) to reduce processing costs and latency. - Designed analytical data models that empowered cross-functional teams with real-time dashboards, shifting the organizational culture toward data-driven operational agility.
- Guided students through complex concepts of compiler design, including lexical analysis, parsing, semantic analysis, intermediate code generation, and optimization. - Facilitated practical application of theoretical knowledge by assisting students with lab assignments and projects focused on building compiler components. - Provided individualized support and clarification on course material, ensuring student comprehension and problem-solving skills in compiler construction. - Evaluated and graded student assignments, offering constructive feedback to enhance their understanding of compiler principles and development. - Adapted teaching methods to support remote/hybrid learning environments, ensuring continuity and effectiveness of instruction during the period.
- Provided instruction and support for an Operating Systems course, adapting to a fully remote learning environment due to the COVID-19 pandemic. - Clearly explained complex operating systems concepts to students, fostering understanding and engagement in a virtual setting. - Developed and graded exams and laboratory assignments to assess student comprehension and practical application of course material. - Designed and implemented a comprehensive capstone project, challenging students to apply theoretical knowledge to real-world scenarios. - Managed all aspects of course support remotely, ensuring a smooth and effective learning experience for students during an unprecedented period.