Singapore
I’m a seasoned data engineer with 12 years of experience in data warehousing and data governance across large-scale e-commerce environments. Currently, I’m actively exploring the intersection of big data and AI—building LLM-powered assistants for data discovery, metric explanation, and self-service SQL generation using GPT-4o and RAG architecture. Passionate about turning complex data into trusted, reusable assets—and making data more intelligent, accessible, and efficient for everyone.
Lead the end-to-end data engineering lifecycle for large-model development—collaborating closely with algorithm teams on model training, deployment and iteration; designing, deploying and integrating online and distributed data pipelines for multi-source ingestion, cleaning, labelling, automated data production and parameter-efficient fine-tuning (e.g., LoRA); establishing feedback loops, data return flows and quality-monitoring systems to continuously optimise real-world model performance; and building the data foundation through taxonomy, schema standardisation, feature engineering and team leadership to ensure high-quality cross-functional delivery and talent development.
Leading the fundamental data engineer team at Shopee, responsible for building the unified data layer and driving data governance across the MP e-commerce domain. Focus areas include compute/storage optimisation, data quality improvement, SLA stability, and sensitive data access control—enabling high-reusability, high-quality real-time and batch data for downstream BI, algorithm, and local data teams.