Hesham Haroon

AI Lead | Senior NLP & LLM Engineer | GenAI, RAG, Multi-Agent Systems | Arabic LLMs & NLP | MLOps, AWS, vLLM

Cairo, Egypt

About

I ship LLM systems that work in production. Not prototypes. Not demos. My focus is Arabic-first AI: building retrieval systems, agents, and chatbots that handle MSA and dialects at scale. I've automated 40-60% of customer queries on WhatsApp, built AI assistants for Saudi ministries, and led on-prem GenAI deployments that reduced manual work by 25-35%. I care about the hard parts: evaluation that catches real failures, inference costs that don't explode, and retrieval that actually retrieves. I've fine-tuned LLMs on domain data, optimized RAG pipelines, and built observability into LLM systems from day one. Core stack: Python, PyTorch, vLLM, llama.cpp, LangChain, Kubernetes, AWS. Published researcher in Arabic NLP. Mentor at lablab.ai. Currently leading AI at ECC and consulting for legal AI at i-Legal. Let's talk if you're building Arabic AI or deploying LLMs in production.

Experience