Post by Prateek Jalgaonkar
Lead Analytics Engineer @ Cigna Evernorth | Building Scalable Healthcare Analytics Systems
😴Spark Lazy Evaluation (aka “I’ll do it later” mode) Spark be like: 🙂↕️ “You want to filter?” “Cool.” 🙂↕️“Add a join?” “Noted.” 🙂↕️“Another transformation?” “Relax, I’m just planning.” Nothing actually runs. Spark is busy 😏: Writing things down 📝 Drawing a DAG Optimizing the plan Only when you say .show() / .count() / .write() 👇 -> Spark goes: 🚀 “Alright, NOW we run.” That’s Lazy Evaluation. Plan first. Execute later. Complain never 😄 #ApacheSpark #DataEngineering #BigData #LazyEvaluation #LearningInPublic