Post by Sonal Holankar
Lead Software Engineer@Capgemini | Ex-Jio | Data Engineering | SQL | Python | Data Modelling | Cloud (Azure/AWS/GCP) | Big Data | ETL Pipelines | Data Analytics | Big Data Enthusiast |
#InterestingBigDataEngineering π π¨ Databricks Free Edition β What I Discovered (Hands-On!) π¨ I recently spent some time exploring the Databricks Free Edition workspace, and here are a few practical limitations I encountered while working on real use cases π π Key Observations: 1οΈβ£ Scala + Serverless = Not Yet Supported If you're planning to use Scala, Serverless compute wonβt work (at least for now). Youβll need to switch to a classic cluster that supports Scala. 2οΈβ£ PySpark RDDs Not Allowed Tried working with RDDs on Serverless compute β no luck β Looks like Databricks is pushing more toward DataFrame-based APIs. 3οΈβ£ PERSIST TABLE Not Supported Attempted to persist tables using SQLβ¦ but Serverless compute doesnβt support it yet. πΈ Iβve attached screenshots below for reference so you can see exactly what errors/limitations look like in practice. π‘ My Takeaway: Serverless is great for quick setups and ease of use, but if you're doing advanced operations or using Scala/RDDs, classic clusters are still essential. #Reference : https://lnkd.in/exyCrHns π€ Curious to hear from you: Have you faced any other limitations in Databricks Free Edition? Or found workarounds for these? Drop your thoughts in the comments π #Databricks #BigData #DataEngineering #PySpark #Scala #CloudComputing