Azure Databricks: A Data Consultant's Dream
Use Cases and Deployment Scope
As a Big Data Consultant. Azure Databricks is my favorite tool in the house!
The biggest problems with data consulting is a plethora of programming languages it deals in, from SQL, Scala,R, Python, Java etc.
That is exactly where Azure Databricks excels! It supports all languages in a single notebook with an equivalent performance for all! Club that with a visually pleasing UI, features that integrate the entire data lifecycle, and an architecture that gets the best of spark and you have one of the best data tools in your hand!
Pros
- Data Processing and Transformations based on Spark
- Delta Lakehouse when clubbed with an external cloud storage
- Governance using Unity Catalog to unify IAM
- Delta Live Tables is a product, which although relatively newer, has a great potential with the visuals of a pipeline.
Cons
- The new UI is a bit clunky compared to the old UI. It also adds new elements in the sidebar which are not relevant to the workspace. Can be worked upon
- Delta Live Tables, although powerful, has a lot of things that can be improved, including error debugging, support for new things
- Concurrent requests need some more optimisation and work in the delta lake tables.
Return on Investment
- The support team is amazing, they help you at every stage of the projects, from sales to delivery.
- On a framework level, it has had an amazing impact and has reduced the clients overall data platform costs by a staggering 65%
- There has been a 40% Manual work requirement on average for the clients when they move to Azure Databricks Data Platform
Usability
Alternatives Considered
Jupyter Notebook, Azure Synapse Analytics and Cloudera Data Platform
Other Software Used
Azure Data Factory, Cloudera Data Platform, Apache Iceberg

