TrustRadius: an HG Insights company

Azure Data Lake Analytics

Score8.3 out of 10

17 Reviews and Ratings

What is Azure Data Lake Analytics?

Microsoft's Azure Data Lake Analytics is a BI service for processing big data jobs without consideration for infrastructure.

Categories & Use Cases

" Azure Data Lake Analytics is accurate, fast, and reliable."

Use Cases and Deployment Scope

To make our work more accessible and more efficient, Azure Data Lake has provided fast access to and analysis of data. This product solves our need for quick reporting on cross-platform applications and bulk data from partners. We can manage on-premises access and roles because the analytics service integrates with Azure Active Directory. There are no clusters, virtual machines, or servers to manage, maintain, or fine-tune—the utility of a highly adaptable, Azure Blob Storage-based information lake that is also secure. Azure Data Lake Analytics' simple interface makes it a reliable and easy-to-use program for beginners. SQL benefits are combined with user code flexibility through the inclusion of U-SQL. Scalable distributed runtime for U-SQL allows us to analyze data across SQL Servers in Azure (SQL database and data warehouse) in a streamlined manner.

Pros

  • It combines big data.
  • Monitors and alerts are helpful.
  • Report visualization relies on analytics.
  • It is compatible with Power BI services for report generation.

Cons

  • The data pipeline is managed and monitored inefficiently.
  • Streaming and event processing workloads are lacking.
  • It's memory-intensive but useful for networking data and cloud storage.

Most Important Features

  • Easy to store for in-depth data analysis.
  • User-friendly and straightforward rest API.
  • SQL and C# scripting combined to make it easy to use.

Return on Investment

  • It lets us manage and scan data, making our work easy and efficient.
  • It helped me manage real-time data, process it, and send it to reporting.
  • Data centralization or data warehousing projects are being implemented with its help.

Alternatives Considered

Databricks Lakehouse Platform (Unified Analytics Platform)

Other Software Used

Splunk Enterprise, Microsoft Power BI, Google BigQuery, Confluent Platform

Playing with data have never been easy with Azure Data Lake Analytic

Use Cases and Deployment Scope

We have Azure Storage Blobs, which is traditionally one of the many ways that we would and probably more often and not the default way where we would store data. We make our data workforce by putting in Azure Storage Blobs We use the Azure SQL Database, a traditional SQL-based database. Microsoft makes that available to us in the Azure platform and we can host our data there. We also have a SQL Database, running an Azure on a Virtual Machine, if we don't want to use the Azure base SQL DB directly.

Pros

  • Allows us to take in data, unstructured or structured
  • Good documentation
  • SaaS

Cons

  • AWS Glue could be more effective.
  • There is no 24/7 support.
  • Documentation is not available online.

Most Important Features

  • Easy usage
  • Effective data storage
  • Report

Return on Investment

  • Of course price
  • Documentation is not available online.
  • Slow progress.

Alternatives Considered

Apache Spark

Other Software Used

a3doc cloud, Accountancy Cloud, Accio Data

Fast and scalable azure data lake analytics!

Use Cases and Deployment Scope

My primary use case in using and investigating Azure Data Lake Analytics was in comparing how it fulfilled aggregate build in our data lake environment compared to how Databricks solved for our initial use cases. At the time, in building out a raw, refined, and curated zone before landing data in a warehouse multiple bidirectional transformation processes run between the Refined to Curated and then ultimately Warehouse layer. Key was scale, cost, and performance as compared to what can be done in processing aggregates via Databricks and opposite that ELT to a warehouse like Snowflake instead of load from lake to Microsoft Synapse.

Pros

  • Process large data transformation jobs using pretty much any language needed.
  • Native integration with Azure storage.
  • Top notch security that fulfills all audit needs.
  • Easy to consolidate enterprise data under one location - Single source of truth.

Cons

  • Learning curve and professional services were the only reason why we got up and running quickly - Not a downside but a need to know.

Most Important Features

  • Uniqueness to run on a per job basis
  • Security and support services (professional services) are the best in the industry.

Return on Investment

  • Has allowed us to reduce compute expenses by enabling better synchronization of workloads and user usage.
  • Ease of data virtualization or rather connection of data sources from multiple locations.

Alternatives Considered

Databricks Lakehouse Platform (Unified Analytics Platform) and Denodo

Other Software Used

Databricks Lakehouse Platform (Unified Analytics Platform), Confluent Platform, Azure Bot Service (Microsoft Bot Framework), Azure Blob Storage, Pypestream, Kore.ai

Good choice regarding features

Use Cases and Deployment Scope

We have been using Azure Data Lake Analytics as one of our data lakes, we are collecting data from many different sources, storing it on the data lake, and processing this data. As result, we have Business Intelligence tools connected to this result which we use to present some KPIs.

Pros

  • Easy usage
  • Interface
  • Connectivity

Cons

  • Sometimes requires previous experience in cloud.

Most Important Features

  • Connectivity
  • All tools centralized

Return on Investment

  • Since we have implemented this solution, we have been more able to follow what is going on in our process and sells.
  • We are also sparing some money by comparing the costs now against the costs we had on-premise.

Alternatives Considered

Alibaba Cloud Data Lake Analytics

Other Software Used

AWS Backup, AWS CloudFormation, Amazon Kinesis, Amazon Athena, Amazon QuickSight, Amazon Elasticsearch Service, Matillion, Amazon DynamoDB, Amazon EMR (Elastic MapReduce), AWS Lambda

Value for Volume

Use Cases and Deployment Scope

Used Azure Data Lake Analytics while working for a CPG major to store/process/analyze large volumes of data (daily cadence). Used Python as a programming language for processing the stored data. Also, with fluctuating data volume across weekdays/weekends, ADL analytics was helpful in processing data on demand, and scale instantly, thereby enabling us to pay for the services used/rendered.

Pros

  • Effective and efficient data storage
  • pretty fast querying ability
  • Incredibly scalable (need based usage and billing)

Cons

  • There's a bit of bias towards cloud with ADL Analytics. Depending upon a company's infra strategy and investment plans, there are some challenges with migration and integeration.
  • Not worth the time/effort/money if the organization doesn't have "Volume" of data. Cost effective only when daily loads exceed around 1million.
  • While training materials are available online, Adoption rate - Yet to pick up.

Most Important Features

  • Ability to store data in its native format (Unstructured, semi-structured, images, online reviews)
  • Scalable and flexible - according to data loads
  • Cheaper storage option

Return on Investment

  • Yet to realize its full potential - Owing to skill shortage in the org
  • Adoption across organization a challenge

Alternatives Considered

Azure Blob Storage, Azure Data Factory and Azure Data Lake Storage

Other Software Used

Azure Data Factory, Microsoft Power BI, Databricks Lakehouse Platform (Unified Analytics Platform), SAS Enterprise Guide