TrustRadius: an HG Insights company

Apache Flume Reviews & Insights

Score: 7.1 out of 10

9 Reviews and Ratings

Community insights

TrustRadius Insights for Apache Flume are summaries of user sentiment data from TrustRadius reviews and, when necessary, third-party data sources.

Pros

Easy Interpretation of Log Data: Users have found that Apache Flume makes it very easy to interpret log data in near real time. Several reviewers have mentioned that its user-friendliness and ease of use make it a convenient tool for analyzing logs efficiently.

Support for Multiple Data Sources: The ability of Apache Flume to support data collection from a variety of data sources is highly appreciated by users. Many reviewers have praised its flexibility and integration with other open-source tools, allowing them to collect large volumes of data from multiple applications and systems effortlessly.

Scalability and Reliability: The scalability, reliability, and fault tolerance of Apache Flume are highly valued by users. Numerous reviewers have highlighted its capability to handle large amounts of streaming data, ensuring smooth operations even under heavy loads.

Reviews

2 Reviews

Apache Flume for log aggregation and compliance monitoring in real-time

Rating: 8 out of 10
Incentivized

Use Cases and Deployment Scope

Apache Flume is used across the organization to aggregate and analyze log data in near real time for compliance purposes, with the goal of generating monthly compliance reports based on that log data.

Pros

  • As a log-centric system, Apache Flume parses and aggregates log data very well.
  • It is easy to customize for different sources (producers) of log data as well as for sinks (consumers).

Cons

  • It is very specific to log data ingestion, so it is pretty hard to use for anything else.
  • Data replication is not built in and needs to be added on top of Apache Flume (not a hard job, though).

Likelihood to Recommend

Apache Flume is well suited when the use case is log data ingestion and aggregation only, for example for configuration management compliance. It is not well suited where you need a general-purpose real-time data ingestion pipeline that can receive log data and other forms of data streams (e.g. IoT, messages).
Vetted Review
Apache Flume
3 years of experience

Apache Flume, the way your information flows

Rating: 7 out of 10
Incentivized

Use Cases and Deployment Scope

Apache Flume is a key piece of software in big data environments. We have used it along with CDC (Change Data Capture) to ingest near-real-time database changes into Kafka, so that the data is available for real-time analysis, machine learning, dynamic dashboards and so on.
We have also successfully integrated Apache Flume into log acquisition solutions (mainly PaaS and Docker) where application logs are difficult to access.

Pros

  • Multiple sources of data (sources) and destinations (sinks) that allow you to move data from and to any relevant data storage
  • It is very easy to set up and run
  • Very open to customization: you can create filters, enrichment, new sources and destinations
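
As a rough illustration of the kind of customization described above, the following plain-Python sketch models a filter step, an enrichment step, and fan-out to multiple destinations. It is a conceptual model only, not Flume's API (Flume itself is configured through properties files and extended in Java), and the names `drop_debug`, `add_host`, `run_pipeline`, and the host value `app-01` are all hypothetical.

```python
from typing import Callable, Iterable, List, Optional

Event = dict
# An interceptor returns a (possibly modified) event, or None to drop it.
Interceptor = Callable[[Event], Optional[Event]]
Sink = Callable[[Event], None]


def drop_debug(event: Event) -> Optional[Event]:
    """Filter step: discard DEBUG-level events."""
    return None if event.get("level") == "DEBUG" else event


def add_host(event: Event) -> Optional[Event]:
    """Enrichment step: tag the event with a static (hypothetical) host name."""
    return {**event, "host": "app-01"}


def run_pipeline(
    events: Iterable[Event],
    interceptors: List[Interceptor],
    sinks: List[Sink],
) -> None:
    """Pass each event through every interceptor, then deliver it to every sink."""
    for event in events:
        current: Optional[Event] = event
        for intercept in interceptors:
            current = intercept(current)
            if current is None:  # an interceptor dropped the event
                break
        if current is None:
            continue
        for sink in sinks:
            sink(current)


# Two in-memory lists stand in for two destinations.
archive: List[Event] = []
alerts: List[Event] = []
run_pipeline(
    [{"level": "DEBUG", "msg": "noise"}, {"level": "ERROR", "msg": "disk full"}],
    [drop_debug, add_host],
    [archive.append, alerts.append],
)
```

Only the ERROR event reaches both lists, with the extra `host` field attached.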

Cons

  • Apache Flume develops new functionality at a slower pace than other open-source projects; it is well behind Kafka and has some compatibility issues with the latest releases
  • It lacks HA (high availability) and FT (fault tolerance); it relies on third-party management software such as Hortonworks or Cloudera

Likelihood to Recommend

Apache Flume is well suited to small-batch and near-real-time processing projects that take data from one point to another with local processing only (that is, no external enrichment). Filtering, transforming and pushing to multiple destinations are common ground for Flume.
It is not so nice to use if your data needs external enrichment (taking data from external databases or web services), as transactions and (micro)batches may lead to reprocessing, and it relies on the application to avoid duplicates.
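
One common way for the application to avoid duplicates is to make the consumer idempotent. The sketch below assumes each event carries a unique `event_id` (a hypothetical field, not something described in this review or added by Flume by default) and simply skips IDs it has already handled. It illustrates the idea rather than any Flume feature.

```python
class IdempotentConsumer:
    """Processes each event at most once, keyed by a unique event ID."""

    def __init__(self):
        self._seen = set()    # IDs already handled (in memory, for illustration)
        self.processed = []   # events that were actually processed

    def handle_batch(self, batch):
        """Handle a batch and return how many events were new."""
        handled = 0
        for event in batch:
            event_id = event["event_id"]
            if event_id in self._seen:
                continue      # duplicate from a replayed batch: skip it
            self._seen.add(event_id)
            self.processed.append(event)
            handled += 1
        return handled


consumer = IdempotentConsumer()
batch = [
    {"event_id": "a1", "body": "login"},
    {"event_id": "a2", "body": "logout"},
]
first = consumer.handle_batch(batch)   # first delivery of the batch
replay = consumer.handle_batch(batch)  # same batch redelivered after a failed transaction
```

Here the redelivered batch adds nothing to `processed`. In a real deployment the set of seen IDs would likely need to live in durable storage and be bounded, for example by a time window.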