Apache Cassandra vs. Riak

Overview
Product | Rating | Most Used By | Product Summary | Starting Price
Cassandra | Score 9.0 out of 10 | N/A | Cassandra is a NoSQL database from Apache. | N/A
Riak | Score 10.0 out of 10 | N/A | Riak is a NoSQL database from Basho Technologies in Bellevue, Washington. | N/A
Pricing
Editions & Modules | Apache Cassandra | Riak
Answers | No answers on this topic | No answers on this topic
Offerings
Pricing Offerings | Cassandra | Riak
Free Trial | No | No
Free/Freemium Version | No | No
Premium Consulting/Integration Services | No | No
Entry-level Setup Fee | No setup fee | No setup fee
Features
NoSQL Databases | Apache Cassandra | Riak
Category score | 8.0 (10% below category average) | 9.4 (6% above category average)
Performance | 8.5 (0 ratings) | 10.0 (0 ratings)
Availability | 8.8 (0 ratings) | 10.0 (0 ratings)
Concurrency | 7.6 (0 ratings) | 10.0 (0 ratings)
Security | 8.0 (0 ratings) | 6.0 (0 ratings)
Scalability | 9.5 (0 ratings) | 10.0 (0 ratings)
Data model flexibility | 6.7 (0 ratings) | 10.0 (0 ratings)
Deployment model flexibility | 7.0 (0 ratings) | 10.0 (0 ratings)
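The two category scores above (8.0 and 9.4) are consistent with a plain average of the seven feature scores. The page does not state how the category score is computed, so the averaging rule in this short sketch is an assumption, and the names `FEATURE_SCORES` and `category_score` are hypothetical.

```python
from statistics import mean

# Feature scores copied from the comparison above: (Cassandra, Riak).
FEATURE_SCORES = {
    "Performance": (8.5, 10.0),
    "Availability": (8.8, 10.0),
    "Concurrency": (7.6, 10.0),
    "Security": (8.0, 6.0),
    "Scalability": (9.5, 10.0),
    "Data model flexibility": (6.7, 10.0),
    "Deployment model flexibility": (7.0, 10.0),
}


def category_score(column: int) -> float:
    """Average one product's feature scores, rounded to one decimal.

    Assumption: the category score is an unweighted mean of the features.
    """
    return round(mean(scores[column] for scores in FEATURE_SCORES.values()), 1)


cassandra_score = category_score(0)
riak_score = category_score(1)
print(cassandra_score, riak_score)
```

Under this assumption, Riak's single low score (Security) is what keeps its category score below 10.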
User Ratings
Rating | Apache Cassandra | Riak
Likelihood to Recommend | 6.0 (0 ratings) | 10.0 (0 ratings)
Likelihood to Renew | 8.6 (0 ratings) | 9.0 (0 ratings)
Usability | 7.0 (0 ratings) | - (0 ratings)
Support Rating | 7.0 (0 ratings) | 9.0 (0 ratings)
Implementation Rating | 7.0 (0 ratings) | - (0 ratings)
User Testimonials
Likelihood to Recommend
Apache Cassandra: Cassandra excels in a broad range of applications -- especially if you understand its data model and write your applications accordingly. It's an excellent choice for time-series data, and a poor choice for application queues. It performs the best if you can simply record history and compute from it, rather than going back and editing or deleting things a lot.
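The Cassandra reviewer's advice to "record history and compute from it" can be pictured with a toy append-only log. This is a plain-Python sketch, not Cassandra code: the `EventLog` class, the (sensor, day) partitioning, and the sample readings are all hypothetical, chosen only to show writes that append and reads that derive values from the recorded history.

```python
from collections import defaultdict
from datetime import date, datetime, timedelta, timezone


class EventLog:
    """Toy append-only store: one partition per (sensor, day)."""

    def __init__(self) -> None:
        self._partitions = defaultdict(list)

    def record(self, sensor: str, ts: datetime, value: float) -> None:
        # Writes only append; nothing is edited or deleted in place.
        self._partitions[(sensor, ts.date())].append((ts, value))

    def day_average(self, sensor: str, day: date):
        # Derived values are computed from the recorded history at read time.
        rows = self._partitions.get((sensor, day), [])
        if not rows:
            return None
        return sum(value for _, value in rows) / len(rows)


log = EventLog()
start = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
log.record("sensor-1", start, 20.0)
log.record("sensor-1", start + timedelta(minutes=5), 22.0)
average = log.day_average("sensor-1", start.date())
```

A queue, by contrast, needs constant deletes of consumed items, which is the access pattern the reviewer warns against.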
Riak: Riak is very good if you need a resilient data store that can handle large numbers of documents very fast. If you have 1,000,000 documents and need to execute complex queries, it is great. Riak's Solr engine is fast; however, if you have an extremely high number of queries in a very short time, it can fail badly.
Pros
Apache Cassandra:
  • High Availability - we utilize the data replication features of Cassandra. This enables us to access our data even when several nodes have gone down.
  • Data Locality - our architecture combines Cassandra storage nodes and computation nodes on the same machine. This enables us to utilize data locality and limit expensive network IO to read data.
  • Elasticity - Cassandra is a shared-nothing architecture. Nodes can be added very easily and they discover the network topology. As soon as a node has joined the Cassandra ring, the data is redistributed among the existing nodes and streamed to it automatically.
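The elasticity point in the Cassandra list above (a node joins the ring and data is streamed to it) rests on hash-ring partitioning. The following is a simplified sketch with one token per node and MD5 as the hash; real clusters use a different partitioner and multiple tokens per node, and the `Ring` class and node names here are hypothetical.

```python
import bisect
import hashlib


def _token(value: str) -> int:
    """Map a string to a position on the ring (MD5 is an illustrative choice)."""
    return int(hashlib.md5(value.encode()).hexdigest(), 16)


class Ring:
    """Toy hash ring: each key belongs to the first node token at or after it."""

    def __init__(self, nodes):
        self._tokens = sorted((_token(node), node) for node in nodes)

    def add(self, node: str) -> None:
        bisect.insort(self._tokens, (_token(node), node))

    def owner(self, key: str) -> str:
        index = bisect.bisect(self._tokens, (_token(key), ""))
        # Wrap around to the first node when the key hashes past the last token.
        return self._tokens[index % len(self._tokens)][1]


keys = [f"key-{i}" for i in range(1000)]
ring = Ring(["node-a", "node-b", "node-c"])
before = {key: ring.owner(key) for key in keys}

ring.add("node-d")
after = {key: ring.owner(key) for key in keys}

# Only keys that now belong to the new node change owner.
moved = [key for key in keys if before[key] != after[key]]
print(f"{len(moved)} of {len(keys)} keys moved to node-d")
```

The design point is that adding a node relocates only the keys falling in the new node's range, so the rest of the cluster keeps serving its data untouched.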
Riak:
  • Reliability -- we rarely have to do anything to maintain our Riak instance. It is just online and available for whatever we throw at it.
  • The Riak Python client is an excellent tool and handles parallel writes/reads very well.
  • There is a large and very receptive community of Riak users and developers who seem to be able to help with most technical questions that have arisen.
Cons
Apache Cassandra:
  • No Ad-Hoc Queries: Cassandra's data storage layer is basically a key-value storage system. This means that you must "model" your data around the queries you want to surface, rather than around the structure of the data itself.
  • There are no aggregation queries available in Cassandra.
  • Not fit for transactional data.
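The first Cassandra drawback above, modeling data around queries, usually means writing the same record into one structure per query. The sketch below imitates that with plain Python dictionaries rather than real tables; the order data and all names are hypothetical.

```python
from collections import defaultdict

# One "table" per query: the same order is stored twice, keyed differently.
orders_by_customer = defaultdict(list)  # answers "orders for customer X"
orders_by_status = defaultdict(list)    # answers "orders with status Y"


def save_order(order_id: int, customer: str, status: str) -> None:
    """Write the order to every query-specific structure (denormalization)."""
    row = {"order_id": order_id, "customer": customer, "status": status}
    orders_by_customer[customer].append(row)
    orders_by_status[status].append(row)


save_order(1, "alice", "shipped")
save_order(2, "bob", "pending")
save_order(3, "alice", "pending")

# Each supported query is a single key lookup.
alice_orders = [row["order_id"] for row in orders_by_customer["alice"]]
pending_orders = [row["order_id"] for row in orders_by_status["pending"]]

# A query nobody planned for has no structure to serve it, so it must scan
# everything, and any aggregation happens in application code.
alice_pending = [
    row["order_id"]
    for rows in orders_by_customer.values()
    for row in rows
    if row["customer"] == "alice" and row["status"] == "pending"
]
```

The trade-off is extra writes and storage in exchange for reads that never need a scan.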
Riak:
  • Missing a free text search function
  • More security work
  • Multi-tenant reporting
  • More types of index optimised for different structures
  • Automating repairs especially after unclean shutdowns
  • WebDAV/Samba shares for Riak CS
  • Implementing the SQL queries from Riak TS in Riak KV
  • Settable replication bandwidth caps
  • Safemode start up after failure
  • More client integrations
Likelihood to Renew
Apache Cassandra: I would recommend Cassandra DB to those who know their use case very well, as well as know how they are going to store and retrieve data. If you need a guarantee in data storage and retrieval, and a DB that can be linearly grown by adding nodes across availability zones and regions, then this is the database you should choose.
Riak: Riak works great for our use case, but the fact that deletes seem to resurrect is a real issue for us. Unless we can get this solved, we'll continue to look at other products to see if our use case fits. Otherwise Riak is a great product and it fits our use case 95%. We have found workarounds for the remaining 5%.
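The Riak reviewer does not say why deletes resurrect. One common cause in eventually consistent stores in general is a replica that missed the delete and later "repairs" the others with its stale copy. The simulation below illustrates that general mechanism only; it is not Riak's implementation, and the replica dictionaries, `repair`, and `read` are hypothetical.

```python
TOMBSTONE = object()  # marker meaning "this key was deleted"


def repair(replicas, key) -> None:
    """Copy the newest (timestamp, value) entry for key to every replica."""
    versions = [replica[key] for replica in replicas if key in replica]
    if not versions:
        return
    newest = max(versions, key=lambda entry: entry[0])
    for replica in replicas:
        replica[key] = newest


def read(replicas, key):
    """Repair first, then return the value (None if absent or deleted)."""
    repair(replicas, key)
    entry = replicas[0].get(key)
    if entry is None or entry[1] is TOMBSTONE:
        return None
    return entry[1]


# Case 1: the key is removed outright on two replicas while the third is down.
a, b, c = ({"k": (1, "v1")} for _ in range(3))
del a["k"], b["k"]                      # c missed the delete
resurrected = read([a, b, c], "k")      # the stale copy on c wins the repair

# Case 2: the delete is recorded as a newer tombstone instead.
a, b, c = ({"k": (1, "v1")} for _ in range(3))
a["k"] = b["k"] = (2, TOMBSTONE)        # c missed the delete again
kept_deleted = read([a, b, c], "k")     # the tombstone outranks the stale copy
```

In this model a delete stays deleted only as long as the tombstone outlives every stale replica, which is why removing tombstones too early can bring data back.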
Usability
Apache Cassandra: It's a great tool, but it can be complicated when it comes to administration and maintenance.
Riak: No answers on this topic
Support Rating
Apache Cassandra: Sometimes, instead of getting a straight answer, we are transferred to professional services.
Riak: Despite Basho going bankrupt and the project becoming fully open-source, community support is reasonably good, albeit a little slow at times. Paid enterprise-grade support is also available from former Basho engineers, and the same company also contributes to the community support for free for basic questions or specific knowledge areas.
Alternatives Considered
Apache Cassandra: Apache Cassandra has the best of both worlds: it is a Java-based, linearly scalable, fault-tolerant, distributed, masterless NoSQL time-series database with best-in-class tunable performance across different workloads. We have used both Apache HBase and MongoDB for some use cases, within a Hadoop setup and as a JSON (JavaScript Object Notation) document store respectively, but given the overall factors favoring Apache Cassandra, it is a technology choice for multiple platforms!
Riak: MongoDB seems to have copied a lot of functionality from Riak. This may be because MongoDB hired a number of former Basho engineers when Basho went bankrupt. That said, the new functions added to Riak after it became open source have successfully differentiated it from MongoDB.
Amazon S3 is a nice tool, but when you are at significant scale with regionally specific data (joys of GDPR), it's much easier to keep it in house, and Riak CS lets you do exactly that. All you need to do is point your application at Riak CS instead of Amazon S3 and it just works as if nothing has changed.
When we evaluated against Cassandra, we found the tools available did not match our needs at the time.
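"Pointing your application at Riak CS instead of Amazon S3" typically comes down to overriding the endpoint in an S3 client. The sketch below only builds the settings; it assumes the application uses boto3, whose `client` call accepts `endpoint_url`, `aws_access_key_id`, and `aws_secret_access_key`. The helper name, the host name, and the credentials are placeholders, not values from the review.

```python
def s3_client_settings(endpoint_url=None,
                       access_key="EXAMPLE_KEY",
                       secret_key="EXAMPLE_SECRET"):
    """Build keyword arguments for an S3 client.

    With no endpoint_url the client talks to Amazon S3; with one, the same
    application code talks to an S3-compatible store instead.
    """
    settings = {
        "service_name": "s3",
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }
    if endpoint_url is not None:
        settings["endpoint_url"] = endpoint_url
    return settings


amazon = s3_client_settings()
in_house = s3_client_settings("http://riak-cs.example.internal:8080")

# With boto3 installed, either dictionary could be passed as:
#   import boto3
#   client = boto3.client(**in_house)
```

Whether every S3 feature an application relies on is supported by Riak CS is not covered by the review and would need checking case by case.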
Return on Investment
Apache Cassandra:
  • The open source version of Cassandra is only suggested for learning the basic concepts and playing with its core features. Unless you really want to invest a lot in your developers and architects knowing every detail of Cassandra, I prefer the DataStax enterprise version. Although the license cost is relatively high, I think it is worth it. I'm thinking about the support, the monitoring tool OpsCenter, and the integration of Solr and Spark (for data analysis).
  • Cassandra didn't fully replace our old and traditional relational database, Oracle. In addition, it opens another door for us to deal with some special business use cases that a NoSQL database can handle in a more feasible and efficient way.
Riak:
  • Riak's simple API and simple management model made it a no-brainer when it came to adopting it as a technology for the team.