Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.
Posit
Score 10.0 out of 10
Posit, formerly RStudio, is a modular data science platform, combining open source and commercial products.
Pricing
Pricing Offerings | Cloudera Data Science Workbench | Posit
Editions & Modules | No answers on this topic | No answers on this topic
Free Trial | No | Yes
Free/Freemium Version | No | Yes
Premium Consulting/Integration Services | No | No
Entry-level Setup Fee | No setup fee | Optional
Additional Details | None listed | None listed
Features
Platform Connectivity
Feature (rating out of 10) | Cloudera Data Science Workbench | Posit
Overall | 7.5 (11% below category average) | 9.3 (11% above category average)
Connect to Multiple Data Sources | 7.0 | 8.0
Extend Existing Data Sources | 8.0 | 10.0
Automatic Data Format Detection | 7.0 | 10.0
MDM Integration | 8.0 | No ratings
Data Exploration
Feature (rating out of 10) | Cloudera Data Science Workbench | Posit
Overall | 7.6 (10% below category average) | 9.0 (7% above category average)
Visualization | 7.1 | 8.0
Interactive Data Analysis | 8.0 | 10.0
Data Preparation
Feature (rating out of 10) | Cloudera Data Science Workbench | Posit
Overall | 7.8 (4% below category average) | 10.0 (20% above category average)
Interactive Data Cleaning and Enrichment | 7.0 | 10.0
Data Transformations | 8.0 | 10.0
Data Encryption | 8.0 | No ratings
Built-in Processors | 8.0 | No ratings
Platform Data Modeling
Feature (rating out of 10) | Cloudera Data Science Workbench | Posit
Overall | 7.6 (10% below category average) | 10.0 (18% above category average)
Multiple Model Development Languages and Tools | 8.0 | 10.0
Automated Machine Learning | 7.0 | No ratings
Single platform for multiple model development | 7.1 | 10.0
Self-Service Model Delivery | 8.1 | 10.0
Model Deployment
Comparison of Model Deployment features of Cloudera Data Science Workbench and Posit
In my humble opinion, if you are working on something related to statistics, RStudio is your go-to tool. But if you are looking for something in machine learning, look to Python. The beauty is that there are now packages that let you write Python and SQL in R. Cross-platform functionality like this puts RStudio way ahead of its competition. The few chinks in RStudio's armor are very small, and raising them would be nitpicking for the sake of argument. Other than its being completely based on a programming language, I couldn't find significant drawbacks to using RStudio. It is one of the best free software packages available on the market at present.
Ability to scale across the company is limited by the user license; a dashboard cannot be shared for general viewing across the company.
Ability to retain a session: there is no simple method to customize the view per user (e.g., once a session has ended, users return next time to the baseline view).
Ability to enable communication between multiple users: leaving notes, tagging other users, or sharing a specific view.
There is no other platform that meets our needs. Even if it were terrible we would still use it, but fortunately for us it is a very solid project with a great support team. I hope in the future to expand our use and get more licences, as well as upgrade to RStudio Workbench, but for now we are very happy.
For someone who learns how to use the software and picks up the "language" of R, it's very easy to use. For beginners, it can be hard and might require a course, as well as the appropriate statistical training to understand which packages to use and when.
RStudio is readily available and cheap to use. It needs to be updated every once in a while, but the updates tend to be quick and they do not hinder my ability to make progress. I have not experienced any RStudio outages, and I have used the application quite a bit for a variety of statistical analyses.
Since R is popular among statisticians, you can find lots of help from the data science and stats communities. If you need help with anything related to RStudio or R, search Google or Stack Overflow and you will likely find the solution you are looking for.
Since our organization had already implemented Cloudera Data Platform as our Big Data Warehouse platform, implementing CDSW as the go-to Analytic and Data Science Platform is the most logical and cost-effective decision to make. It integrates seamlessly with our CDH clusters and it also provides enterprise-grade security for on-premise implementation.
RStudio was presented as the most customizable option. It was also strictly the most feature-rich as far as enabling our organization to script, run, and make use of R open-source packages in our data analysis workstreams. It also provided some support for Python, which was useful when we had R-heavy code with some Python threaded in. Overall, we picked RStudio for the features it provided for our data analysis needs and its ability to interface with our existing resources.
I think that RStudio scales pretty well based on the size of the datasets I'm using. Unlike some other statistical analysis programs, it has multithreading capabilities, which are very useful in cutting down on time. The format of RStudio's syntax also makes analyses very easy to replicate regardless of the scale of the analysis and data set.
Using it for data science in a very big and old company, the most positive impact, from my point of view, has been the ability to spread data culture across the group, shortening the path from data to value.
Still, it's hard to quantify the economic benefits. We are struggling with this and it's a major point of attention, since splitting out the contribution of the individual aspects of a project (and isolating RStudio's slice of the pie) is complicated.
What is certain is that, in the long run, RStudio is boosting productivity and making the process in which it is embedded more efficient (cost reduction).