The DataRobot AI Platform is presented as a solution that accelerates and democratizes data science by automating the end-to-end journey from data to value and allows users to deploy AI applications at scale. DataRobot provides a centrally governed platform that gives users AI to drive business outcomes, that is available on the user's cloud platform-of-choice, on-premise, or as a fully-managed service. The solutions include tools providing data preparation enabling users to explore and…
N/A
H2O.ai
Score 6.5 out of 10
N/A
An open-source end-to-end GenAI platform for air-gapped, on-premises or cloud VPC deployments. Users can query and summarize documents or just chat with local private GPT LLMs using h2oGPT, an Apache V2 open-source project. The commercially available Enterprise h2oGPTe provides information retrieval on internal data, privately hosts LLMs, and secures data.
DataRobot can be used for risk assessment, such as predicting the likelihood of loan default. It can handle both classification and regression tasks effectively. It relies on historical data for model training. If you have limited historical data or the data quality is poor, it may not be the best choice as it requires a sufficient amount of high-quality data for accurate model building.
Use H2O.ai whenever you need an easy-to-use tool, when you must be cost-efficient (you cannot charge the client extra for the software licenses used), need a tool with many of the algorithms normally used in data analytics, or need to work on one machine (either because moving data to cloud storage is not allowed or because connecting to Hadoop, etc. is simply not necessary). Also, you can call H2O directly from Python, which makes analysis more efficient.
Further improvements to their text analysis tool, to make it more like the Qualtrics text analysis tool, would be a great addition. Qualtrics has templates built into its text analysis tool for customer service, quality control, etc., and will automatically slot your text responses into categories associated with certain subareas of those larger categories.
This is not really a drawback, but rather a warning: Driverless AI is not a replacement for a data scientist yet, and will not replace data scientists in the next decade either. The Driverless AI feature delivers reliable results only if the analyst is sure about the meaning of the input data. Data quality is usually a major issue, and no tool can detect the meaning of the data in the input. Data scientists are also required for business interpretation of the findings. So be careful, and do not rely on this feature without a good understanding of what it really does in each step.
DataRobot presents a machine-learning platform designed by data scientists from an array of backgrounds to build precise predictive models in a fraction of the time previously taken. The technology involved addresses the critical shortage of data scientists by changing the speed and economics of predictive analytics. DataRobot uses parallel processing to evaluate models in R, Python, Spark MLlib, H2O and other open source libraries. It searches through possible combinations of algorithms, features, transformations, processing steps and tuning parameters to yield the best models for the dataset and predictive goal.
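To make the idea of searching over combinations of transformations and algorithms concrete, here is a minimal sketch in plain Python. It is not DataRobot's implementation or API; the toy dataset, the `TRANSFORMS` and `MODELS` tables, and the `search` function are all hypothetical names invented for illustration. The sketch fits every transformation/model pair on a training split and ranks the pairs by error on a holdout split.

```python
import itertools
import random


def make_data(n=200, seed=0):
    """Toy dataset (an assumption for this sketch): y is quadratic in x plus noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(0, 10) for _ in range(n)]
    ys = [3.0 * x * x + 2.0 + rng.gauss(0, 1.0) for x in xs]
    return xs, ys


# Candidate feature transformations (hypothetical, for illustration only).
TRANSFORMS = {
    "identity": lambda x: x,
    "square": lambda x: x * x,
}


def fit_mean(features, targets):
    """Baseline model: always predict the mean of the training targets."""
    mean_y = sum(targets) / len(targets)
    return lambda value: mean_y


def fit_linear(features, targets):
    """One-variable ordinary least squares: y = slope * feature + intercept."""
    mean_x = sum(features) / len(features)
    mean_y = sum(targets) / len(targets)
    var_x = sum((x - mean_x) ** 2 for x in features)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(features, targets))
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return lambda value: slope * value + intercept


# Candidate algorithms (hypothetical, for illustration only).
MODELS = {
    "mean": fit_mean,
    "linear": fit_linear,
}


def search(xs, ys, holdout=0.25):
    """Fit every transform/model pair and rank the pairs by holdout RMSE."""
    split = int(len(xs) * (1 - holdout))
    train_x, valid_x = xs[:split], xs[split:]
    train_y, valid_y = ys[:split], ys[split:]

    leaderboard = []
    for (t_name, transform), (m_name, fit) in itertools.product(
        TRANSFORMS.items(), MODELS.items()
    ):
        predict = fit([transform(x) for x in train_x], train_y)
        squared_errors = [
            (predict(transform(x)) - y) ** 2 for x, y in zip(valid_x, valid_y)
        ]
        rmse = (sum(squared_errors) / len(squared_errors)) ** 0.5
        leaderboard.append((rmse, t_name, m_name))

    leaderboard.sort()  # lowest holdout error first
    return leaderboard


if __name__ == "__main__":
    xs, ys = make_data()
    for rmse, t_name, m_name in search(xs, ys):
        print(f"{t_name:>8} + {m_name:<6} holdout RMSE = {rmse:.2f}")
```

On this toy data the pair that squares the input before fitting a line should rank first, because it matches how the data was generated. A real platform would search a far larger space and use more careful validation, but the loop has the same shape: enumerate candidates, fit each one, and compare them on data the model has not seen.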
As I write this report, I am working with DataRobot engineers in a complex environment, and we have their full support. We are in Mexico, and it is not common to get this level of commitment from companies without expensive service contracts. The installation is on-premises, the client does not want us to take control, and the client is also limited by internal IT regulations, so we are just doing magic and everybody is committed.
I've done machine learning through Python before. However, having to code and test each model individually was very time-consuming and required a lot of expertise. The DataRobot approach is an excellent way of getting to a well-placed starting point. You can then pick up the model from there and fine-tune it further if you need to.
I used Knime, RapidMiner, and Weka before I heard about H2O, but among them all I liked H2O best. Nowadays, Google's AutoML and the AWS SageMaker AutoML platform are really competitive, but they are more costly than H2O.
Positive impact: savings in infrastructure expenses - compared to other bulky tools, this costs a fraction
Positive impact: ability to get quick fixes from H2O when problems arise - compared to waiting for several months/years for new releases from other vendors
Positive impact: access to the H2O core team and the ability to get features that our business needs added quickly to the core H2O product