Paxata - an excellent tool to treat text
Use Cases and Deployment Scope
Paxata is used in our business to accelerate the business process of cleaning, enrichment and preparation of data to be fed into BI dashboards to drive insights business decisions. It is being used by multiple verticals in the analytics as well as the risk practice right now to service clients.
Pros
- Visualize distributions in large data sets effectively which enable the user to quickly spot outliers and treat them appropriately
- Provides recommendation to merge datasets based on matching column values
- The cluster and edit feature in my opinion is its most powerful feature and reduces cardinality in column with text
Cons
- Doesn't provide recommendation on how to impute values
- There is a lag quite often
- We can say whether a column has errors or quality issues in the first look
Likelihood to Recommend
Paxata can be highly useful to someone who doesn't like/have any experience with writing codes to treat data before using it as input into BI dashboards. Paxata can accelerate data cleaning in environments where a large amount of unclean data is generated and business decisions on the go are required. It performs really well while dealing with natural language.
