IBM Infosphere: Incremental change log replication to cloud in real time using Apache Hadoop
Use Cases and Deployment Scope
IBM InfoSphere Data Replication uses Incremental Data Delivery technology to replicate by reading the native database logs and detecting when a change happens. This is an improved performance alternative to user queries or triggers to see changes and upload the data.IBM InfoSphere Data Replication uploads the data to Apache Hadoop to store the information in clusters. Updates the cloud information in real-time.
Pros
- Performance to detect changes using Incremental Data Delivery with logs.
- Update data in the cloud in real time.
- Saves data in clusters using Apache Hadoop
Cons
- Better documentation and examples of Console and API.
- Integration with machine learning to analyze the data.
- Examples of how to replicate data from multiple sources and databases to one data lake.
Likelihood to Recommend
IBM InfoSphere Data Replication replication allows work on the daily operation and analyzes the replicated data in real-time with no downgrade of performance. This happens thanks to the incremental data load using the log files instead of querying the database for any changes. It serves as an online backup of the database uploading the data to the cloud in real-time.