TABLE IX.
Platform | Advantages | Limitations |
---|---|---|
Apache Hadoop (MapReduce)* [11, 142] | Horizontally scalable; fault-tolerant; designed to be deployed on commodity-grade hardware; free and open-source | Generally most effective for batch-mode processing; not always appropriate for real-time, online analytics |
IBM InfoSphere Platform* [143] | Includes purpose-built tools to handle streaming information; integrates with open-source tools such as Hadoop | Commercial licensing |
Apache Spark Streaming* [144] | Integrates with the Hadoop stack; allows one code base for both batch-mode and online analysis | Depends on more expensive hardware with large amounts of RAM to work efficiently |
Tableau, QlikView, TIBCO Spotfire, and other visual analytics tools* | Visualization of large and complex data sets | Generally incomplete solutions, requiring other tools to effectively handle data storage |
*Highly impactful platform.