Skip to main content
. Author manuscript; available in PMC: 2018 Mar 20.
Published in final edited form as: IEEE Trans Biomed Eng. 2016 Oct 10;64(2):263–273. doi: 10.1109/TBME.2016.2573285

TABLE IX.

Selected Platforms for Big Data Analytics

Platform Advantages Limitations
Apache Hadoop
(MapReduce)* [11,
142]
Horizontally scalable; fault-
tolerant; designed to be deployed
on commodity-grade hardware;
free and open-source
Generally most effective for
batch-mode processing; not
always appropriate for real-
time, online analytics
IBM InfoSphere
Platform* [143]
Includes purpose-built tools to
handle streaming information;
integrates with open-source tools
such as Hadoop
Commercial licensing
Apache Spark
Streaming* [144]
Integrates with the Hadoop stack;
allows one code base for both
batch-mode and online analysis
Depends on more expensive
hardware with large amounts
of RAM to work efficiently
Tableau, QlikView,
TIBCO Spotfire,
and other visual
analytics tools*
Visualization of large and
complex data sets
Generally incomplete
solutions, requiring other
tools to effectively handle
data storage
*

Highly impactful platform.