Erratum
The original version of this article [1] unfortunately contained a publisher error in Fig. 4. The figure was incorrectly captured as a duplicate of Fig. 5. The correct Fig. 4 has been published in this Erratum. See Fig. 1.
Fig. 1.

The ratio of the FHadoop/FHPC as a function of the reciprocal dataset size in Gb. The pipelines were run on the Hadoop I and II clusters, as well as a 16 core HPC node. The analytical curve f(x) = (a1x + b1)/(a2x + b2) was used to fit the data for the stretches of linear scaling of calculation time on the HPC platform. The outliers are marked with crossed symbols
Footnotes
The online version of the original article can be found under doi:10.1186/s13742-015-0058-5.
Reference
- 1.Siretskiy A, Sundqvist T, Voznesenskiy M, Spjuth O. A quantitative assessment of the Hadoop framework for analyzing massively parallel DNA sequencing data. GigaScience. 2015;4:26. doi: 10.1186/s13742-015-0058-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
