Skip to main content
. 2017 Aug 29;7:9810. doi: 10.1038/s41598-017-09247-5

Table 1.

Summary of specifications for the computation time for 103 DNA samples versus one DNA sample.

Number of samples Sample Size Computation time
FASTQ to BAM BAM to VCF Total (Hour)
1 1.0GB 0:45 2:30 3:15
1 1.5GB 1:05 3:30 4:35
1 2.8GB 2:03 6:50 8:53
103 165GB (1.5 GB ea.) 1:05 3:30 5:00*

OS: CentOS6.5, CPU: Intel Xeon 2Socket E5520 2.3 GHz 4Core × 200Node (total = 1,600 Cores), Disk Drive: MAHA distributed parallel file system for diagnosis (MAHA-FsDx: 1.4 PetaByte). *The actual time required for the analysis of 103 samples on 200 nodes of a MAHA-FsDx, parallel computer was 5 hours, instead of 4 hours 35 minutes.