Table 1.
Number of samples | Sample Size | Computation time | ||
---|---|---|---|---|
FASTQ to BAM | BAM to VCF | Total (Hour) | ||
1 | 1.0GB | 0:45 | 2:30 | 3:15 |
1 | 1.5GB | 1:05 | 3:30 | 4:35 |
1 | 2.8GB | 2:03 | 6:50 | 8:53 |
103 | 165GB (1.5 GB ea.) | 1:05 | 3:30 | 5:00* |
OS: CentOS6.5, CPU: Intel Xeon 2Socket E5520 2.3 GHz 4Core × 200Node (total = 1,600 Cores), Disk Drive: MAHA distributed parallel file system for diagnosis (MAHA-FsDx: 1.4 PetaByte). *The actual time required for the analysis of 103 samples on 200 nodes of a MAHA-FsDx, parallel computer was 5 hours, instead of 4 hours 35 minutes.