Author manuscript; available in PMC: 2012 Nov 1.
Published in final edited form as: Nat Methods. 2012 Apr 27;9(5):459–462. doi: 10.1038/nmeth.1974

Table 1.

File Formats Used in the 1000 Genomes Project

| File Format | Description | Further information/Citation |
| --- | --- | --- |
| SRF | Container format for data from sequencing machines, based on ZTR | http://srf.sourceforge.net/ |
| FASTQ | Text-based format for sequences and quality values | http://en.wikipedia.org/wiki/FASTQ_format |
| SAM/BAM | Sequence Alignment and Map format: a compact alignment format for placing short-read data with respect to a reference genome. The consortium designed this file format. | http://samtools.sourceforge.net/; Reference 7 |
| VCF | Variant Call Format: a column-based text format for storing variant calls and the individual genotypes for those calls. The consortium designed this file format. | http://vcftools.sourceforge.net/; Reference 9 |
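
To make the "column-based" description of VCF concrete, the following is a minimal Python sketch that splits one data line into its site-level fields and per-sample genotypes. It assumes the tab-delimited layout given in the VCF specification (eight fixed columns, then a FORMAT column, then one column per sample). The function name `parse_vcf_line`, the sample names, and the example line are made up for illustration and are not taken from the project's data or tools. Real files also need header parsing and handling of missing values, for which established libraries are a better choice.

```python
# Fixed leading columns of a VCF data line (column names per the VCF specification).
FIXED_COLUMNS = ["CHROM", "POS", "ID", "REF", "ALT", "QUAL", "FILTER", "INFO"]


def parse_vcf_line(line, sample_names):
    """Split one tab-delimited VCF data line into site fields and per-sample genotypes.

    Hypothetical helper for illustration only; it does not parse headers,
    INFO key/value pairs, or missing-value markers.
    """
    fields = line.rstrip("\n").split("\t")
    record = dict(zip(FIXED_COLUMNS, fields[:8]))
    record["POS"] = int(record["POS"])

    genotypes = {}
    if len(fields) > 9:
        # Column 9 (FORMAT) names the colon-separated keys used in each sample column.
        format_keys = fields[8].split(":")
        for name, column in zip(sample_names, fields[9:]):
            genotypes[name] = dict(zip(format_keys, column.split(":")))
    record["genotypes"] = genotypes
    return record


# Illustrative line with two hypothetical samples.
example = "20\t14370\trs6054257\tG\tA\t29\tPASS\tDP=14\tGT:DP\t0|0:1\t1|0:8"
record = parse_vcf_line(example, ["sampleA", "sampleB"])
print(record["POS"], record["genotypes"]["sampleB"]["GT"])  # prints: 14370 1|0
```

Keeping the site-level columns separate from the per-sample columns mirrors the table's description of VCF as storing both variant calls and the individual genotypes for those calls.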