2014 Apr 4;15:260. doi: 10.1186/1471-2164-15-260

Figure 2

Computational pipeline for data mining of short-read transcriptome/genome data. Details of the mined plants and algae are provided in Tables 1 and 2. SRA: Sequence Read Archive of the NCBI; ESTs: expressed sequence tags; PUTs: PlantGDB-assembled unique transcripts. fasty and tfasty are two homology search commands of the FASTA package [52] (see Methods); hmmsearch is a command of the HMMER3 package [47]. The two Pfam domains are the Pfam models Cellulose_synt and Glycos_transf_2. Q marks the sequence set used as the query in a homology search; DB marks the set used as the database. Published Csl protein homologs are from [27].
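
The hmmsearch step named in the caption could be scripted roughly as follows. This is a minimal sketch, not the authors' pipeline: it assumes the sequences have already been translated to protein, that each Pfam model is available as a local HMM file, and that hits are read from the table written by `--tblout`. The function names, file names, and the E-value cutoff of 0.01 are hypothetical choices for illustration; the caption does not state the thresholds used.

```python
from typing import Dict, Iterable, List, Set

# Pfam model names taken from the figure caption.
PFAM_MODELS = ("Cellulose_synt", "Glycos_transf_2")


def hmmsearch_command(hmm_file: str, protein_fasta: str, tblout: str,
                      evalue: float = 1e-2) -> List[str]:
    """Build an hmmsearch argument list (cutoff is an assumed value)."""
    return ["hmmsearch", "--tblout", tblout, "-E", str(evalue),
            hmm_file, protein_fasta]


def parse_tblout(lines: Iterable[str],
                 max_evalue: float = 1e-2) -> Dict[str, Set[str]]:
    """Map each target sequence to the Pfam models it hits.

    In the --tblout format, comment lines start with '#'; column 1 is
    the target name, column 3 the query (model) name, and column 5 the
    full-sequence E-value.
    """
    hits: Dict[str, Set[str]] = {}
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split()
        target, model, evalue = fields[0], fields[2], float(fields[4])
        if evalue <= max_evalue:
            hits.setdefault(target, set()).add(model)
    return hits


if __name__ == "__main__":
    # Hypothetical file names; print the commands instead of running them.
    for model in PFAM_MODELS:
        cmd = hmmsearch_command(f"{model}.hmm", "translated.faa",
                                f"{model}.tbl")
        print(" ".join(cmd))
```

Sequences that hit either model would then be candidates for the fasty/tfasty comparison against the published Csl homologs shown in the figure.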