Tavaxy
|
Personalized medicine and NGS (short DNA reads, DNA segments, phylogenetic and taxonomical analyze, EMBOSS, SAMtools, etc.) |
SCUFL, JSON, hierarchical workflow structure, asynchronous protocol and DAG style in workflow creation and execution |
Difficulty in combining bio-pipelines between Galaxy and Taverna’s workflows using SCUFL
Lack of sufficient interoperability
Does not support loops in workflow creation
Lack of opportunity of workflow sharing
|
Taverna2- Galaxy
|
Life Sciences (e.g. eukaryotic genome biology) |
SCUFL 2 (experimental), Semantics, RDF, OWL and DAG |
SCUFL 2 is still in Apache’s incubation
Does not support loops in workflow
Lack of opportunity in workflow sharing
|
Galaxy
|
NGS (QC and manipulation, Deep Tools, Mapping, RNA Analysis, SAMtools, BAM Tools, Picard, VCF Manipulation, Peak Calling, Variant Analysis, RNA Structure, Du Novo, Gemini, FASTA Manipulation, EMBOSS, etc.) |
Python, JavaScript, Shell script, OS: Linux and Mac OS X |
No proper interlinking mechanism in pipeline functionalities between dependent modules
Does not support loops in workflow creation
Does not support control-flow operations and remote services
No workflow language available rather than RDBMS
Adding new tools require advanced IT knowledge
|
KNIME
|
Pharma and healthcare (virtual high-throughput screening, chemical library enumeration, outlier detection in BioMed data and NGS analysis with KNIME Extension [107] |
Java/Eclipse, KNIME SDK and Spotfire (supports Python ad Perl scripts) |
JDBC mechanism to access the databases is slow
High latency time in requests and responses
Not scalable for large-scale data and heavy computation
No reproducibility of the computational results
|
Taverna
|
Domain-independent (bioinformatics, cheminformatics, gravitational wave analysis) |
WSDL, Java and DAG |
Not scalable for large-scale data and heavy computation
Slow response while creating large-scale workflow and submission, thereafter
No reproducibility of the computational results
|
Wings
|
Multi-omics analysis and cancer omics |
|
Not scalable for large-scale data and heavy computation
No data integration support
Lack of computational transparency
Lack of interoperability with other DWFS
|
Anduril
|
Cancer research and molecular biology, DNA, RNA and ChIP-seq, DNA and RNA microarrays, cytometry and image analysis |
Workflows are constructed using Scala, DAG notation, the AndurilScript, Developed in Java
OS: Windows, Linux, and Mac OS X
|
No data conversion support
Lack of interoperability with other DWFS
Cannot be configured on cloud infrastructure
Not suitable for workflows containing loops
|
Unipro UGENE
|
NGS: sequencing, annotationsMultiple alignments, phylogenetic trees, assemblies, RNA/ChIP-seq, raw NGS, local sequence alignment, protein sequencing, plasmid, variant calling, evolutionary biology and virology |
C ++, Qt, DAG style workflow creation and support
(Cross-platform software system)
|
Does not support loops in workflow creation
Data provenance cannot be ensured
Not scalable for large-scale data and heavy computation
Lack of computational transparency
No reproducibility of the computational results
|
Pipeline Pilot
|
NGS: gene expression and sequence data analysis, imaging, Pharma: drug–chemical material analysis, cheminformatics, ADMET, polymer properties synthesis, data modeling |
|
No control flow operation
Not scalable for large-scale data and heavy computation
Limited data provenance support
No reproducibility of the computational results
|