Table 2. Current and future data analysis needs of National Science Foundation (NSF) Biological Sciences Directorate (BIO) principal investigators (PIs) by the NSF BIO division.
Current needs | Needs in 3 years | |||||||
---|---|---|---|---|---|---|---|---|
Environ-mental biology 163 ≤ n ≤ 168 | Molecular and cellular biosciences 85 ≤ n ≤ 90 | Biological infra-structure 116 ≤ n ≤ 118 | Integrative organismal systems 159 ≤ n ≤ 161 | Environ-mental biology 124 ≤ n ≤ 162 | Molecular and cellular biosciences 59 ≤ n ≤ 85 | Biological infra-structure 85 ≤ n ≤ 117 | Integrative organismal systems 108 ≤ n ≤ 157 | |
Publish data to the community | 0.95 | 0.92 | 0.93 | 0.87 | 0.99 | 0.98 | 0.96 | 0.98 |
Sufficient data storage | 0.93 | 0.91 | 0.90 | 0.94 | 0.99 | 0.95 | 0.97 | 0.98 |
Share data with colleagues | 0.95 | 0.90 | 0.91 | 0.88 | 0.98 | 0.99 | 0.96 | 0.97 |
Updated analysis software | 0.92 | 0.88 | 0.91 | 0.91 | 0.95 | 0.93 | 0.95 | 0.99 |
Training on data management and metadata | 0.87 | 0.71 | 0.81 | 0.74 | 0.94 | 0.91 | 0.95 | 0.92 |
Support for bioinformatics and analysis | 0.80 | 0.83 | 0.72 | 0.76 | 0.89 | 0.90 | 0.88 | 0.87 |
Training on basic computing and scripting | 0.83 | 0.77 | 0.65 | 0.72 | 0.94 | 0.89 | 0.85 | 0.92 |
Search for data and discover relevant data sets | 0.75 | 0.77 | 0.77 | 0.71 | 0.93 | 0.93 | 0.91 | 0.88 |
Multistep analysis workflows or pipelines | 0.81 | 0.74 | 0.68 | 0.69 | 0.93 | 0.88 | 0.92 | 0.86 |
High performance computing (HPC) | 0.77 | 0.68 | 0.64 | 0.63 | 0.91 | 0.90 | 0.85 | 0.83 |
Training on integration of multiple data types | 0.69 | 0.57 | 0.62 | 0.65 | 0.91 | 0.93 | 0.89 | 0.91 |
Cloud computing | 0.56 | 0.41 | 0.50 | 0.46 | 0.87 | 0.85 | 0.84 | 0.87 |
Training on scaling analysis to cloud/HPC | 0.55 | 0.46 | 0.50 | 0.41 | 0.86 | 0.78 | 0.79 | 0.80 |
Percent responding affirmatively. Bold text indicates a statistically significant chi-square result between BIO divisions.