Table 1.
Type | Advantages | Disadvantages |
---|---|---|
SharedStorage | No additional cost for read/write; fastest throughput for small clusters; no management of remote data | Total storage limited to Controller disk size; nonredundant storage; throughput limited on large clusters |
PersistentStorage | Persistent data; designed for extreme scalability | Storage and access incur cost |
LocalStorage | Best data locality; high I/O throughput | Nonrobust to worker failure; increased complexity for developer |
No-Storage | Best parallel scaling; no cost for data storage | Data must be recomputed for analysis |