Skip to main content
. 2014 Jun 16;9(6):e100195. doi: 10.1371/journal.pone.0100195

Table 1. Temporal performances (time required for computation) for each step of a typical data flow.

Dataset #1∶105 initial VMS pings Dataset #2∶106 initial VMS pings
General Step of the Analysis Tool Time required tocomplete the processing(minutes) Main Statistics Time required tocomplete theprocessing (minutes) Main Statistics
VMS Data Management Edit RawData <1 0 NAs found in latitude degrees;0 NAs found in latitude minutes;0 NAs found in latitude seconds;0 NAs found in latitude direction;0 Latitudes out of range (−90/90);0 NAs found in longitude degrees;0 NAs found in longitude minutes;0 NAs found in longitude seconds;0 NAs found in longitude direction;0 Longitudes out of range (−180/180);0 NAs found in dates; 0 NAs found inhours; 0 NAs found in minutes; 0 NAsfound in seconds; 0 dates found withbad format; 0 NAs found in knotsspeed; 0 NAs found in degrees heading <1 0 NAs found in latitude degrees; 0 NAs found inlatitude minutes; 0 NAs found in latitude seconds;0 NAs found in latitude direction; 0 Latitudes outof range (−90/90); 0 NAs found in longitude degrees;0 NAs found in longitude minutes; 0 NAs found inlongitude seconds; 0 NAs found in longitude direction;0 Longitudes out of range (−180/180); 0 NAs found indates; 0 NAs found in hours; 0 NAs found in minutes;0 NAs found in seconds; 0 dates found with badformat; 0 NAs found in knots speed; 0 NAs found in degrees heading
CreateDatabase <1 / <1 /
Load DBin theVMS DataViewer <1 / <1 /
Clean DBData 20 Found 4584 (4.58% of total) duplicatedpings; Found 31019 (31.02% of total)pings in harbour; Found 7550 (7.55%of total) pings on land; Found 38(0.04% of total) not coherent pings 150 Found 98381 (9.84% of total) duplicated pings;Found 272996 (27.31% of total) pings in harbour;Found 124015 (12.41% of total) pings on land;Found 279 (0.03% of total) not coherent pings
Track Cutting 5 3883 tracks detected 40 36994 tracks detected
Interpolation(10 minutesfrequency) 20 Number of pings changed increasesfrom 105 (real) to 4.9*105 (interpolated) 180 Number of pings changed increases from 106(real) to 4.4*106 (interpolated)
Assign Bathymetry(1 degree resolution) 150 (Fast& Heavy Algorithm)/60(Slow & Light) / 1000 (Fast& HeavyAlgorithm)/400(Slow & Light)$ /
Assign Area(MediterraneanGSAs) 100 / 900 /
Subtotal 295 (∼5 hours) 2237 2270 (∼37 hours)
Logbook Data Management (2×105 initial records) Tool Time required to complete the processing (minutes) Main Statistics
Edit Raw Data <1 20 NAs found in Start Times; 140 NAs found in End Times;0 NAs found in Species; 0 NAs found in Quantity;Removed 0.08% of data, that is 160 logbooks
Create Database 180 This step requires only few minutes for the EFLALO format
Métier Discovery(searching between2–30 groups on thewhole dataset with100 samples of 1000records each) 20 The best partitioning corresponded to 11 métiers. These are alsoavailable into to the package as reference dataset for Métier Classification
Métier editing Depends by the user, reasonablyno more than half an hour 69397 records in the database; 387 species
Métier Classification 20 /
Subtotal ∼250 (∼ 4 hours)
VMS-Logbook Analysis Tool Time required to complete the processing (minutes) Main Statistics Time required to complete the processing (minutes) Main Statistics
Logbook- VMSMatching 5 60.3% of VMS tracks with matching in LB database 30 57.2% of VMS tracks with matching in LB database
Predict Métier: DataPreparation 10 / 50 /
Predict Métier:Training ANN 3 Prediction completed for all the tracks without métiers by LB database 3 Prediction completed for all the trackswithout métiers by LB database
Find Fishing Points 30 / 45 /
Subtotal ∼50 (∼1 hours) ∼128 (∼2 hours)
Data Output Tool Time required to complete the processing (minutes) Main Statistics Time required to complete the processing (minutes) Main Statistics
Gridding & Mapping(for each métier) 2 / 9 /
DCF Indicators(for each métier) <1 / <1 /
Trawled Area Viewer Untested / / /
Subtotal ∼3 ∼10
Complete data flow Total ∼600 minutes (∼10 hours) ∼3180 minutes (∼53 hours/2.5 days)

These data was measured on three sample datasets (two for VMS and one for Logbook).

$

The Fast & Heavy Algorithm was tested on a different personal computer with 16 Gb of Ram.