Skip to main content
. 2017 Dec 4;8:56. doi: 10.1186/s13326-017-0166-5

Table 7.

Matching Strategies in the participating systems - 1

System String-based strategies Structure-based strategies Constraint-based strategies Instance-based strategies
AgreementMaker SubString, Edit-Distance, TF-IDF
ALIN SimMetrics APIa, WS4J APIb - -
AML Jaccard, I-Sub
Anchor-Flood Jaro-Winkler -
AOAS Jaro-Winkler - -
AOT, AOTL Edit-Distance, Block-Distance,
SLIM-Winkler, Jaro-Winkler, - - -
Smith-Winkler, Needleman-Wunsch
AROMA Jaro-Winkler -
ASMOV Edit-Distance
BLOOMS Jaccard, Exact Match, Lin, - - -
Jaro-Winkler
CIDER-CL Soft TF-IDF, Jaro-Winkler - -
CODI Edit-Distance, Jaro-Winkler, Cosine,
Smith-Waterman, Jaccard,
Overlap coefficient
COMMAND UMBC similarity Model - -
CroMatcher N-Gram, TF-IDF
CSA Edit-Distance, Wu-Palmer, TF-IDF -
DKP-AOM, DKP-AOM-Lite SimMetrics APIa -
DSSim Jaccard, Jaro-Winkler - -
Eff2Match Exact Match, TF-IDF - -
Falcon-AO I-Sub, TF-IDF - -
FCA-Map Exact Match - -
GeRoMeSuite+SMB Edit-Distance, Jaro-Winkler, -
I-Sub, Soft TF-IDF,
SecondString Libraryc
GMap Edit-Distance, TF-IDF - -
GOMMA, GOMMA-bk Exact Match, N-gram -
Hertuda Damerau-Levenshteind - - -
HotMatch Damerau-Levenshteind
IAMA Edit-Distance - -

aSimMetrics API is a Java library that includes such string metrics as Jaccard, Jaro-Winkler and N-gram

bWS4J (WordNet Similarity for Java) is a Java API containing string metrics like Wu-Palmer, Jiang-Conrath and Lin

cSecondString library is a package containing string metrics such as Edit-Distance, Jaro, TF-IDF

dDamerau-Levenshtein is a variant of Edit-distance that adds adjacent symbols’ transpositions into the distance measures