Skip to main content
. 2019 Apr 17;127(4):047005. doi: 10.1289/EHP4200

Figure 2.

Figure 2 is a flowchart showing the data preprocessing in the AOP help Finder tool.

Data preprocessing in the AOP-helpFinder tool. Before identifying and scoring associations between a chemical and an adverse outcome pathway (AOP) event, text information collected from multiple sources (publications, databases) was preprocessed in order to obtain stemmed data, which were stored in a relational in-house database. Negation words (never, neither, no, not, did not, hasn’t, should not, …) were identified with squares surrounded with small dots, stop words (coordinating conjunctions, punctuations, most common words such as “the”) with squares surrounded with large dots, and “words to stem” with solid lines (e.g., files file or finder find, considering that a word has a single stem, namely the part of the word that is common to all its inflected variants).