SI1_1-7: Molecular descriptor analysis. SI2_1-3: Pseudocode of algorithms implemented in the proposed workflow. SI3_1: Optimized descriptors for the construction of similarity networks. SI4_1-4: Data and information on case studies.