Abbreviations after the underscore mean respectively: “H”—heavy chain, “L”- light chain, “all”—all sequences, “human”—only human antibody sequences, “nonhuman”—only non human antibody sequences, “cst”—clinical stage therapeutics, “market”—therapeutics on the market. A. Liability distribution per dataset without applying any flags. B. Liability distribution per dataset after applying all three flags at once. C. Liability distribution per dataset after applying the ‘buried’ flag (Note that only the rightmost datasets with paired heavy/light chains are affected). D. Liability distribution per dataset after applying the ‘germline’ flag. E. Liability distribution per dataset after applying the ‘therapeutic’ flag.