Table 5. Frequency of the ten most common lemmatized words in the keywords, titles and abstracts of the ALSPAC publications.
The full list of words output by the pipeline is available in the output data (see data availability). Numbers in parentheses are word counts.
| Keywords | Title | Abstract |
|---|---|---|
| study (1513) | study (357) | child (2517) |
| child (1284) | child (291) | age (2034) |
| human (1257) | cohort (259) | association (1905) |
| female (1050) | childhood (220) | associated (1696) |
| male (859) | association (220) | study (1675) |
| factor (720) | birth (146) | year (1553) |
| infant (568) | age (129) | risk (1142) |
| longitudinal (562) | risk (128) | maternal (1120) |
| pregnancy (470) | maternal (122) | ci (915) |
| adolescent (470) | associated (117) | cohort (904) |
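
Counts of this kind can be produced by tokenizing each text field, lemmatizing the tokens, and tallying the results. The following is a minimal sketch of that idea, not the pipeline's actual code: the stop-word list, the lemma map and the function names (`lemmatize`, `top_words`, `format_cell`) are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical stop words and lemma map, for illustration only.
# The pipeline's real lemmatizer and stop-word list are not described here.
STOP_WORDS = {"a", "an", "and", "in", "of", "the", "with"}
LEMMAS = {"children": "child", "studies": "study", "ages": "age", "risks": "risk"}


def lemmatize(token: str) -> str:
    """Toy lemmatizer: look the token up in a small hand-written map."""
    return LEMMAS.get(token, token)


def top_words(texts, n=10):
    """Return the n most frequent lemmatized, non-stop-word tokens with counts."""
    counts = Counter()
    for text in texts:
        # Lowercase and keep alphabetic runs only.
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(lemmatize(t) for t in tokens if t not in STOP_WORDS)
    return counts.most_common(n)


def format_cell(word: str, count: int) -> str:
    """Format one table cell in the 'word (count)' style used in the table."""
    return f"{word} ({count})"


if __name__ == "__main__":
    # Made-up example titles, not ALSPAC data.
    titles = [
        "A study of children",
        "Studies in child age",
        "Risk and age in the study",
    ]
    for word, count in top_words(titles):
        print(format_cell(word, count))
```

A dictionary-based lemmatizer keeps the sketch self-contained; a real pipeline would more likely rely on an NLP library, and the choice of lemmatizer affects which word forms are merged into a single entry.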