Skip to main content
. 2023 Jan 26;14:1098439. doi: 10.3389/fgene.2023.1098439

FIGURE 1.

FIGURE 1

Classic PRSs and ML-driven PRSs the polygenic risk score for a target individual and phenotype of interest (y) is based on the individual's genetic data (x g ) but can also include other data types (x e ). The score is calculated using a linear regression (with weights β) or a machine learning model f θ (e.g. a neural network with parameters θ). The parameters (β, θ) are learned using a separate training cohort. Note, however, that while the linear regession cofficients β are often publicly available or can be derived from published summary statistics, to train the neural network f θ it is necessary to have access to individual level data in the training cohort.