2022 Dec 15;14:141. doi: 10.1186/s13073-022-01142-7

Fig. 1.

Summary of the key steps within this penetrance estimation approach. Legend: Step 1: Variant frequencies ($M$) and weighting factors ($W$) are defined for a valid subset of the familial (F), sporadic (S), unaffected (U), and affected (A) states (see Table 1) to calculate the rate of one of these states, arbitrarily labelled state X, among families harbouring the pathogenic variant across those states for which data are provided, $R_X^{\mathrm{obs}}$. Step 2: Eqs. (5–8) are applied to calculate P(familial), P(sporadic), P(unaffected), and P(affected) for a series of penetrance values, $f_i = 0, \ldots, 1$, at a defined sibship size, $N$, and with disease risk $g$ for people not harbouring the variant. The rate of state X expected at each $f_i$ among variant-harbouring families from those states represented in Step 1, $R_{X,i}^{\mathrm{ex}}$, is calculated and stored alongside the corresponding $f_i$ in a lookup table. Step 3: The lookup table is queried using $R_X^{\mathrm{obs}}$ to identify the closest $R_{X,i}^{\mathrm{ex}}$ value and its corresponding $f_i$. Step 4: Bias in the obtained $f_i$ estimate is corrected by simulating a population of families representative of the sample data, estimating the difference between true and estimated penetrance values in this population for $f = 0, \ldots, 1$, and adjusting the estimated $f_i$ by the error predicted by a polynomial regression model fitted to the simulated estimate errors. Optional step: Confidence intervals for $R_X^{\mathrm{obs}}$ can be calculated from the error in the estimates of $M$ provided [48]; penetrance is then estimated as in Steps 3 and 4 for the interval bounds. All steps within this approach are comprehensively detailed in Additional File: Sect. 1.1.
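Steps 1 to 3 of the caption can be illustrated with a minimal Python sketch. It is not the authors' implementation, and Eqs. (5–8) are not reproduced in this section, so the state probabilities below come from an assumed toy model: each family is one variant-harbouring parent plus N children, each child inherits the variant with probability 1/2, carriers are affected with probability f and non-carriers with probability g, and the second parent is ignored. The function names, the weighted-share form of the observed rate, and the input values are likewise hypothetical. The bias correction of Step 4 and the optional confidence intervals are omitted.

```python
# Illustrative sketch only: an assumed toy model, NOT the paper's Eqs. (5-8).

def state_probabilities(f, n_sibs, g=0.0):
    """Return P(unaffected), P(sporadic), P(familial), P(affected) for one family.

    Assumption: one carrier parent (risk f) plus n_sibs children, each
    inheriting the variant with probability 1/2 (risk f if carrier, g if not).
    """
    if n_sibs < 1:
        raise ValueError("n_sibs must be an integer of at least 1")
    q = 0.5 * f + 0.5 * g                                  # risk for any one child
    none_affected = (1.0 - q) ** n_sibs                    # no child affected
    one_affected = n_sibs * q * (1.0 - q) ** (n_sibs - 1)  # exactly one child affected
    p_unaffected = (1.0 - f) * none_affected               # nobody affected
    p_sporadic = f * none_affected + (1.0 - f) * one_affected  # exactly one person affected
    p_familial = 1.0 - p_unaffected - p_sporadic           # two or more affected
    return {
        "unaffected": p_unaffected,
        "sporadic": p_sporadic,
        "familial": p_familial,
        "affected": p_sporadic + p_familial,
    }


def observed_rate(m, w, state_x):
    """Step 1 (assumed form): weighted share of state X among the states with data."""
    weighted = {state: m[state] * w[state] for state in m}
    return weighted[state_x] / sum(weighted.values())


def expected_rate(f, state_x, states, n_sibs, g=0.0):
    """Expected rate of state X among the listed states at penetrance f."""
    probs = state_probabilities(f, n_sibs, g)
    denom = sum(probs[state] for state in states)
    if denom <= 0.0:
        return None  # undefined, e.g. no affected families when f = g = 0
    return probs[state_x] / denom


def build_lookup_table(state_x, states, n_sibs, g=0.0, steps=1000):
    """Step 2: tabulate (f_i, expected rate) on an even grid from 0 to 1."""
    table = []
    for i in range(steps + 1):
        f_i = i / steps
        rate = expected_rate(f_i, state_x, states, n_sibs, g)
        if rate is not None:
            table.append((f_i, rate))
    return table


def estimate_penetrance(r_obs, table):
    """Step 3: return the f_i whose expected rate lies closest to the observed rate."""
    return min(table, key=lambda row: abs(row[1] - r_obs))[0]


# Hypothetical inputs: variant frequencies M and weights W for two states.
table = build_lookup_table("familial", ("familial", "sporadic"), n_sibs=2)
r_obs = observed_rate(
    {"familial": 0.02, "sporadic": 0.08},
    {"familial": 1.0, "sporadic": 1.0},
    "familial",
)
f_hat = estimate_penetrance(r_obs, table)
```

The nearest-neighbour query is unambiguous only where the expected rate changes monotonically with f, which holds for the familial rate in this toy model; the grid resolution (`steps`) bounds the precision of the uncorrected estimate.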