Skip to main content
. 2024 Sep 3;30(12):3578–3589. doi: 10.1038/s41591-024-03239-5

Extended Data Fig. 1. Indexing and Querying in PrecisionChain.

Extended Data Fig. 1

(A) Indexing in Domain view is done by clinical table and then OMOP vocabulary hierarchy. Each clinical domain has its own exclusive set of streams. Concepts are grouped by ancestor concept using the vocabulary hierarchy. Each ancestor group gets its own stream. Indexing in Person view is by person (person ID). All data (clinical or genetic) for a patient are inserted into the same stream. For clinical this is irrespective of domain and for variant this is irrespective of genomic coordinate bin. Note within a single stream, multiple patient data can be inserted. Indexing in Variant/Gene view is by genomic coordinate bin. All variants/genes within a set of continuous genomic coordinates are added to a single stream. Indexing in Analysis is by analysis type. Data for kinship and population stratification is stored per sample and data for sequencing metadata is stored by metadata type. (B) Flowchart of query process. User inputs required fields into the query module. The mapping stream is searched for the location of the stream holding data for that concept ID. The stream location is extracted from the mapping stream and the concept ID is searched in that stream. Person IDs returned from the stream search are retrieved and processed into a table. If additional search filters are added, these are processed on the returned data.