Abstract
Introduction:
Health insurance registries, which capture insurance coverage and demographic information for entire populations, are a critical component of population health surveillance and research when using administrative data. Lack of standardization of registry information across Canada’s provinces and territories could affect the comparability of surveillance measures. We assessed the contents of health insurance registries across Canada to describe the populations covered and document registry similarities and differences.
Methods:
A survey about the data and population identifiers in health insurance registries was developed by the study team and representatives from the Public Health Agency of Canada. The survey was completed by key informants from most provinces and territories and then descriptively analyzed.
Results:
Responses were received from all provinces; partial responses were received from the Northwest Territories. Demographic information in health insurance registries, such as primary address, date of birth and sex, were captured in all jurisdictions. Data captured on familial relationships, ethnicity and socioeconomic status varied among jurisdictions, as did start and end dates of coverage and frequency of registry updates. Identifiers for specific populations, such as First Nations individuals, were captured in some, but not all jurisdictions.
Conclusion:
Health insurance registries are a rich source of information about the insured populations of the provinces and territories. However, data heterogeneity may affect who is included and excluded in population surveillance estimates produced using administrative health data. Development of a harmonized data framework could support timely and comparable population health research and surveillance results from multi-jurisdiction studies.
Keywords: health insurance registries, administrative health data, population identifiers
Highlights
All reporting Canadian health insurance registries contain data going back to at least 1996. The earliest year of available data was 1968, in Saskatchewan.
Some features of registry data, such as change in coverage, primary address, date of birth and sex, were consistent across all reporting jurisdictions. Other features, such as family unit identifiers, socioeconomic status and population identifiers, varied across reporting jurisdictions.
Differences in Canadian health insurance registries present opportunities for studies that compare the populations covered by provincial/territorial health insurance plans. Creation of a harmonized data framework would benefit national surveillance initiatives and multijurisdiction studies.
Introduction
Administrative health data are data originally collected for purposes of managing and monitoring the health care system. However, these data are also commonly used to measure and describe population health within Canada,1-7 because they are routinely collected, inexpensive to use and have nearly complete population coverage. Administrative health data include physician billing claims, hospital discharge abstracts, prescription medication dispensations and health insurance registries.8 Health insurance registries are a critical component of population-based surveillance and research studies in Canada, because they contain information about members of provincial or territorial populations who are eligible to receive publicly funded health services.9 Understanding the populations captured in health insurance registries and the data that are collected about these populations is essential for comparing health measures between jurisdictions and over time, as well as determining generalizability of these comparisons.
Health care coverage in Canada primarily falls under provincial or territorial jurisdiction.10 There is no standard for how data in health insurance registries are collected and coded, or for how different populations, such as those covered by federal health insurance, are identified in registry files. Differences in health insurance registries across Canada can present challenges to the comparability of surveillance and research studies.
The Canadian Chronic Disease Surveillance System (CCDSS) is a network supported by the Public Health Agency of Canada (PHAC) that aims to further the use of Canadian administrative health data for chronic disease surveillance and supports health care planning and health policies and programs.8 Through the CCDSS, the provinces and territories provide PHAC with aggregate data derived from administrative health data that are then used to generate national estimates and trends over time for over twenty chronic diseases and conditions. Heterogeneity in the data contained within health insurance registries can affect who is included in or excluded from the CCDSS estimates. It also impacts which jurisdictions can conduct health research and surveillance on specific subpopulations (e.g. First Nations individuals).
There has been limited research on the data contained in Canadian provincial and territorial health insurance registries. The few studies that do exist primarily focus on health insurance registries from a single jurisdiction.5,9,11 Two studies focus on the Manitoba health insurance registry,9,11 and one provides a brief description of the Quebec registry as part of that province’s Integrated Chronic Disease Surveillance System.5 Studies that include multiple jurisdictions are limited to reviews; no direct comparisons across jurisdictions have been made.12,13 Detailed information about identifiable populations and those individuals who are included in or excluded from population-based health insurance registries could (1)improve our understanding of surveillance estimates produced from CCDSS data, and (2)inform the potential uses of registry data to describe specific subpopulations.
The purpose of this study was to assess the data contained within health insurance registries across Canadian provinces and territories. The objectives were to describe the populations covered and document registry similarities and differences.
Methods
Data were collected using a survey developed by the project team with input from content experts and advisors at PHAC. As well, the project team received input from research staff at the Manitoba Centre for Health Policy with experience working with population health insurance registry data. The survey included five sections: (1)general information (i.e. indication of province or territory); (2)temporal data coverage; (3)start and end dates of coverage for residents of the province or territory; (4)population characteristics captured in registry data; and (5)population identification. The final draft of the survey was approved by PHAC before distribution. A copy of the survey is available upon request.
A list of key informants from each of the provinces and territories was identified by the project team with input from PHAC. Informants were primarily identified from members of the CCDSS Science Committee and Data Quality Working Group. Informants who held positions that involved working with administrative data were contacted, as they were most likely to be knowledgeable about the registry data in their jurisdiction. In cases where two contacts were available with similar expertise, both were contacted.
In April 2019, the project team emailed the survey to the key respondents, who were given one week to complete it; extensions were provided as needed. Key informants were free to consult other experts in their jurisdiction to complete the survey if needed. If key informants were unable to complete the survey, they were asked to recommend an alternative informant. Phone and email follow-ups were conducted by the project team to clarify responses when needed.
This study was conducted out of the University of Manitoba. Research ethics approval was not required as data were collected on health insurance registry contents, not human participants.14
Results
Thirteen key informants from the 10 provinces responded to the survey (two from Saskatchewan, Manitoba and New Brunswick), and one key informant from the Northwest Territories provided limited information. Where two key informants were consulted, informants worked together to fill out a single survey, which was returned to study researchers. No survey response was received from Yukon or Nunavut, as no one with sufficient expertise (i.e. knowledge of registry data) to address the questions was identified. Key informants included individuals employed by provincial ministries of health and experts associated with population-based research data repositories, such as ICES. Both Saskatchewan and Prince Edward Island indicated that registry staff were consulted when completing the survey.
Temporal data coverage
Information about temporal coverage of the health insurance registries is provided in Figure 1. The earliest available year of data was reported by Saskatchewan (1968). While Manitoba’s key informant reported that the earliest available year of data was from 1984, other published sources indicate this province’s health insurance registry data extends back to 1970.9,15 However, data from these early years may not be consistently available to all data users and may be difficult to use when generating surveillance and research reports. All provinces reported that “snapshots” (i.e. a file to preserve the registry data at that point in time) of their registries are regularly saved and stored.
Figure 1. Timelines of data available in Canadian health insurance registries for each participating jurisdiction (excluding the Northwest Territories).

Start and end of coverage
Table 1 summarizes the responses provided around start and end dates of health insurance coverage for individuals. There was heterogeneity across the reporting jurisdictions on the capture of information about mobility of health-insured residents (i.e. where an individual moved to when leaving the jurisdiction and where they moved from when entering the jurisdiction). For most jurisdictions, individuals were added to the registry three months after moving into the jurisdiction and flagged for end of coverage three months after moving out of the jurisdiction. The timeline for being added to the registry and being flagged for end of coverage in cases of birth and death were variable across jurisdictions. There was substantial diversity in the end-of-coverage cancellation codes (e.g. deceased, left country, duplicate record) provided by the key informants; only Ontario indicated no end-of-coverage codes were available. While data quality checks for start and end of coverage were implemented in a number of jurisdictions (i.e. British Columbia, Alberta, Saskatchewan, Ontario, Quebec, Nova Scotia and Newfoundland and Labrador), key informants did not always have information about data quality assessment processes or did not report that data quality assessments were undertaken.
Table 1. Start and end of individual coverage information in provincial/territorial health insurance registries, Canada, 2019.
Population characteristics and attributes captured
All jurisdictions reported that their registries contained a primary address, birth date and sex for health-insured residents (Table 2).
Table 2. Population characteristics and attributes captured in provincial/territorial health insurance registries, Canada, 2019.
A family unit identifier, that is, a code to indicate familial relationships, was reported for Alberta, Saskatchewan, Manitoba, New Brunswick and the Northwest Territories. Prince Edward Island reported having a household identifier, but noted limitations in its use (e.g. individuals in a group home have the same household identifier). British Columbia indicated that familial relationships among residents could be ascertained from other administrative data sources. Family relationship codes (i.e. indication of relationships among residents, such as parent, offspring, spouse or sibling) were available in Alberta, Saskatchewan, Manitoba and New Brunswick.
Only Ontario reported having information about ethnicity in its health insurance registry. Identifiable categories for ethnicity were general, Chinese and South Asian. These categories are derived from an algorithm applied to the data and not directly collected.
Measures of socioeconomic status (e.g. income quintile) were available in the health insurance registries of Ontario and Quebec; Ontario’s measure was algorithm-driven. British Columbia indicated socioeconomic status could be determined for a subset of individuals by linking to a supplementary database.
Lastly, place of birth was available in Alberta and Prince Edward Island, although other jurisdictions did note that linkage of registry data with other sources could aid in identifying this information.
Population identification
Table 3 provides information about the populations identifiable in health insurance registries. All jurisdictions indicated they could identify at least some of the populations in question, except for Quebec. Quality of population identifiers was heterogeneous. For example, the First Nations identifier in Manitoba’s health insurance registry may result in misclassification of individuals, because it is based on self-report. This means that a First Nations individual would only be flagged as such if that individual disclosed this information to insurance registry staff. In Alberta, the federal government had previously verified First Nations status, but stopped when Alberta eliminated health insurance premiums in 2009. To aid in preserving data quality, Alberta has maintained the flag for those present prior to the change, and any offspring are conferred unofficial status for reporting purposes. However, misclassification may still occur.
Table 3. Population identification information in provincial/territorial health insurance registries, Canada, 2019.
Discussion
The results of this survey provide important insights about the data contained within health insurance registries across Canada (i.e. 10 provinces and the Northwest Territories). Health insurance registry data go back as far as 1968 (in the case of Saskatchewan). All reporting jurisdictions had data going back to at least 1996. Individuals moving in or out of the jurisdiction are generally added or flagged for end of coverage three months after moving; timelines are more variable for beginning or ending of coverage due to births and deaths. Events recorded for end of coverage differ among jurisdictions. All jurisdictions record a primary address, date of birth and sex. Family identifiers, ethnicity, socioeconomic status and place of birth are recorded in select jurisdictions. Ability to identify members of certain populations, such as First Nations, the Canadian Armed Forces and inmates of federal penitentiaries, is variable across jurisdictions.
Previous research has compared data contained in physician services databases across Canada.16 However, there has been limited research on provincial and territorial health insurance registry data and its quality. Publications on registry data have reported on Manitoba’s and British Columbia’s health insurance registries.9,11,17 However, the most recent Manitoban publication was in 1999,9 and British Columbia’s paper focussed on the development of a research registry to which the provincial health insurance registry contributed.17 Tang et al.18 reported on ethnic classifications available in Canadian health insurance registries and found that flags only existed on First Nations people in the registries. In contrast, we found that Ontario had flags derived from an algorithm for subpopulations of South Asian and Chinese descent. A systematic review conducted by Hinds et al.13 did not identify any studies that had investigated the quality of health insurance registry information.
Capture of population characteristics and attributes allows for surveillance measures to be stratified by potential risk factors (e.g. social determinants of health such as socioeconomic status, immigration status and ethnicity). Results reported here help summarize the jurisdictions that are suitable for these analyses. However, heterogeneity among jurisdictions in population identification also has implications for who is included in the CCDSS estimates, and suggests that inclusions and exclusions may not be consistent across jurisdictions.
A major value of the health insurance registry comes from preserving snapshots, or timestamped records, of registry data.9 Our study results indicate that over 20years of registry data are available in all reporting jurisdictions, with snapshots available to capture changes in the registry. There are challenges associated with using health insurance registries for longitudinal studies: health insurance registries are not static, registry data and data quality change over time, and older data are not always linkable. Nonetheless, the benefits of being able to use these data to conduct longitudinal cohort and intergenerational studies with minimal cost and data collection far outweigh the challenges.
Strengths and limitations
This study is unique in providing insights about the data contained within health insurance registries across Canada using a standardized survey with near complete coverage of the provinces and territories. Surveys were completed by individuals with in-depth knowledge of the registry data and access to other informants to provide additional information if needed.
However, there are some limitations. Our study did not employ a validated survey for information capture, as no such survey exists. This hinders international comparisons. As well, health insurance registries are complex databases, and it is not possible to capture all nuances of their features in a survey conducted at a single point in time. Not all elements collected during the registration process will be available to potential data users. Many of these elements are administrative in nature. Many systems are dynamic, being continually updated, so extracts or snapshots are created with a methodology to provide the most accurate and consistent view of the population.
Future research
Future studies could be undertaken to validate key population characteristics included in health insurance registry data, including dates of birth and death, coverage cancellation codes, location of residence and identifiers for specific populations. Another research opportunity lies in assessing the timeliness of characteristic updates. However, a key challenge with such studies is identifying and accessing appropriate population-based validation data sources that can be linked to health insurance registration data. While vital statistics data could be used to assess the accuracy of dates of birth and death, validation data sources for other key population characteristics may not be readily available in all jurisdictions.
Previous studies have compared aggregate population counts obtained from health insurance registries to counts from Statistics Canada census data.19,20 However, potential sources of inaccuracies were not determined. Therefore, future research could look at potential sources of discrepancy between population counts in registry and census data, and estimate the impact of these discrepancies on health measures.
Exploring changes in health insurance registration coverage over time is another potential area of future research. Registries are not static; exclusions may not be consistent over time. Comprehensive information about changes in the data can help ensure accuracy of health trend estimates.
Future research should investigate the feasibility of a common data framework, such as the Generalized Data Model proposed by Danese et al.21 to facilitate the systematic and standardized capture of information in health insurance registries. However, access to some registry data elements may be subject to privacy legislation, and data are primarily collected for administrative, not research, purposes. This could lead to challenges in implementing a common data framework.
Finally, there is a potential role for a national organization to facilitate the harmonization or standardization of data in health insurance registries. Examples of such organizations include the Canadian Institute for Health Information, which has recently proposed standards for collecting data on patient ethnicity,22 and Health Data Research Network Canada, which aims to support multi-jurisdiction studies by connecting and establishing research data infrastructure.23
Acknowledgements
This work was made possible through collaboration between PHAC and the respective provincial and territorial governments of British Columbia, Alberta, Saskatchewan, Manitoba, Ontario, Quebec, New Brunswick, Nova Scotia, Prince Edward Island, Newfoundland and Labrador and Yukon. No endorsement by the provinces and territories is intended or should be inferred.
Conflicts of interest
The authors have no conflicts of interest to declare.
Authors’ contributions and statement
NCH and LML drafted the manuscript and all authors contributed to its revisions. LML, CR, JE, SO, LM and KH defined the scope of the survey. NCH, LML, CR, JE, SO, LM and KH developed the survey. LML and NCH contacted key informants and collected completed surveys. LR, KAMP, MA, MS, RP, MM, AY, JS, YL, LWS, FS, AA, BZ and JA reviewed the manuscript for accuracy and ensured appropriate interpretation of health registry data.
The content and views expressed in this article are those of the authors and do not necessarily reflect those of the Government of Canada.
References
- LeBlanc AG, Gao YJ, McRae L, Pelletier C, et al. At-a-glance—Twenty years of diabetes surveillance using the Canadian Chronic Disease Surveillance System. Health Promot Chronic Dis Prev Can. 2019;39((11)):306–9. doi: 10.24095/hpcdp.39.11.03. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robitaille C, Dai S, Waters C, et al, et al. Diagnosed hypertension in Canada: incidence, prevalence and associated mortality. CMAJ. 2012;184((1)):E49–E56. doi: 10.1503/cmaj.101863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robitaille C, Bancej C, Dai S, et al, et al. Surveillance of ischemic heart disease should include physician billing claims: population-based evidence from administrative health data across seven Canadian provinces. BMC Cardiovasc Disord. 2013:88–E56. doi: 10.1186/1471-2261-13-88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blais C, Dai S, Waters C, et al, et al. Assessing the burden of hospitalized and community-care heart failure in Canada. Can J Cardiol. 2014;30((3)):352–8. doi: 10.1016/j.cjca.2013.12.013. [DOI] [PubMed] [Google Scholar]
- Blais C, Jean S, Sirois C, et al, et al. Quebec Integrated Chronic Disease Surveillance System (QICDSS), an innovative approach. Chronic Dis Inj Can. 2014:226–35. [PubMed] [Google Scholar]
- Gershon AS, Guan J, Wang C, To T, et al. Trends in asthma prevalence and incidence in Ontario, Canada, 1996-2005: a population study. Am J Epidemiol. 2010;172((6)):a population study–35. doi: 10.1093/aje/kwq189. [DOI] [PubMed] [Google Scholar]
- Hux JE, Ivis F, Flintoft V, Bica A, et al. Diabetes in Ontario: determination of prevalence and incidence using a validated administrative data algorithm. Diabetes Care. 2002;25((3)):512–6. doi: 10.2337/diacare.25.3.512. [DOI] [PubMed] [Google Scholar]
- Lix L, Ayles J, Bartholomew S, et al, et al. The Canadian Chronic Disease Surveillance System: a model for collaborative surveillance. Int J Popul Data Sci. 2018;3((3)):433–6. doi: 10.23889/ijpds.v3i3.433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roos LL, Nicol JP, et al. A research registry: uses, development, and accuracy. J Clin Epidemiol. 1999;52((1)):39–47. doi: 10.1016/s0895-4356(98)00126-7. [DOI] [PubMed] [Google Scholar]
- Government of Canada. Ottawa(ON): Canada’s health care system [Internet] Available from: https://www.canada.ca/en/health-canada/services/health-care-system/reports-publications/health-care-system/canada.html. [Google Scholar]
- Roos LL, Mustard CA, Nicol JP, et al, et al. Registries and administrative data: organization and accuracy. Med Care. 1993;31((3)):201–12. doi: 10.1097/00005650-199303000-00002. [DOI] [PubMed] [Google Scholar]
- Roos LL, Gupta S, Jebamani L, et al. Data quality in an information-rich environment: Canada as an example. Can J Aging. 2005 doi: 10.1353/cja.2005.0055. [DOI] [PubMed] [Google Scholar]
- Hinds A, Lix LM, Smith M, Quan H, Sanmartin C, et al. Quality of administrative health databases in Canada: a scoping review. Can J Public Health. 2016;107((1)):e56–e61. doi: 10.17269/cjph.107.5244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- University of Manitoba. Winnipeg(MB): Research requiring ethics review [Internet] Available from: https://www.umanitoba.ca/research/orec/ethics_medicine/requiring_review.html. [Google Scholar]
- University of Manitoba. Winnipeg(MB): Manitoba population research data repository data descriptions [Internet] Available from: https://umanitoba.ca/faculties/health_sciences/medicine/units/chs/departmental_units/mchp/resources/repository/descriptions.html?ds=Insurance. [Google Scholar]
- Lix LM, Walker R, Quan H, Nesdole R, Yang J, Chen G, et al. Features of physician services databases in Canada. Chronic Dis Inj Can. 2012;32((4)):186–93. [PubMed] [Google Scholar]
- Kerluke K, Black C, Peterson S, MacDonald A, McKendry R, et al. Developing and maintaining a population research registry to support primary healthcare research. Healthc Policy. 2009;5((Spec No)) [PMC free article] [PubMed] [Google Scholar]
- Tang K, Lucyk K, Quan H, et al. Canadian data sources on ethnic classifications [abstract] J Gen Intern Med. :S144–S145. [Google Scholar]
- Ellison J, Nagamuthu C, Vanderloo S, McRae B, Waters C, et al. Estimating chronic disease rates in Canada: which population-wide denominator to use. Health Promot Chronic Dis Prev Can. 2016;36((10)):224–30. doi: 10.24095/hpcdp.36.10.03. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin Y, et al. Alberta Health and Wellness. Edmonton(AB): 2009. Comparison of Alberta population counts between the AHCIP Registry and the 2006 Census. Available from: https://open.alberta.ca/publications/9780778582502#detailed. [Google Scholar]
- Danese MD, Halperin M, Duryea J, Duryea R, et al. The Generalized Data Model for clinical research. BMC Med Inform Decis Mak. 2019;19((1)):117–30. doi: 10.1186/s12911-019-0837-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- CIHI. Ottawa(ON): 2020. Proposed standards for race-based and Indigenous identity data collection and health reporting in Canada. [Google Scholar]
- Health Data Research Network Canada. Vancouver(BC): About [Internet] Available from: https://www.hdrn.ca/en/about. [Google Scholar]







