Skip to main content
Applied Clinical Informatics logoLink to Applied Clinical Informatics
. 2012 Jul 6;3(3):258–275. doi: 10.4338/ACI-2011-12-RA-0074

Dealing with the Archetypes Development Process for a Regional EHR System

MR Santos 1,, MP Bax 1, D Kalra 2
PMCID: PMC3613029  PMID: 23646075

Abstract

Objectives

This paper aims to present the archetype modelling process used for the Health Department of Minas Gerais State, Brazil (SES/MG), to support building its regional EHR system, and the lessons learned during this process.

Methods

This study was undertaken within the Minas Gerais project. The EHR system architecture was built assuming the reference model from the ISO 13606 norm. The whole archetype development process took about ten months, coordinated by a clinical team co-ordinated by three health professionals and one systems analyst from the SES/MG. They were supported by around 30 health professionals from the internal SES/MG areas, and 5 systems analysts from the PRODEMGE. Based on a bottom-up approach, the project team used technical interviews and brainstorming sessions to conduct the modelling process.

Results

The main steps of the archetype modelling process were identified and described, and 20 archetypes were created.

Lessons learned:

  • The set of principles established during the selection of PCS elements helped the clinical team to keep the focus in their objectives;

  • The initial focus on the archetype structural organization aspects was important;

  • The data elements identified were subjected to a rigorous analysis aimed at determining the most suitable clinical domain;

  • Levelling the concepts to accommodate them within the hierarchical levels in the reference model was definitely no easy task, and the use of a mind mapping tool facilitated the modelling process;

  • Part of the difficulty experienced by the clinical team was related to a view focused on the original forms previously used;

  • The use of worksheets facilitated the modelling process by health professionals;

  • It was important to have a health professional that knew about the domain tables and health classifications from the Brazilian Federal Government as member in the clinical team.

Conclusion

The archetypes (referencing terminology, domain tables and term lists) provided a favorable condition for the use of a controlled vocabulary between the central repository and the EMR systems and, probably, will increase the chances of preserving the semantics from the knowledge domain. Finally, the reference model from the ISO 13606 norm, along with the archetypes, proved sufficient to meet the specificities for the creation of an EHR system for basic healthcare in a Brazilian state.

Keywords: Health information systems, electronic health record, archetype modelling, two-level modelling, archetypes, ISO 13606; openEHR

1. Introduction

The development of Information Systems (IS) that can satisfy the fundamental requirements of the clinical data electronic recording process has attracted the efforts of the healthcare information technology scientific community [1]. These types of IS, generically known as the Electronic Health Record (EHR) systems, must, on the one hand, have the flexibility to represent specific medical knowledge in a way that allows it to assimilate its complexity and rapid evolution (of clinical concepts) and, on the other hand, satisfy the need for interoperability with other EHR systems. Such systems are built based on different data models, terminologies, concepts etc. As they exchange clinical data, there is a risk that mistaken interpretations may lower the accuracy of the communicated record and negatively impact on the quality of medical care.

Interoperability between EHR systems implies an agreement on the structure and meaning of the data being communicated. One proposal to handle this complexity (dynamic nature and sharing of the concepts) is two-level modelling. In this approach, two models are used: a generic information model (reference model) and a knowledge domain model, closer to the domain specialist (not the IT specialist). Suitable for the modelling of complex systems that involve a large number of domain specialists, the approach has strongly influenced the design of interoperability standards in the EHR systems field.

An example of an interoperability standard that uses this approach is the ISO 13606 norm [2], which specifies a reference model aimed at EHR system interoperability. This ISO norm proposes a model that represents either the complete EHR or selected parts of an EHR, and it is a simplification of the approach proposed by the openEHR Foundation1. In this way, data can be extracted from an EHR system for the purpose of communicating with another system or responding to a requesting system. [3–5]. This norm was used in the case of the Health Department of Minas Gerais State (SES/MG)2 [6].

This article aims to present the results of the archetype modelling process used by SES/MG to support the establishment of a regional EHR system – based on a central repository – and the lessons learned from this activity.

2. Background

2.1. Two-level modelling

The idea of separation between a model of information and a model of knowledge has its origins in previous research within the Information Science field [7–12]. These concepts were applied in the health informatics field by certain European projects [3, 13–15] and subsequently expanded by the openEHR Foundation [16].

According to openEHR approach, the two-level modelling (or multi-level modelling), distinguishes between a reference model (information level) that is used to represent the generic properties of health record data, and an archetype model (knowledge level) used to represent the specific information that needs to be documented within health records by each clinical domain. The advantage of this split is the possibility to construct a domain specialist model (knowledge model) that is more independent from the developers’ activities, which remain focused on the reference model [17, 18].

The reference model represents the global traits of the components of an EHR, the manner in which they are aggregated and the data context required to meet legal and ethical requirements, and the provenance (origin) of the data. Following this approach, it defines a set of classes that make up the structural blocks of an EHR, reflecting its stable traits. In order for this model to be stable and able to handle any particular clinical domain, it should be complemented by metadata structures reflecting domain-specific traits, restricted by the general classes from the reference model. The formalism for the domain-specific traits is known as archetypes [3].

Archetypes are formal expressions of a domain content model in the form of declarations of structured constraints based on a Reference Model [1]. They specify a hierarchy of sub-classes that are components of a record, defining or constraining the names of each sub-class and other relevant attribute values, such as optionality and multiplicity at any point of the hierarchy, the types of data and value intervals the data elements may assume, and dependence constraints [3]. Generally designed for broad use, they can, however, be specialized to include local particularities and can accommodate any number of idioms and terminologies [18]. Archetype instances conform to a formal model known as archetype model [19].

2.2. ISO 13606 Norm

The ISO 13606 standard was developed by the Technical Committee ISO/TC 215, Health Informatics, and conceived from practical experience obtained during the implementation of the European precursor pre-standard, ENV 13606. This norm has focus on the communication between electronic health record systems. Its reference model is a simplification of the architecture put forward by the openEHR Foundation, and it is also based on the two-level modelling approach. In this alignment, the main classes of its reference model aims to establish an information model to communicate part or the entire electronic health record of a given patient (EHR extracts): preserving the original clinical meaning intended by the author; and reflecting the confidentiality of each data as intended by both author and patient [2].

Its reference model defines the following hierarchical levels for EHR data organization (containers): EHR_Extract – the electronic health record for one person; Folder – high-level organization of the EHR; Composition – Clinical care session, encounter or document; Section – clinical headings reflecting the workflow and consultation process; Entry – Clinical “statements” about observations, evaluations, etc; Cluster – Nested multi-part data structures; Element – Leaf nodes with single data values. See these hierarchical levels in Figure 1.

Fig. 1.

Fig. 1

Regional EHR System Architecure

The domain knowledge in the ISO 13606 norm also can be represented by a set of archetypes. According to the part 2 of the norm, an archetype represents a set of constraints placed upon the structure of the data present in the reference model. Usually individual archetypes are represented as serialized objects through a formalism named Archetype Definition Language (ADL).

2.3 Archetype Modelling

Archetypes can be developed through similar principles to the ones used in the creation of guidelines or protocols, based on best clinical practices and through development by multidisciplinary teams [20]. However, the clinical knowledge represented by an archetype goes far beyond that represented by a guideline. The archetype requires evidence about a knowledge object containing the details on the aspects of an intervention that has been undertaken and documented. Furthermore, it requires evidence associated with the concepts such as: rules, measurement intervals, data types, presentation formats, data representation conditions (codes, terminologies), etc. The data contained in an archetype should be sufficient to allow its interpretation on its own and be as complete as possible so that it may meet the needs of multiple sectors, purposes and priorities. However, they may be used in specific contexts, reflecting the data elements typical of a more focused environment [18, 20–21].

The openEHR Data Modelling Approach (ODMA) is a known process for the development of archetypes [22] that suggests the following steps should be taken:

  • 1.

    search in all media (records in paper, guidelines, specialized magazines, etc.) for possible data elements;

  • 2.

    group the elements into coherent concepts;

  • 3.

    search for the identified concepts within a repository of archetypes and verify the existence of compatible archetypes;

  • 4.

    develop the new archetypes that are needed by using a specific editor;

  • 5.

    design templates to adapt archetypes to local circumstances or in order to combine different archetypes to represent a higher-level clinical documentation workflow.

2.4. Regional EHR System of Minas Gerais State

The SES/MG has begun a project to develop an EHR for all the cities in the state, which includes building a state-wide EHR repository, aimed at consolidating demographic data and producing a Patient’s Clinical Summary (PCS) of its citizens for the Family Healthcare Programs (FHP).

The regional EHR system proposed for this project was based on the division of the State into macro-regions and on the implementation of vendor-based Electronic Medical Record (EMR) system to handle each one of them. Each macro-region, composed of several healthcare units, should use an EMR system and have a local datacenter. Therefore, the EHR system architecture was based on a central repository and on the creation of a messaging infrastructure, based on the ISO 13606 norm, aimed at ensuring the interoperability of the contracted EMR systems – the decision by use this norm was made mainly because its focus on EHR/EMR systems interoperability. The repository was designed to contain demographic data and the PCS necessary to meet the demands of the FHP. The architecture prescribes the use of archetypes to represent knowledge and includes four basic services: clinical data services, demographic data services, archetype services and terminology services [23]. See Figure 2.

Fig. 2.

Fig. 2

ISO 13606 hierarchical levels [3]

The decision to use archetypes in the central repository was based on the flexibility and ease these artefacts can offer for medical knowledge reuse. Additionally, the archetype-based approach enabled the SES/MG clinical team to take direct responsibility for any evolutionary changes in the concepts specific to the health domain and for any new clinical data representations, while PRODEMGE would bear the “more technological” responsibility of maintaining the archetype processing infrastructure and managing the servers that host the central repository. This decision was made with the goal of obtaining the advantages expected with the two-level modelling approach [3].

The EHR system architecture was built on a Service Oriented Architecture (SOA) approach, the clinical data services were specifically designed to allow the exchange of patients’ clinical data packages between systems and the central repository. These packages are called EHR extracts, which were developed based on the ISO 13606 norm reference model (Section 2.2) and complemented by a set of archetypes that define the central repository concepts. Thus, two services were proposed: one for sending data to the central repository and another to permit requests for data from the central repository – see Figure 2. In order to send data to the central repository, every day the EMR systems operating in the macro-regions will have to dispatch extracts composed of clinical data sets (within compositions) generated for each patient at the healthcare units and update the central repository.

At the time of preparing this manuscript, the selection process for the EMR systems that would participate in the State’s EHR solution is in process, being conducted by the PRODEMGE3. Interoperability tests with the central repository are scheduled to begin in mid-2012 and, by 2013, the pilot period and the activation of the system in some regions/cities of the State are expected to have begun.

3. Methods

This study was undertaken in SES/MG. The whole archetype development process took about ten months. This process was based on the following steps: selecting data elements from the patient’s clinical summary; identifying concepts that could become archetypes; modelling and validating entry archetypes; identifying domain tables, terminology and rules of permanence; searching for external sources of archetypes that could be used within this project; and codifying archetypes in ADL. The modelling process was coordinated by a clinical team formed by 3 health professionals and one systems analyst from the SES/MG, supported by the first author of this study. Additionally, the activities of data elements selection and concept definition involved around 30 health professionals from the internal SES/MG areas, and 5 systems analysts from the PRODEMGE. The clinical team used technical interviews and brainstorming sessions to conduct the modelling process.

The result validation process was mostly accomplished through the formalization of the data in spreadsheets that, once generated, were at each step sent to the participants by e-mail. The idea was to give each reviewer a means of collaborating on the basic clinical information structure without a special tool, and without having to face the full detail required to define each archetype. After validation, the recording of the approval was also done by e-mail. Some of the results were obtained during meetings held for this purpose and recorded in the minutes of the meetings.

4. Archetype Modelling Process in Minas Gerais Project

The archetypes were used exclusively to model clinical data that were to be kept in the central repository. The demographic data were represented in accordance with the model published by the Federal Government to support the national health card5. The aforementioned steps and the manner in which the clinical team conducted the archetype conception are detailed next.

4.1. Selection of Elements for the Patient’s Clinical Summary

An essential part of the EHR scenario, the PCS, represents a set of clinical data elements resulting from the care provided at the state healthcare units. Initially, a set of principles guided the selection of the PCS data elements. These data elements should:

  • 1.

    be sufficient to allow for patient mobility between regions of the State without compromising care;

  • 2.

    include data for the monitoring of basic healthcare management indicators established by the SES/MG;

  • 3.

    be the result of a consensus among the teams of various clinical specialties, who are in complete agreement with the comprehensive plan and contemplate the life cycle stages mentioned in the guidelines;

  • 4.

    be structured (not free text) and, whenever possible, be compatible with the national healthcare terminology used by the Federal Government and with the domain tables used by the SES/MG.

Grounded on these principles, the clinical team used the FHP documents and the guidelines as a starting point for the identification of the mandatory data elements for the EMR systems that would be used. These documents were used in the various workshops carried out with the internal areas of the SES/MG. As a result, the clinical team identified several necessary data elements. Excluding similar or repeated data across the life cycle stages, 206 data elements were identified for the central repository: 84 relating to public health management and 122 directed at patient mobility – only those data elements related with patient mobility (clinical data) were subject of the archetype development. The demographic data elements were not included in this study (Table 1).

Tab. 1.

122 data elements identified for the PCS

# Data Element
1 Periodontitis
2 Type of Delivery
3 Gestational Age
4 Intercurrences during prenatal period
5 Other Intercurrences during prenatal period
6 VDRL
7 Toxoplasmosis
8 Anti HIV
9 HBsAg
10 Name of complementary examination
11 Specification of complementary examination
12 Complementary examination result date
13 Complementary examination result
14 Glycated hemoglobin dosage result date
15 Glycated hemoglobin dosage result
16 Neonatal screening result: Hypothyroidism
17 Neonatal screening result: Phenylketonuria
18 Neonatal screening result: Cystic fibrosis
19 Neonatal screening result: Sickle-cell disease
20 Indirect Coombs result date
21 Indirect Coombs test result
22 Hemoglobin test result
23 Hematocrit test result
24 Fasting Glycemia test result
25 Routine urine test result
26 Urinalisys test result
27 Ultrasonography test result
28 Apgar score first minute
29 Apgar score fifth minute
30 History of dental traumatism / cause
31 Other causes of dental traumatism
32 Classification of newborns
33 Skull Perimeter
34 Intercurrences during neonatal period
35 Other Intercurrences during neonatal period
36 Measured value
37 Weight
38 Orthoses and prostheses
39 Presence of disability
40 Allergy
41 Description of Allergy
42 Name of immunobiologicals
43 Immunobiologicals dosage
44 Immunobiologicals lot number
45 Expiration date of Immunobiologicals
46 Immunobiologicals application date
47 Immunobiologicals Adverse events
48 Immunobiologicals are up-to-date
49 Consumption of Alcohol
50 Tobaccoism
51 Use of other substances / licit or illicit
52 Name of the medication
53 Posology of medication
54 Medication being used
55 Date when use of medication began
56 Medical prescription
57 Adverse effects of medication
58 Newborn Hearing Screening test
59 Hearing Screening test alterations
60 Body mass index
61 Lesions affecting the mucosa of the oral cavity
62 Fluorosis
63 Waist circumference
64 Cardiovascular risk associated with waist circumference
65 Systolic arterial pressure value
66 Diastolic arterial pressure value
67 Activities of Daily Living test – ADL – Self-care
68 Instrumental activities of daily living test – IADL
69 Falls in the last year
70 Number of falls in the last year
71 Fractures after falls
72 Activity limitation after falls
73 Incapacity after a fall
74 Compromised Mobility after a fall
75 Cognitive disability
76 Postural instability
77 Immobility
78 Incontinence
79 Iatrogenesis
80 Diagnostic impressions
81 Risk stratification for children
82 Risk stratification for adolescents
83 Risk stratification for the elderly
84 Cardiovascular risk stratification
85 Risk stratification – Diabetes
86 Risk stratification for current gestation
87 Risk stratification – Mental Health
88 Risk stratification – Tuberculosis
89 Risk stratification – Hansen’s disease
90 Risk classification – oral health
91 Number of gestations
92 Dental Caries Activity
93 Gingivitis
94 Number of Abortions
95 Number of Deliveries
96 Number of Vaginal Births
97 Number of C-Section Births
98 SISPRENATAL reference number
99 Date of Last Menstrual Period
100 Probable Date of Delivery
101 Probable Date of Delivery after first Ultrasonography test
102 Result of First Ultrasonography test
103 Prenatal Consultation Reference Number
104 Gestational Age
105 Presence of Edema
106 Description of Edema
107 Uterus
108 Uterine Height
109 Position of Fetus
110 Condition of Fetus
111 Dorsal Portion of Fetus
112 Fetal Heart Rate
113 Fetal Movements
114 Cervix
115 Dilation Condition
116 Dilation rate
117 Planned pregnancy
118 Care Plan
119 Prenatal care site
120 Relation with maternity hospital of reference
121 Scheduling Pregnant Women Group Meetings
122 Observations on Current Gestation

Thus, these 122 data elements were subjected to a rigorous analysis aimed at determining the most suitable clinical domain. After several meetings with the SES/MG internal areas, the clinical team achieved positive results. Only 10% of the 122 data elements would be free text (non-structured), the remainder were converted into internal term lists and domain tables or aligned with the national/international classifications. Once the data elements were identified, the candidate concept identification and the archetype modelling process was initiated.

4.2 Candidate Concept Identification and Archetype Modelling

Identification of the candidate concepts to be kept in the central repository, from the 122 data elements chosen, were targeted for archetype modelling. The modelling process began by considering the structural organization of the archetypes. At first, it was decided that a thematic archetype (composition) called “Clinical Summary” would be created. Also, it was agreed to use small Entry archetypes based on specific concepts, thus allowing greater reutilization flexibility. The later grouping of Entries into Sections proved to be a high complexity task, considering the still incipient modelling archetype knowledge that the team possessed. In order for them to be properly modelled, a well consolidated view of the use of these structures was needed. Therefore, postponing this modelling until future archetype versions was expected increase the chances of their correct use.

Once the archetype organizational structure had been delineated, an analysis process aimed at surveying the concepts behind the identified data element groups began. To this end, the clinical team conducted brainstorming sessions in which the possible concepts were identified and the pertinence, adequacy and coherence of the data elements for the creation of these concepts were analyzed. Several candidate concept versions were created and a substantial modelling effort was put into this step of the process.

Levelling the concepts to accommodate them within the hierarchical levels in the reference model was definitely no easy task. The use of a mind mapping tool facilitated the modelling process, allowing the visualization of the relationship between concepts and data elements and the existing overlaps. The structures of the concepts resulting from the archetype-oriented modelling process were quite different (in terms of data grouping) from those employed in the original forms from which most data elements were extracted [5].

After an adjustment and correction process, the spreadsheet was finally validated by the Project Clinical Team. It should be noted that the team decided to use worksheets to facilitate the modelling process by health professionals. Many of them had difficulties in using archetypes editors directly because of the technical skills that were required. This worksheet was created according the ISO 13606 reference model classes and data type definitions. Also, the health professionals could use this spreadsheet to inform terminology codes and domain tables to be used. See a worksheet example in Table 2.

Tab. 2.

Example of spreadsheet used during the modelling process based on ISO 13606 reference model

Entry Cluster Element Optionality Description Format Datatype CEN14796 Terminology Element multiplicity
Allergy Allergies list Classification Mandatory Classification of allergies to food, medications and other substances. Number –2 characters ORD
  • 1.

    Allergy to foods

  • 2.

    Allergy to animals

  • 3.

    Allergy to cosmetics

  • 4.

    Allergic to detergents

  • 5.

    Allergy to drugs

  • 6.

    Allergy to fungi

  • 7.

    Allergy to …


Allergy Allergies list Observation Optional Observations regarding the allergies. Alphanumeric – limited to 1000 characters SIMPLE_TEXT
Disabilities Kind of disability Optional Kind of disability the patient has. Number –1 character ORD
  • 0

    – no observed

  • 1

    – hearing

  • 2

    – Physical

  • 3

    – mental

  • 4

    – ostomy

  • 5

    – visual

  • 6

    – multiple

multivalue
Disabilities Severity degree Optional Severity degree observed Number –1 character ORD
  • 1

    – mild

  • 2

    – moderate

  • 3

    – severe

Note: in order to fit the spreadsheet to this page just some columns were included and the concepts are not complete.

4.3 Terminology Analysis

The next step concerns the identification of the domain tables and terminology codes to be used to validate each data element. Control over the employed vocabulary is required in order to ensure the interoperability of the EMR systems with the central repository. The terminology server proposed within the EHR system architecture addresses this need (Section 3). This will aggregate the codes from a descriptive terminology4, term lists and domain table from SES/MG, and national/international classifications to be used. The clinical team analyzed the domain for each data element, specifying the appropriate term lists and/or domain tables to validate data elements. Some data elements needed to be validated against Federal Government classifications (e.g. clinical procedures and vaccines).

The decision regarding which descriptive terminology to use was, at the time of writing, still awaiting a decision from the Federal Government. The Committee for Health Informatics and Information of the Ministry of Health (CIINFO/MS) was intensifying efforts towards the adoption of a descriptive terminology in Brazil. It is important to note that whenever the SES/MG does require the EMR systems to start sending coded extracts using a descriptive terminology, the reference model of ISO 13606 already covers this type of use.

According to the ISO 13606 norm, all term lists, domain tables, classifications and terminologies used in EHR extracts must be identified by OID numbers (Object Identifier). This is done in order to ensure that both the issuing and the receiving EMR systems correctly identify the terminology systems in use. As a final step, the spreadsheet used in the archetype modelling process was revised taking the identified term lists, domain tables and classifications into consideration.

At this point, the clinical team researched the openEHR Foundation Clinical Knowledge Management (CKM) portal in order to analyze the existing archetypes held there with the goal of comparing them with the modelled concepts. Despite the fact that there was still a small number of archetypes published on the CKM portal during the execution of this project, they served as research sources to guide the revision of some data elements.

4.4 Permanence Rules Analysis

As mentioned when describing the regional EHR system of Minas Gerais State (Section 3), the PCS database should be up-to-date with the clinical data for each citizen at all times. To this end, a set of rules, called permanence rules, was created to ensure that only up-to-date data elements remain in this database. Data elements have specific traits relating to the updating of the PCS. In some data elements, the last value to occur must always be maintained, while in others, the last n occurrences or all the occurrences, or even occurrences during a given time interval etc. are to be maintained. Therefore, in order for the PCS to be kept up-to-date, the appropriate permanence rule for each data element had to be identified by a specific analysis from the clinical team.

Once the EHR extract is semantically validated, it remains in the history database, preserving the composition history for each patient, and lastly, depending on the permanence rule for each data element, it may remain in the PCS database. For this reason, the platform applies permanence rules to the data elements present in the extract, keeping the PCS up-to-date. These permanence rules only apply to data elements (Elements) that have such rules defined for use during import into the repository. This is an especially important property for a summary.

In order to manage these permanence rules we decided to treat them similarly to terminology binding codes within the structure of the ADL files. In this way, all essential parameters necessary to define the knowledge structure for the central repository were held within the archetypes.

4.5 Encoding Archetypes into ADL

When the Entry validation process was completed by the clinical team, with discussions about term lists, domain tables, classifications, terminologies and permanence rules having been finalized, the archetype encoding process began. The goal of this step was the formalization of the constraints associated with each data element using ADL codes [1].

The ADL codes resulting from this activity would be used in tandem with a group of XML Schema from the reference model to validate the EHR extracts, defining the set of constraints to which the data instances should be subject in order to be considered valid by the central repository (Section 3). Finally, with the help of the linkEHR editor5, all of the concepts present in the spreadsheet approved by the clinical team were encoded in the ADL.

5. Results

The archetype modelling process carried out by the SES/MG can be summed up in the steps presented in Table 3 and yielded 20 archetypes as shown in Table 4.

Tab. 3.

Detailing of the archetype modelling steps in the SES/MG

Step Description Methodology Participants Delivered artifact Effort
Patient’s Clinical Summary (PCS) element selection Defining the data elements that would compose the PCS.
  • Workshops with the SES/MG internal areas;

  • Domain analysis for each identified element.

Clinical Team; Representatives from the internal areas of the SES/MG. 122 PCS data elements identified. 60 days
Identification of the Candidate Concepts Identifying the candidate concepts that properly accommodated the data elements chosen for the Summary.
  • Discussions to identify candidate concepts;

  • Development of Mind Maps.

Clinical Team Mind Maps including all concepts identified. 60 days
Archetype Modelling Modelling the archetypes according to the hierarquical levels prescribed by the ISO 13606 norm.
  • Modelling the candidate concepts in accordance with the classes prescribed by the ISO 13606 norm;

  • Developing an MS Excel Spreadsheet based on the RM of ISO 13606.

Clinical Team MS Excel spreadsheet with the Entry modelling results. 90 days
Terminology Analysis and Identification Validating term lists, domain table, classifications and terminology to be used.
  • Analyzing applicable descriptive terminology;

  • Analyzing term lists and domain tables for data element validation;

  • Complementing the archetype modelling with the inclusion of term lists, domain tables and terminology.

Clinical Team MS Excel spreadsheet with the Entry modelling results complemented by terminologies. 45 days
Analysis and Identification of Permanence Rules Studying the permanence rules to be used.
  • Analyzing data element permanence rules in the PCS;

  • Complementing the archetype modelling with the inclusion of the summarization rules.

Clinical Team MS Excel spreadsheet with the Entry modelling results complemented by permanence rules. It was started in parallel of the Terminology Analysis – 45 days.
Existing Archetype Research Researching existing archetypes and revising modeled Entries. Accessing the openEHR/ CKM website and analyzing the published archetypes. Clinical Team Revised archetypes. It was started in parallel of the Terminology Analysis – 45 days.
ADL Archetype Encoding Archetype encoding in ADL
  • Studying the ADL language;

  • Archetype encoding with the use of the linkEHR RC2 editor.

PRODEMGE technical team Archetypes encoded in ADL 45 days

Tab. 4.

Summary of the Archetypes for the Entries Created

Archetype Entry
ISO13606-SESMG-ENTRY. Alergias.v1 Alergias
(Allergies)
ISO13606-SESMG-ENTRY. Alimentacao.v1 Alimentação
(Diet)
ISO13606-SESMG-ENTRY. AntecedentesObstetricos.v1 Antecedentes Obstétricos
(Obstetrtic History)
ISO13606-SESMG-ENTRY.AntecedentesOdontologicos.v1 Antecedentes Odontológicos
(Dental History)
ISO13606-SESMG-ENTRY. AtividadesCotidianas.v1 Atividades Cotidianas
(Daily Activities)
ISO13606-SESMG-ENTRY.AvaliacaoFuncional.v1 Avaliação Funcional
(Functional Evaluation)
ISO13606-SESMG-ENTRY. Deficiências.v1 Deficiencias
(Disabilities)
ISO13606-SESMG-ENTRY.EstratifiçãcaoRisco.v1 Estratificacao de Risco
(Risk stratification)
ISO13606-SESMG-ENTRY.ExameClinicoOdontologico.v1 Exame Clínico Odontológico
(Dental Clinical Examination)
ISO13606-SESMG-ENTRY.ExameFisico.v1 Exame Físico
(Physical Examination)
ISO13606-SESMG-ENTRY. ExamesComplementares.v1 Exames Complementares
(Complementary Exams)
ISO13606-SESMG-ENTRY.GestacaoAtual.v1 Gestação Atual (Current Gestation)
ISO13606-SESMG-ENTRY.GigantesGeriatria.v1 Gigantes da Geriatria
(Giants of Geriatry)
ISO13606-SESMG-ENTRY. ImpressaoDiagnostica.v1 Impressão Diagnóstica
(Diagnostic Impression)
ISO13606-SESMG-ENTRY. Imunobiologicos.v1 Imunobilógicos
(Immunobiologicals)
ISO13606-SESMG-ENTRY. Medicamentos.v1 Medicamentos
(Medication)
ISO13606-SESMG-ENTRY. Mobilidade.v1 Mobilidade
(Mobilty)
ISO13606-SESMG-ENTRY. Parto.v1 Parto
(Childbirth / Delivery)
ISO13606-SESMG-ENTRY.Prenatal.v1 Pré-natal
(Prenatal care)
ISO13606-SESMG-ENTRY. Puerperio.v1 Puerpério
(Puerperium)

Note: these archetypes can be found in http://sres.saude.mg.gov.br

Most of the EHR system implementation is still work in progress in the Minas Gerais EHR project, but steps have been taken towards confirming that the archetypes are suitable. Firstly, the archetypes were validated by the clinical staff of SES/MG who considered the artefacts created through the use of the editor LinkEHR well able to reflect the information requirements set out. Secondly, these archetypes (ADL codes) were also published by the government6 in order to receive comments from the health community and vendors who participated in the bidding process for EMR systems – this bidding process was in progress at the time of this the study. Through these initiatives, we could confirm that all concepts and its constraints could be correctly formalized in ADL code. The clinical team now wishes to submit these archetypes to the openEHR CKM community in order to contribute with the international archetype development process.

6. Discussion

The two-level modelling approach is adherent to the requirements for the development of health information systems, particularly within architectures that aim for a high level of interoperability. In this sense, the archetypes are important semantic artefacts to achieve semantic interoperability between EMR/EHR systems [24–28]. However, few publications to date have specifically focused on the process of archetype development [20, 22, 29]. This paper contributes to our understanding of the necessary stages in the development of good quality archetypes that are based on the information model of ISO 13606 norm, and we present here the main lessons learned during this process.

In a traditional, top-down archetype modelling process, the candidate concepts are identified and discussed, taking many possible angles of analysis into consideration. In the present case, candidate concepts were chosen through a bottom-up approach; in other words, to represent the data elements previously chosen to constitute the PCS. As a result, all data elements in the central repository were mapped to archetypes. In this way, the constraints were mapped and made available, through ADL codes, to the EMR systems.

This EHR system architecture was designed to preserve semantics in a knowledge domain, and for this reason it includes a reference model, archetypes and terminology. These semantic artefacts are important to support the interoperability process. Thus, the architecture was designed to support interoperability between the EMR systems. However, full semantic interoperability will only be achieved when the SES/MG starts to use a descriptive terminology.

7. Lessons Learned

The lessons learned during this case study can be summed up in the following points:

  • The set of principles established during the selection of elements for the PCS guided the clinical team and helped them to keep focus on their objectives, even considering the participation of several other health professionals.

  • The process assumed to develop the archetypes began considering the structural organization of the artefacts. It was very important because the clinical team could then understand how the concepts (information) should be organized.

  • Each one of the data elements identified was subjected to a rigorous analysis aimed at determining the most suitable clinical domain. It forced the clinical team to undertake research into different sources of information resulting in more robust definitions for the concepts. Thus, only 10% of the data being free text (non-structured) constituted a gain in terms of knowledge representation and interoperability.

  • Levelling the concepts to accommodate them within the hierarchical levels in the reference model was definitely no easy task. The use of a mind mapping tool facilitated the modelling process, allowing the visualization of the relationship between concepts and data elements and the existing overlaps.

  • Part of the difficulty experienced by the clinical team in mapping the data elements into the candidate concepts was related to a view focused on the original forms previously used. The archetype modelling process should be oriented towards the formation of specialized concepts to allow their reutilization in different usage contexts.

  • The grouping of Entries into Sections proved to be a high complexity task, considering the still incipient modelling archetype knowledge the team possessed. Therefore, postponing this modelling until future archetype versions should increase the chances of their correct use.

  • The use of worksheets facilitated the modelling process by health professionals. Many of them had difficulties in using archetypes editors directly because of the technical skills that were required. These worksheets were created according the ISO 13606 reference model classes and data type definitions.

  • Considering the large number of domain table and classifications from the Federal Government in Brazil, it was very important to have a health professional that knew about this subject as member in the clinical team.

8. Conclusion

The archetype modelling process used by the SES/MG to support building the regional EHR system and the lessons learned are presented here. As a result, the steps in the archetype modelling process conducted by the SES/MG were identified and a spreadsheet of the generated archetypes was produced. From these results, one can see that the archetypes (referencing terminology, domain table and term lists) provided favourable conditions for the use of a controlled vocabulary between the central repository and the EMR systems, and probably, will increase the chances of preserving the semantics from the knowledge domain. It is worth stressing the degree of adequacy of the two-level approach that laid the foundation for the construction of the proposed architecture. Finally, the reference model from the ISO 13606 norm, along with the archetypes, proved sufficient to meet the specificities for the creation of a Regional EHR system for basic healthcare in a Brazilian state.

Clinical Relevance Statement

The archetypes are important semantic artefacts to achieve semantic interoperability among EMR/EHR systems. However, few publications to date have specifically focused on the process of archetype development. This paper contributes to our understanding of the necessary stages in the development of good quality archetypes that are based on the information model of ISO 13606 norm, and we present here the main lessons learned during this process.

Conflicts of interest

The authors declare that they have no conflicts of interest in the research.

Human Subjects Protections

The authors declare that human and animal subjects were not included in this research.

Acknowledgements

We acknowledge the support of the Health Secretary of State of Minas Gerais and Prodemge for the development of this research.

Footnotes

1 The openEHR approach is a comprehensive open specifications for EHR systems originally based on the results of the European Union’s GEHR-Project in the early 1990s – http://www.openehr.org.

2 Health Department of Minas Gerais State – http://www.saude.mg.gov.br.

3 Information technology company of Minas Gerais State – http://www.prodemge.gov.br

4 In this study we consider a descriptive terminology those destined to make the implicit knowledge in a data element explicit. Term lists and domain table from SES/MG should be the ones to determine the domain of one or various data elements.

5 The LinkEHR editor was developed by Universidad Politécnica de Valência. At this time, the linkEHR was the unique editor that allows to create archetypes based on the ISO 13606 reference model. More information can be found in http://www.linkehr.com.

6 The archetypes listed at the Table IV were published on the site: http://sres.saude.mg.gov.br.

References

  • 1.Beale T, Heard S. Archetype Definition Language 1.4: rev. 1.4.1 [Internet]. London: The OpenEHR Foundation, 2007. Available from: http://www.openehr.org/svn/specification/TRUNK/publishing/ architecture/am/adl1.4.pdf. [Google Scholar]
  • 2.International Organization For Standardization ISO 13606–1:2008: Health informatics, Electronic health record communication, Part 1: Reference model, 2008 [Internet]. Available from:http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm? csnumber=40784.
  • 3.Kalra D. Electronic Health Record Standards. IMIA Yearbook of Medical Informatics, 2006(1): 136–139 [PubMed] [Google Scholar]
  • 4.Kalra D, Barriers Approaches and research priorities for semantic interoperability in support of clinical care delivery. 2007. Available from:<http://www.semantichealth.org/DELIVERABLES/SemanticHealth_D4_l_final.pdf>.
  • 5.Garde S, Knaup P, Hovenga EJS, Heard S. Towards semantic interoperability for electronic health records: domain knowledge governance for openEHR archetypes. Methods Inf Med 2007; 46(3): 332–312 [DOI] [PubMed] [Google Scholar]
  • 6.Santos MR. Sistema de registro eletronico de saúde baseado na norma ISO 13606: aplicações na Secretaria de Estado de Saúde de Minas Gerais [Thesis]. Information Sciences Departament: Universidade Federal de Minas Gerais, 2011 [Google Scholar]
  • 7.Newell A. The knowledge level [Internet]. AI Magazine. Summer1982; 18(1). [Google Scholar]
  • 8.Gruber TR. A translation approach to portable ontology specifications. Knowledge Acquisition 1993; 5(2); 199–220 [Google Scholar]
  • 9.Guarino N. Formal Ontology, Conceptual analysis and knowledge representation. Int J Human-Computer Studies 1995; 43(5); 625–640 [Google Scholar]
  • 10.Guarino N, Giaretta P.Ontologies and knowledge bases: towards a terminological clarification. : Mars, NJI. Towards very large knowledge bases: knowledge building and knowledge sharing. Amsterdam:IOS Press, 1995. p. 25–32 [Google Scholar]
  • 11.Jonhnson SB. Model formulation: generic data modelling for clinical repositories. J Am Med Inform Assoc 1996; 3: 328–339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chandrasekaran B, Josephson R, Benjamins VR. Ontology of tasks and methods. Proceedings of the International Joint conference on artificial intelligence 1999; 16: 31–43 [Google Scholar]
  • 13.Kalra D.The Synapses project. Proceedings of the Current Perspectives in healthcare computing conference; 1997; Harrogate, England: BJHC, 1997. p. 405–406 [Google Scholar]
  • 14.Grimson J, Grimson W, Berry D, Stephens G, Felton E, Kalra D, et al. A CORBA – Based integration of distributed electronic healthcare records using the Synapses approach. IEEE Trans Inf Technol Biomed 1998; 2(3): 124–115 [DOI] [PubMed] [Google Scholar]
  • 15.Bird L, Goodchild A, Tun Z.Experiences with a two-level modelling approach to electronic health records. J Res Prac Inf Tech 2003; 35(2): 121–118 [Google Scholar]
  • 16.OpenEHR [Internet] OpenEHR future-proof and flexible; 2010. Available from:<http://www.openehr.org/about/origins.html>
  • 17.Michelsen L, Pedersen SS, Tilma HB, Andersen SK. Comparing different approaches to two-level modeling [Internet]. Stud. Health Tech. Inform. Amsterdam, 2005. Available from:<http://iospress.metapress.com/index/Ihm0l4hd611u9dey.pdf> [PubMed]
  • 18.Beale T, Heard S.Archetype definitions and principles: rev. 1.0 [Internet]. London: The OpenEHR Foundation, 2007. Available from: <http://www.openehr.org/svn/specification/TRUNK/publishing/architecture/am/archetype_principles.pdf> [Google Scholar]
  • 19.Beale T, Heard S. Architecture overview: rev. 1.1 [Internet]. London:: The OpenEHR Foundation,; 2007. Available from: < http://www.openehr.org/svn/specification/TRUNK/publishing/architecture/overview.pdf>. [Google Scholar]
  • 20.Hovenga EJS, Garde S, Carr T, Hullin CM. Innovative approaches and processes for capturing expert aged care knowledge for multiple purposes. Electronic Journal of Health Informatics 2007; 2(1). [Google Scholar]
  • 21.Leslie H, Heard S.Archetypes 101. Health Informatics Society of Australia Ltd; 2006 [Google Scholar]
  • 22.Buck J, Garde S, Kohl CD, Knaup P.Towards a comprehensive electronic patient record to support an innovative individual care concept for premature infants using the openEHR approach. Int J Med Inf 2009; 78(8): 521–511 [DOI] [PubMed] [Google Scholar]
  • 23.Santos MR, Bax MP, Kalra D.Building a logical EHR architecture based on ISO 13606 standard and Semantic Web Technologies. Proceedings of the World congress on medical and health informatics 13; 2010; South África, 2010 [PubMed] [Google Scholar]
  • 24.Kalra D.Clinical foundations and information architecture for the implementation of a federated health record service [Thesis]. University College London, London, 2002 [Google Scholar]
  • 25.Moner D, Maldonado JA, Bosca D, Fernández JT, et al. Archetype-based semantic integration and standardization of clinical data. Proceedings of the EMBS IEEE Annual International Conference 2006; 28 New York: IEEE [DOI] [PubMed] [Google Scholar]
  • 26.Atalag K.Archetype based domain modelling for health information systems [Thesis]. Department of Information Systems, The Graduate School of Informatics, Middle East Technical University, 2007 [Google Scholar]
  • 27.Munoz A, Somolinos R, Pascual M, Fragua JÁ, et al. Proof-of-concept design and development of an EN13606-based electronic health care record service. J Am Med Inform Assoc 2007; 14(1): 118–129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Wollersheim D, Sari A, Rahayu W.Archetype-based electronic health records: a literature review and evaluation of their applicability to health data interoperability and access. HIM J 2009; 38(2). [DOI] [PubMed] [Google Scholar]
  • 29.Conde AM.Towards best practice in the archetype development process [Dissertation]. Trinity College of Dublin, 2010 [Google Scholar]

Articles from Applied Clinical Informatics are provided here courtesy of Thieme Medical Publishers

RESOURCES