All submissions of the EM system will be redirected to Online Manuscript Submission System. Authors are requested to submit articles directly to Online Manuscript Submission System of respective journal.
Shinji Kobayashi1*, Naoto Kume1, Takahiro Nakahara2 and Hiroyuki Yoshihara2
1 Department of Electronic Health Record, Graduate School of Medicine, Kyoto University, Kyoto, Japan, Email: [email protected]
2 Kyushu Dental University, Fukuoka, Japan
*Correspondence: Shinji Kobayashi, Department of Electronic Health Record, Graduate School of Medicine, Kyoto University, Kyoto, Japan, Email: [email protected]

Received Date: Nov 28, 2017 / Accepted Date: Dec 30, 2017 / Published Date: Jan 10, 2018

This open-access article is distributed under the terms of the Creative Commons Attribution Non-Commercial License (CC BY-NC) (, which permits reuse, distribution and reproduction of the article, provided that the original work is properly cited and the reuse is restricted to noncommercial purposes. For commercial reuse, contact [email protected]


Objectives: We developed an electronic health records (EHR) system for regional healthcare in 2000. This EHR stores the health records of more than 6,300 patients in two regions of Japan; however, clinical updates and improved interoperability with other clinical standards, such as HL7 or others are needed. In 2015, this EHR system was upgraded to create a nationwide-scale healthcare data repository to improve the interoperability of clinical data with openEHR technology.

Methods: The clinical data in our EHR system has 16 components constructed with Medical Markup Language (MML) standards and periodic mass screening for employees and students. Therefore, we constructed mindmaps of the clinical MML and surveillance data to analyse the concept models. Based on mindmap analysis, we designed archetypes of the concepts identified using Ocean Archetype Editor. The artefacts were mainly quoted from the openEHR clinical knowledge manager (CKM). As the archetypes on CKM did not include all MML semantics, the archetypes were newly designed to complement the semantics of the MML.

Results: We developed clinical information models by archetypes that semantically equalled the EHR system. Twenty-one MML components/modules and concept models using 99 archetypes were constructed for periodic mass screening services. Most of the archetypes were quoted from CKM; however, 22 archetypes were specialised, and eight archetypes were newly designed. The reasons for specialisation were to adjust the demographics to Japanese and to extend the archetypes to the dental domain.

Conclusion: We constructed concept models with archetypes semantically equivalent to conventional data and developed new archetypes for mass screening by archetype technology. The suggested archetype technology improved the flexibility of the EHR system to cover the existing standards.


Archetype; Data modelling; Electronic health records and systems; HL7; Standards adoption; Semantic interoperability


In 2003, we developed an electronic health records (EHR) system called the “Dolphin project” for regional health providers to share clinical records and provide individual patient data to patients [1]. We also developed an XMLbased clinical information standard called Medical Markup Language (MML) to share clinical data within and among hospitals [2, 3].

We reported that this EHR system was working successfully, with more than 2,100 patients in two regions in Japan in 2010 [4], and 6,300 patients were registered in this system in 2017. Consequently, we launched a project to expand this EHR nationwide to collect clinical data from more hospitals, implement mass screening services for the nation, and reuse the data for research and public health purposes. However, the Dolphin system has three problems that limit its ability to meet the requirements of the next EHR system. First, the Dolphin system has not been sufficiently maintained or updated because it was built on a legacy web framework and was written in Perl language. Second, the Dolphin system adopted MML for the data models, but MML does not meet the clinical/technical demands for emerging clinical updates. Therefore, the Dolphin system lacks interoperability with other standards, such as Health Level Seven (HL7) [5] and others [6, 7, 8]. Interoperability is an emerging issue for the new project to utilise HL7 standardised data available from healthcare systems.

OpenEHR is the basis of the ISO 13606 [9, 10, 11, 12, 13] standard that enables interoperability of clinical concepts [14] and has been widely adopted to build other standards [15, 16]. The Clinical Information Modeling Initiative, an HL7 working group, is developing a common clinical concept model using openEHR archetype technology [17].


We decided to reconstruct the EHR system using a modern development process, with common clinical concept models based on the ISO 13606 standard with the openEHR specification, to improve flexibility and interoperability for collecting multiple types of formatted data with various standards and catch up requirements of the nationwide EHR project in Japan.


The data sources for this EHR project were hospitals that were already involved in the Dolphin project and performed periodic mass screening of employees and students. Healthcare data were collected from healthcare providers and screening services. The data were to be formatted using 16 MML content modules and others. MML data modules were analysed and constructed with semantically equivalent archetypes [18]. The mass screening service for employees adopted the Health Level Seven Clinical Document Architecture Release 2 (HL7 CDA R2), so that the surveillance data were collected by HL7 CDA R2 [19]. Because the mass screening service for students did not adopt any standards, raw data were used. Figure 1 shows all datasets analysed with regard to item, structure, components, and concepts. Ultimately 40 mindmaps were constructed.


Figure 1: Nationwide electronic health records (EHR) system. Healthcare data will be collected from hospitals, schools, and companies by HL7, MML, original format for mass screening, or others. The EHR system stores healthcare data in the ISO 13606/openEHR repository for primary/secondary use with/without anonymisation or de-identification.

Based on mindmap analysis, archetypes and templates were designed using Ocean Archetype Editor [20] and Ocean Template Designer [21]. The artefacts were mainly quoted from the openEHR clinical knowledge manager (CKM) [22] with or without modifications to maintain interoperability with regard to other standards. As the archetypes on CKM could not cover all case semantics, the archetypes were newly designed to complement the semantics used.


All datasets were constructed on 18 templates and mapped with each corresponding data source (Table 1). Archetypes were reused for multiple templates, and Table 2 shows common archetypes in representative cases.

Table 1: Number of archetypes used for each concept.

Use case Number of archetypes
Personal information 8
Creator information 7
Patient information 9
Health insurance 10
Diagnosis record 2
Lifestyle information 5
Basic clinical information 5
Initial consultation 14
Progress course 15
Surgical operation record 13
Clinical summary 19
Laboratory test 11
Report information 18
Referral letter 22
Mass screening for employee 36
Mass screening for students 50

Table 2: Frequently reused archetypes in multiple cases.

Archetypes Total Diagnostic report Progress notes Clinical summary Surgical operation record Flow sheet Referral Mass screening for employee Mass screening for students
CLUSTER.organisation-mml 26 6     5 3 6    
CLUSTER.citation 21 1 8   1   4    
EVALUATION.problem_diagnosis 15 1 1   1   1 2 9
EVALUATION.citation 13 1 4       3    
CLUSTER.person_name-mml 12 3     4   2    
CLUSTER.individual_professional-mml 10 3     2   2    
EVALUATION.clinical_synopsis 10 1 1   1 2 2    
CLUSTER.device 9         4     5
EVALUATION.health_risk 8             8  
OBSERVATION.progress_note 8   3 2     3    
INSTRUCTION.medication_order 7   4 1     2    
CLUSTER.address-japan 7     3 2   2    
OBSERVATION.exam 6   1 1       2 2
CLUSTER.telecom_details-mml 6     2 2   2    
OBSERVATION.story 5   1 1     2 1  
OBSERVATION.body_weight 4         1   2 1
INSTRUCTION.request 4 1 3            
CLUSTER.medication_preparation 4   2 1     1    
CLUSTER.daily_timing 4   2 1     1    

Figure 2 displays an example of the mindmap for a dental examination of teeth in the mass screening for students.


Figure 2: Mindmap for the examination of teeth.

Table 1 shows each case and the number of archetypes used to construct semantically equivalent data schema. All components were designed with 1–50 archetypes and were fully conformant with openEHR specifications [23]. The most complex model was the mass screening for students, which required 50 archetypes. Mass screening for students requires a dental examination (Table 1). Figure 3 provides an overview of the archetype hierarchy in for mass screening of students. The report information model was designed for use with general reporting documents, but it had to be redesigned for use with specific reports, such as imaging reports, pathological diagnosis reports, and other diagnostic reports.


Figure 3: Overview of the hierarchy of archetypes in mass screening for students.

Table 3 shows the details of archetypes used to construct logical data models. A total of 92 archetypes were needed to construct the models, and 62 archetypes were quoted from CKM without any changes. However, 22 archetypes had to be modified, and eight archetypes were created from scratch to conform to the concept models. Reasons for modifying archetypes had to do mainly with localising for demographics, insurance claims, and the inclusion of specific information for dentistry (Table 4).

Table 3: Archetype construction using the case information model.

Archetype Count of use
No change 62
Specialised 22
Newly created 8

Table 4: Example of rationales for specialised archetypes.

Concept model Rationale
Patient name To record ideographic (Kanji), syllabic (Furigana), and alphabetic names
Address To record addresses in ideographic, syllabic, and alphabetic form
Telecoms To adjust the item hierarchy to suit Japanese telecoms
Procedure To match claim information to insurance systems
Medication To record Japanese-style preparation and claims information for insurance purposes
Report To record the report date, name of requester, responder to context field
Dental examination To record specific expression for dentists

Because Health Level Seven version 2.5 (HL7 2.5) messages were also mapped to the constructed archetypes, this EHR system obtained the HL7 2.5 interface. Although the interface of the old EHR system was only MML, the newly developed EHR system used the following interfaces mapped to ISO 13606/ openEHR archetypes:

• ISO 13606/openEHR native API


• HL7 2.5

• HL7 CDA R2

• Original data format for mass screening


An EHR system requires semantic interoperability for multiple use cases, such as point of care, clinical research, and public health [24]. To achieve interoperability, EHR should have a commonly agreed upon logical model [25]. Therefore, the openEHR project has developed the CKM to share archetype concept models [26] to improve the interoperability of EHR.

The CKM is also a repository for international collaborative development of clinical concepts, and the CKM reported that 679 archetypes have been developed (cited on 19 December 2017). Archetypes on CKM are designed for general international use; however, in our case, the archetypes had to be localised mainly to express local demographics as well as the inclusion of dental examinations. As demographic expressions in CKM are widely diverse from MML, it was not possible to cover all of them. Specialising the existing archetypes was a reasonable way to localise them. Thus, although the existing HL7 and MML data models captured in the archetypes had an impedance mismatch in terms of information granularity, it was possible to devise equivalent forms with 92 archetypes; in fact, 67% (62/92) of them were adopted with no modifications. Although 1,821 users are registered with CKM, there are only 34 dentists (CKM statistics page, retrieved on 19 December 2017). Thus, there are few archetypes for dentistry information on CKM, and we had to modify the archetypes to record data from dental examinations.

Moreover, the artefacts can be modified with other archetypes to describe other cases immediately, suggesting that creating a standardised clinical concept in the openEHR project would have more potential than making an EHR system with existing data models. However, we needed more clinical concepts to meet the unique demand of local clinicians and specialists.

Because archetype concept models have fine-grained items, they are capable of covering multiple standardised formats, such as HL7 2.5 and HL7 CDA, among others. As archetypes must be interoperable, the concept models in this study could be used with other standards. In addition, the old EHR system had only an MML interface, but the new EHR system uses five interfaces with archetype technology.

We are currently implementing an EHR system with these archetype concept models and, in the near future, plan to demonstrate interoperability for secondary use.


We constructed concept models with archetypes for a nationwide EHR system, which is semantically equivalent to the legacy EHR. Mass screening of the newly developed archetypes was conducted, and 11 dental archetypes were modified based on existing archetypes. The suggested archetype technology improved flexibility for collecting health data from multiple existing standards.

Clinical Relevance Statement

The developed clinical concept models can be used to describe clinical contents. Using these models would clarify the semantics of clinical descriptions and assessments and improve machine processing for clinical data. Thus, this project should contribute to clinical data management and the aid in the discovery of novel findings from EHRs.

Conflict of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

Human Subjects Protections

The study was performed in compliance with the World Medical Association Declaration of Helsinki on Ethical Principles for Medical Research Involving Human Subjects and Japanese laws and guidelines for privacy and security.


This research was supported by the “Clinical Study Oriented ICT Infrastructure Development Project - Sustainable Massive Health and Clinical Data Repository for Secondary Use” of the Japan Agency for Medical Research and Development, AMED.