Proposal and Assessment of a De-Identification Strategy to Enhance Anonymity of the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) in a Public Cloud-Computing Environment: Anonymization of Medical Data Using Privacy Models

Jeon, Seungho; Seo, Jeongeun; Kim, Sukyoung; Lee, Jeongmoon; Kim, Jong-Ho; Sohn, Jang Wook; Moon, Jongsub; Joo, Hyung Joon

doi:10.2196/19597

Authors: Jeon, Seungho; Seo, Jeongeun; Kim, Sukyoung; Lee, Jeongmoon; Kim, Jong-Ho; Sohn, Jang Wook; Moon, Jongsub; Joo, Hyung Joon

Issue Date: 26-Nov-2020

Publisher: JMIR PUBLICATIONS, INC

Keywords: de-identification; privacy; anonymization; common data model; Observational Health Data Sciences and Informatics

Citation: JOURNAL OF MEDICAL INTERNET RESEARCH, v.22, no.11

Indexed: SCIE; SCOPUS

Journal Title: JOURNAL OF MEDICAL INTERNET RESEARCH

Volume: 22

Number: 11

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/51447

DOI: 10.2196/19597

ISSN: 1438-8871

Abstract: Background: De-identifying personal information is critical when using personal health data for secondary research. The Observational Medical Outcomes Partnership Common Data Model (CDM), defined by the nonprofit organization Observational Health Data Sciences and Informatics, has been gaining attention for its use in the analysis of patient-level clinical data obtained from various medical institutions. When analyzing such data in a public environment such as a cloud-computing system, an appropriate de-identification strategy is required to protect patient privacy.
Objective: This study proposes and evaluates a de-identification strategy that comprises several rules along with privacy models such as k-anonymity, l-diversity, and t-closeness. The proposed strategy was evaluated using an actual CDM database.
Methods: The CDM database used in this study was constructed by the Anam Hospital of Korea University. Analysis and evaluation were performed using the ARX anonymizing framework in combination with the k-anonymity, l-diversity, and t-closeness privacy models.
Results: The CDM database, which was constructed according to the rules established by Observational Health Data Sciences and Informatics, exhibited a low risk of re-identification: the highest re-identifiable record rate (11.3%) in the dataset was exhibited by the DRUG_EXPOSURE table, with a re-identification success rate of 0.03%. However, because all tables include at least one "highest risk" value of 100%, suitable anonymizing techniques are required; moreover, the CDM database preserves the "source values" (raw data), a combination of which could increase the risk of re-identification. Therefore, this study proposes an enhanced strategy to de-identify the source values, significantly reducing not only the highest risk in the k-anonymity, l-diversity, and t-closeness privacy models but also the overall possibility of re-identification.
Conclusions: Our proposed de-identification strategy effectively enhanced the privacy of the CDM database, thereby encouraging clinical research involving multiple centers.
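
The abstract refers to k-anonymity, l-diversity, and a "highest risk" of 100%. The following is a minimal Python sketch of how such quantities are commonly computed, not the ARX implementation and not the authors' method. It assumes that records sharing the same quasi-identifier values form an equivalence class and that the highest risk equals 1 divided by the size of the smallest class (so a unique record gives 100%). The column names, the sample records, and the decade generalization are hypothetical and are not taken from the CDM database in the study.

```python
from collections import defaultdict


def equivalence_classes(records, quasi_identifiers):
    """Group records that share the same quasi-identifier values."""
    classes = defaultdict(list)
    for row in records:
        key = tuple(row[q] for q in quasi_identifiers)
        classes[key].append(row)
    return classes


def k_anonymity(classes):
    """k is the size of the smallest equivalence class."""
    return min(len(rows) for rows in classes.values())


def highest_risk(classes):
    """Assumed definition: 1 / (smallest class size); 1.0 means 100%."""
    return 1.0 / k_anonymity(classes)


def l_diversity(classes, sensitive):
    """Distinct l-diversity: fewest distinct sensitive values in any class."""
    return min(len({row[sensitive] for row in rows}) for rows in classes.values())


def generalize_year(row):
    """Hypothetical generalization rule: replace birth year by its decade."""
    return dict(row, year_of_birth=row["year_of_birth"] // 10 * 10)


# Hypothetical toy records; the field names are illustrative only.
records = [
    {"year_of_birth": 1950, "gender": "F", "drug": "aspirin"},
    {"year_of_birth": 1950, "gender": "F", "drug": "statin"},
    {"year_of_birth": 1953, "gender": "F", "drug": "aspirin"},
    {"year_of_birth": 1987, "gender": "M", "drug": "insulin"},
    {"year_of_birth": 1982, "gender": "M", "drug": "metformin"},
]
qi = ["year_of_birth", "gender"]

raw = equivalence_classes(records, qi)
generalized = equivalence_classes([generalize_year(r) for r in records], qi)

print("raw:         k =", k_anonymity(raw), "highest risk =", highest_risk(raw))
print("generalized: k =", k_anonymity(generalized),
      "highest risk =", highest_risk(generalized),
      "l =", l_diversity(generalized, "drug"))
```

In this toy data, three records are unique on the exact birth year and gender, so the raw highest risk is 100%; grouping birth years into decades merges them into larger classes and lowers that risk. t-closeness, which compares the distribution of the sensitive attribute in each class with its overall distribution, is omitted here for brevity.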

Appears in Collections: College of Medicine > Department of Medical Science > 1. Journal Articles; College of Science and Technology > Department of Electronics and Information Engineering > 1. Journal Articles
