EDS-Privacy

Pseudonymisation and Epidemiological Research Reliability: a Tailored Approach Using a Clinical Data Warehouse
Completed
03/06/2024

-

31/05/2025

Main objectives

The sensitivity of health data requires special protection, and a number of technical and organizational procedures have been put in place within the AP-HP's Health Data Warehouse (HDW) to protect patients' personal data as effectively as possible, while enabling research projects to be carried out. Among these procedures, pseudonymization (removal of directly identifying data) and minimization (limiting data to what is necessary for the research) of the data studied reinforce this protection, but they are not neutral as they can impact the quality of the analyses carried out by the research teams. The aim of this project is to evaluate and optimize pseudonymization/minimization methods for different epidemiological study situations, in order to adapt them to the context of research on massive health data.

Challenge

A large amount of data is collected in hospitals and other healthcare facilities as part of the healthcare process, and offers significant potential for research, innovation and the improvement of healthcare systems. Centralizing data in health data warehouses (HDWs) represents an opportunity to support scientific research and innovation in the healthcare field, and to facilitate the management of hospital activity. The sensitivity of health data calls for special protection, and various measures have been implemented to ensure that patients' privacy is protected, while enabling a wide range of health research to be carried out.

Methodology/Technology used

The project consists of evaluating 3 algorithms for pseudonymizing structured data corresponding to the patient pathway in the context of 6 representative epidemiological studies.

Publications

Contacts

Ariel Cohen ()

Members

AP-HP, Inria

Other projects

COVIPREDS

Description

US Caractérisation et prédiction de la survenue de formes graves ou létales du COVID-19 à partir des données issues de l’EDS de l’AP-HP

Names of partners involved
AP-HP, Inria & Centrale Supélec

read more