GW-FEAST Data De-identification: Difference between revisions

From HIVE Lab
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
 
Line 2: Line 2:


== Introduction ==
== Introduction ==
All GW-FEAST datasets housed within the secure data environment are de-identified. This process varies slightly between data sources and each data source has it's own versioned de-identification tool and protocol. Our implementation follows the ''Safe Harbor'' recommended deidentification protocol with modifications based on the ''Expert Determination'' method. This de-identification workflow is illustrated below.
All GW-FEAST datasets housed within the secure data environment are de-identified. This process varies slightly between data sources; each has its own versioned de-identification tool and protocol. Our implementation follows the ''Safe Harbor'' recommended deidentification protocol with modifications based on the ''Expert Determination'' method. This de-identification workflow is illustrated below.


=== De-identification Workflow ===
=== De-identification Workflow ===

Latest revision as of 11:56, 21 March 2025

Go Back to the GW-FEAST Home Page.

Introduction

All GW-FEAST datasets housed within the secure data environment are de-identified. This process varies slightly between data sources; each has its own versioned de-identification tool and protocol. Our implementation follows the Safe Harbor recommended deidentification protocol with modifications based on the Expert Determination method. This de-identification workflow is illustrated below.

De-identification Workflow

De-identification Data Templates

Single-patient data templates for each data source are available for download on the GW-FEAST De-identified Data Templates page.

GWDC De-identification Tool

The GW Data Commons (GWDC) de-identification tool is used to de-identify all GWDC datasets prior to harmonization. This protocol was authored by Dr. Robel Kahsay of the Mazumder Research Group. More information is available on the GWDC De-identification Tool page.

NBCC De-identification Tool

The National Breast Cancer Coalition de-identification tool is used to de-identify all NBCC datasets prior to harmonization. This protocol was authored by Dr. Robel Kahsay of the Mazumder Research Group and v1.0 of the tool has been approved by a representative at NBCC. Each iteration of the tool will have written approval by NBCC before use. More information is available on the NBCC Data page.