GW Data Commons (GWDC) Data: Difference between revisions

From HIVE Lab
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
 
Line 4: Line 4:
The GW Data Commons (GWDC) data houses EMR data from GW Medical Faculty Associates (MFA) Caboodle Warehouse. It currently contains two prostate cancer datasets. Pending approval from the GW Institutional Review Board (IRB), it will house electronic medical records (EMR) data on multiple cancer types for 900,000+ MFA patients.
The GW Data Commons (GWDC) data houses EMR data from GW Medical Faculty Associates (MFA) Caboodle Warehouse. It currently contains two prostate cancer datasets. Pending approval from the GW Institutional Review Board (IRB), it will house electronic medical records (EMR) data on multiple cancer types for 900,000+ MFA patients.


== De-identification ==
== [[GW-FEAST Data De-identification|De-identification]] ==
Data have been thoroughly de-identified pursuant to the HIPAA Safe Harbor provision.
The [[GWDC De-identification Tool|GW Data Commons (GWDC) de-identification tool]] is used to de-identify all GW-FEAST datasets prior to harmonization. This protocol was authored by Dr. Robel Kahsay of the Mazumder Research Group.
 
=== GWDC De-identification Workflow ===
[[File:Nbcc deidn tool v1.0.png|frameless|743x743px]]


== Dataset Information ==
== Dataset Information ==

Latest revision as of 14:48, 20 March 2025

Go Back to the GW-FEAST Home Page.

Introduction

The GW Data Commons (GWDC) data houses EMR data from GW Medical Faculty Associates (MFA) Caboodle Warehouse. It currently contains two prostate cancer datasets. Pending approval from the GW Institutional Review Board (IRB), it will house electronic medical records (EMR) data on multiple cancer types for 900,000+ MFA patients.

De-identification

The GW Data Commons (GWDC) de-identification tool is used to de-identify all GW-FEAST datasets prior to harmonization. This protocol was authored by Dr. Robel Kahsay of the Mazumder Research Group.

Dataset Information

The GWDC currently contains two prostate cancer datasets: imaging data (diagnosis event, cancer staging, demographics, and MRI images) and EMR data for 423 patients.

Data Sample

To download a de-identified single-patient GWDC dataset, please visit GW-FEAST De-identified Data Templates.

References