GW Data Commons (GWDC) Data

From HIVE Lab
Revision as of 14:48, 20 March 2025 by Lorikrammer (talk | contribs)


Introduction

The GW Data Commons (GWDC) houses electronic medical records (EMR) data from the GW Medical Faculty Associates (MFA) Caboodle Warehouse. It currently contains two prostate cancer datasets. Pending approval from the GW Institutional Review Board (IRB), it will house EMR data on multiple cancer types for 900,000+ MFA patients.

De-identification

The GWDC de-identification tool is used to de-identify all GW-FEAST datasets prior to harmonization. The de-identification protocol was authored by Dr. Robel Kahsay of the Mazumder Research Group.

Dataset Information

The GWDC currently contains two prostate cancer datasets: imaging data (diagnosis event, cancer staging, demographics, and MRI images) and EMR data for 423 patients.

Data Sample

To download a de-identified single-patient GWDC dataset, please visit GW-FEAST De-identified Data Templates.
