National Breast Cancer Coalition (NBCC) Data: Difference between revisions
Lorikrammer (talk | contribs) mNo edit summary |
Lorikrammer (talk | contribs) mNo edit summary |
||
Line 2: | Line 2: | ||
The National Breast Cancer Coalition (NBCC) data contains twelve terabytes of -omics data for 26,546 participants. This data contains a cohort of 575 breast cancer patients and 3,479 patient family members. The original data was collected through the DNA.Land project (<nowiki>PMID 29374253</nowiki>) and is currently housed on the GW-FEAST data browser. Access to this data is restricted and must be approved by NBCC on a case-by-case basis through the NBCC Data Access Request (DAR) Form. | The National Breast Cancer Coalition (NBCC) data contains twelve terabytes of -omics data for 26,546 participants. This data contains a cohort of 575 breast cancer patients and 3,479 patient family members. The original data was collected through the DNA.Land project (<nowiki>PMID 29374253</nowiki>) and is currently housed on the GW-FEAST data browser. Access to this data is restricted and must be approved by NBCC on a case-by-case basis through the NBCC Data Access Request (DAR) Form. | ||
== De-identification == | === De-identification === | ||
The NBCC data has been de-identified using a tool (NBCC De-IDN Tool v1.0) developed by Dr. Robel Kahsay from our group<sup>1</sup>, which uses the Safe Harbor approach to de-identification. This tool is specific to the NBCC data and any changes to the parameters will be captured in subsequent versions. | The NBCC data has been de-identified using a tool (NBCC De-IDN Tool v1.0) developed by Dr. Robel Kahsay from our group<sup>1</sup>, which uses the Safe Harbor approach to de-identification. This tool is specific to the NBCC data and any changes to the parameters will be captured in subsequent versions. | ||
[[File:Nbcc deidn tool v1.0.png|frameless|743x743px]] | [[File:Nbcc deidn tool v1.0.png|frameless|743x743px]] | ||
== Data Schema == | === Data Schema === | ||
== Data Sample == | === Data Sample === | ||
To download a de-identified single-patient NBCC dataset, please visit [[GW-FEAST De-identified Data Templates]]. | To download a de-identified single-patient NBCC dataset, please visit [[GW-FEAST De-identified Data Templates]]. | ||
== References == | == References == | ||
<sup>1</sup>Dr. Robel Kahsay (Mazumder Lab, Dept. of Biochemistry & Molecular Medicine). GW Review Board (IRB), FWA00005945. Subject: NCR224302, "Analysis of prostate MRI image data and its integration with biomedical data" Haji-Momenian, Shahriar (Shawn), MD, Sepulveda, Jorge, MD, PhD; Whalen, Michael, MD; Kahsay, Robel, PhD | <sup>1</sup>Dr. Robel Kahsay (Mazumder Lab, Dept. of Biochemistry & Molecular Medicine). GW Review Board (IRB), FWA00005945. Subject: NCR224302, "Analysis of prostate MRI image data and its integration with biomedical data" Haji-Momenian, Shahriar (Shawn), MD, Sepulveda, Jorge, MD, PhD; Whalen, Michael, MD; Kahsay, Robel, PhD |
Revision as of 19:33, 13 March 2025
Introduction
The National Breast Cancer Coalition (NBCC) data contains twelve terabytes of -omics data for 26,546 participants. This data contains a cohort of 575 breast cancer patients and 3,479 patient family members. The original data was collected through the DNA.Land project (PMID 29374253) and is currently housed on the GW-FEAST data browser. Access to this data is restricted and must be approved by NBCC on a case-by-case basis through the NBCC Data Access Request (DAR) Form.
De-identification
The NBCC data has been de-identified using a tool (NBCC De-IDN Tool v1.0) developed by Dr. Robel Kahsay from our group1, which uses the Safe Harbor approach to de-identification. This tool is specific to the NBCC data and any changes to the parameters will be captured in subsequent versions.
Data Schema
Data Sample
To download a de-identified single-patient NBCC dataset, please visit GW-FEAST De-identified Data Templates.
References
1Dr. Robel Kahsay (Mazumder Lab, Dept. of Biochemistry & Molecular Medicine). GW Review Board (IRB), FWA00005945. Subject: NCR224302, "Analysis of prostate MRI image data and its integration with biomedical data" Haji-Momenian, Shahriar (Shawn), MD, Sepulveda, Jorge, MD, PhD; Whalen, Michael, MD; Kahsay, Robel, PhD