Additional notes

From HIVE Lab
Revision as of 21:09, 9 October 2024 by Hivelabwikiadmin (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Additional Notes

All the mapping files are available in the scripts repository in the folder: `pipeline/convert_step2/mapping`

The mapping files used for converting CIVIC are:

DOID:

  • `tcga_doid_mapping.csv`

CIVIC DOID child terms were mapped to DOID parent terms using the following table (generated from the OncoMX DOID mapping table):

CIVIC Entity Disease
Acral_Lentiginous_Melanoma_(DOID_6367)
Acute_Lymphoblastic_Leukemia_(DOID_9952)
Acute_Myelioid_Leukemia_(DOID_9119)
Acute_Promyelocytic_Leukemia_(DOID_0060318)
Adrenal_Gland_Pheochromocytoma_(DOID_0050892)
Angiosarcoma_(DOID_0001816)
Basal_Cell_Carcinoma_(DOID_2513)
Biliary_Tract_Cancer_(DOID_4607)
Bladder_Carcinoma_(DOID_4007)
Bladder_Urothelial_Carcinoma_(DOID_4006)
Bone_Marrow_Cancer_(DOID_4960)
Brain_Glioma_(DOID_0060108)
Breast_Cancer_(DOID_1612)
Breast_Carcinoma_(DOID_3459)
Bronchiolo-alveolar_Adenocarcinoma_(DOID_4926)
Cancer_(DOID_162)
Cervical_Cancer_(DOID_4362)
Cervix_Carcinoma_(DOID_2893)
Childhood_Acute_Lymphocytic_Leukemia_(DOID_0080144)
Childhood_Low-grade_Glioma_(DOID_0080830)
Childhood_Pilocytic_Astrocytoma_(DOID_6812)
Cholangiocarcinoma_(DOID_4947)
Chromophobe_Renal_Cell_Carcinoma_(DOID_4471)
Chronic_Leukemia_(DOID_1036)
Chronic_Lymphocytic_Leukemia_(DOID_1040)
Chronic_Myeloid_Leukemia_(DOID_8552)
Chronic_Neutrophilic_Leukemia_(DOID_0080187)
Chuvash_Polycythemia_(DOID_0060474)
Clear_Cell_Renal_Cell_Carcinoma_(DOID_4467)
Colon_Cancer_(DOID_219)
Colon_Mucinous_Adenocarcinoma_(DOID_3029)
Colorectal_Adenocarcinoma_(DOID_0050861)
Colorectal_Cancer_(DOID_9256)
Diffuse_Midline_Glioma_H3_K27M-mutant_(DOID_0080684)
Endometrial_Adenocarcinoma_(DOID_2870)
Endometrial_Cancer_(DOID_1380)
Endometrial_Hyperplasia_(DOID_0080365)
Endometrioid_Ovary_Carcinoma_(DOID_5828)
Epithelial_Ovarian_Cancer_(DOID_2152)
Esophageal_Cancer_(DOID_5041)
Esophagus_Squamous_Cell_Carcinoma_(DOID_3748)
Estrogen-receptor_Positive_Breast_Cancer_(DOID_0060075)
Ewing_Sarcoma_Of_Bone_(DOID_3386)
Follicular_Lymphoma_(DOID_0050873)
Gastrointestinal_Neuroendocrine_Tumor_(DOID_0050626)
Gastrointestinal_Stromal_Tumor_(DOID_9253)
Glioblastoma_(DOID_3068)
Hairy_Cell_Leukemia_(DOID_285)
Head_And_Neck_Cancer_(DOID_11934)
Head_And_Neck_Squamous_Cell_Carcinoma_(DOID_5520)
Intrahepatic_Cholangiocarcinoma_(DOID_4928)
Langerhans_Cell_Sarcoma_(DOID_7146)
Laryngeal_Squamous_Cell_Carcinoma_(DOID_2876)
Leukemia_(DOID_1240)
Li-Fraumeni_Syndrome_(DOID_3012)
Lung_Adenocarcinoma_(DOID_3910)
Lung_Cancer_(DOID_1324)
Lung_Small_Cell_Carcinoma_(DOID_3908)
Lung_Small_Cell_Carcinoma_(DOID_5409)
Lymphoid_Leukemia_(DOID_1037)
Lymphoma_(DOID_0060058)
Malignant_Exocrine_Pancreas_Neoplasm_(DOID_1795)
Malignant_Mesothelioma_(DOID_1790)
Mammary_Analogue_Secretory_Carcinoma_(DOID_0080808)
Medulloblastoma_(DOID_0050902)
Melanoma_(DOID_1909)
Merkel_Cell_Carcinoma_(DOID_3965)
Mucosal_Melanoma_(DOID_0050929)
Multiple_Myeloma_(DOID_9538)
Myelodysplastic_Syndrome_(DOID_0050908)
Myeloid_And_Lymphoid_Neoplasms_With_Eosinophilia_And_Abnormalities_Of_PDGFRA_PDGFBR_And_FGFR1_(DOID_0060908)
Neuroblastoma_(DOID_769)
Oligodendroglioma_(DOID_3181)
Osteosarcoma_(DOID_3347)
Ovarian_Cancer_(DOID_2394)
Ovarian_Clear_Cell_Carcinoma_(DOID_0050934)
Ovarian_Granulosa_Cell_Tumor_(DOID_2999)
Ovarian_Serous_Carcinoma_(DOID_0050933)
Ovarian_Sex-cord_Stromal_Tumor_(DOID_0080369)
Ovary_Serous_Adenocarcinoma_(DOID_5744)
PTEN_Hamartoma_Tumor_Syndrome_(DOID_0080191)
Pancreatic_Adenocarcinoma_(DOID_4074)
Pancreatic_Cancer_(DOID_1793)
Pancreatic_Carcinoma_(DOID_4905)
Polycythemia_Vera_(DOID_8997)
Prostate_Cancer_(DOID_10283)
Rectum_Cancer_(DOID_1993)
Renal_Carcinoma_(DOID_4451)
Renal_Cell_Carcinoma_(DOID_4450)
Rhabdomyosarcoma_(DOID_3247)
Sertoli-Leydig_Cell_Tumor_(DOID_2997)
Skin_Melanoma_(DOID_8923)
Skin_Squamous_Cell_Carcinoma_(DOID_3151)
Solid_Tumor_(DOID_3260)
Spindle_Cell_Rhabdomyosarcoma_(DOID_3260)
Stomach_Cancer_(DOID_10534)
Stomach_Carcinoma_(DOID_5517)
Systemic_Mastocytosis_(DOID_349)
T-cell_Acute_Lymphoblastic_Leukemia_(DOID_5603)
Thymic_Carcinoma_(DOID_3284)
Thyroid_Gland_Anaplastic_Carcinoma_(DOID_0080522)
Thyroid_Gland_Cancer_(DOID_1781)
Thyroid_Gland_Carcinoma_(DOID_3963)
Thyroid_Gland_Hurthle_Cell_Carcinoma_(DOID_8161)
Thyroid_Gland_Medullary_Carcinoma_(DOID_3973)
Thyroid_Gland_Papillary_Carcinoma_(DOID_3969)
Transitional_Cell_Carcinoma_(DOID_2671)
Tuberous_Sclerosis_(DOID_13515)
Villous_Adenoma
Von_Hippel-Lindau_Disease_(DOID_14175)
Waldenstrom's_Macroglobulinemia_(DOID_0060901)

Uniprot Accession:

  • `human_protein_transcriptlocus.csv`

Transcript ID (starts with ENSP) was mapped to uniprot isoform accession.

Mapping was NOT performed to uniprot canonical accession as this resulted in an issue with the final dataset in which a mutation for the same canonical accession would be listed with different amino acid changes.