Projects: Difference between revisions

From HIVE Lab
Jump to navigation Jump to search
No edit summary
No edit summary
 
(10 intermediate revisions by 2 users not shown)
Line 14: Line 14:
         <h3>[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main The High-performance Integrated Virtual Environment (HIVE) platform]</h3>
         <h3>[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main The High-performance Integrated Virtual Environment (HIVE) platform]</h3>
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
HIVE is a cloud-based environment optimized for the storage and analysis of extra-large data, such as biomedical data, clinical data, next-generation sequencing (NGS) data, mass spectrometry files, confocal microscopy images, post-market surveillance data, medical recall data, and many others. HIVE provides secure web access for authorized users to deposit, retrieve, annotate and compute on Big Data, and analyze the outcomes using web user interfaces. [https://docs.google.com/document/d/1F5iq00uKkJfdSsbwanvKOy-nPnwijH56mwbwa_HhzfY/edit?tab=t.0#heading=h.7dlfmngwfzih More here].
HIVE is a cloud-based environment optimized for the storage and analysis of extra-large data, such as biomedical data, clinical data, next-generation sequencing (NGS) data, mass spectrometry files, confocal microscopy images, post-market surveillance data, medical recall data, and many others. HIVE provides secure web access for authorized users to deposit, retrieve, annotate and compute on Big Data, and analyze the outcomes using web user interfaces. [https://docs.google.com/document/d/1F5iq00uKkJfdSsbwanvKOy-nPnwijH56mwbwa_HhzfY/edit?tab=t.0#heading=h.7dlfmngwfzih More here].<br><br>
[https://hivelab.biochemistry.gwu.edu/wiki/ HIVE LAB WIKI]
         </div>
         </div>
     </div>
     </div>
Line 21: Line 22:
         <h3>[https://www.biocomputeobject.org/ BioCompute Objects (BCO)]</h3>
         <h3>[https://www.biocomputeobject.org/ BioCompute Objects (BCO)]</h3>
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
The BioCompute is FDA funded project to establish a framework for community-based development of standards for harmonization of High-throughput Sequencing (HTS), standardization of data formats, promotion of interoperability, and bioinformatics verification protocols. The BioCompute Object (BCO) was developed in the High-throughput Sequencing Computational Standards for Regulatory Sciences (HTS-CSRS) initiative in the BioCompute Objects Portal (BOP), a web portal to serve as a collaborative ground to encourage a dialogue to facilitate interoperability between different bioinformatic pipelines, industries, and developers. HIVE capabilities have been leveraged to support the development of the BCO. The BCO is versatile and adaptable to other common HTS analysis platforms. [https://docs.google.com/document/d/1WQFZm_PFiQXob4NyOKq6y-2ywnbmNoFHSS27fYf3l4Y/edit?tab=t.0#heading=h.bs8eki17tykx More here].
The BioCompute is FDA funded project to establish a framework for community-based development of standards for harmonization of High-throughput Sequencing (HTS), standardization of data formats, promotion of interoperability, and bioinformatics verification protocols. The BioCompute Object (BCO) was developed in the High-throughput Sequencing Computational Standards for Regulatory Sciences (HTS-CSRS) initiative in the BioCompute Objects Portal (BOP), a web portal to serve as a collaborative ground to encourage a dialogue to facilitate interoperability between different bioinformatic pipelines, industries, and developers. HIVE capabilities have been leveraged to support the development of the BCO. The BCO is versatile and adaptable to other common HTS analysis platforms. [https://docs.google.com/document/d/1WQFZm_PFiQXob4NyOKq6y-2ywnbmNoFHSS27fYf3l4Y/edit?tab=t.0#heading=h.bs8eki17tykx More here].<br><br>
[https://wiki.biocomputeobject.org/Main_Page BIOCOMPUTE OBJECTS WIKI]
</div>
</div>
     </div>
     </div>
Line 27: Line 29:
         <h3>[https://www.glygen.org/ GlyGen]</h3>
         <h3>[https://www.glygen.org/ GlyGen]</h3>
<div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
<div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
GlyGen (gly-glycobiology; gen-information), is an advanced glycoinformatics resource developed to facilitate discovery in basic and translational glycobiology research along with enhancing the integration of multidisciplinary information from diverse resources. GlyGen includes knowledge about molecular, biophysical and functional properties of glycans, genes, and proteins organized in pathways and ontologies, plus a rapidly growing body of biological big data related to cancer mutation and expression. GlyGen adopts an innovative user-driven approach for implementing, prioritizing and knowledge disseminating tools to address the questions and needs of glycobiology community.
GlyGen (gly-glycobiology; gen-information), [https://www.glygen.org/<nowiki>] is an advanced glycoinformatics resource developed to facilitate discovery in basic and translational glycobiology research along with enhancing the integration of multidisciplinary information from diverse resources. GlyGen includes knowledge about molecular, biophysical and functional properties of glycans, genes, and proteins organized in pathways and ontologies, plus a rapidly growing body of biological big data related to cancer mutation and expression. GlyGen adopts an innovative user-driven approach for implementing, prioritizing and knowledge disseminating tools to address the questions and needs of glycobiology community. GlyGen is funded by the National Institute of General Medical Sciences under the grant # 1R24GM146616 - 01 and the  National Institutes of Health Office of Strategic Coordination - The Common Fund under the grant # 1OT2OD032092. More information about GlyGen - </nowiki>https://www.glygen.org/about/ <br><br>
[https://wiki.glygen.org/Main_Page GlyGen WIKI]
         </div>
         </div>
     </div>
     </div>
Line 34: Line 37:
         <h3>[https://hivelab.biochemistry.gwu.edu/predictmod PredictMod]</h3>
         <h3>[https://hivelab.biochemistry.gwu.edu/predictmod PredictMod]</h3>
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
PredictMod is an application designed to predict the outcome of an intervention prior to a patient initiating treatment. Our goal is to provide clinicians with a powerful decision making tool that enhances clinical understanding of patient-level data. The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition. While our primary condition of interest is Prediabetes, the tool is designed to be used for a variety of conditions, interventions, and data types.
PredictMod is an application designed to predict the outcome of an intervention prior to a patient initiating treatment. Our goal is to provide clinicians with a powerful decision making tool that enhances clinical understanding of patient-level data. The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition. While our primary condition of interest is Prediabetes, the tool is designed to be used for a variety of conditions, interventions, and data types. <br> <br>
[https://hivelab.biochemistry.gwu.edu/wiki/PredictMod PredictMod WIKI]
</div>
</div>
     </div>
     </div>
Line 40: Line 44:
         <h3>[[GW-FEAST]]</h3>
         <h3>[[GW-FEAST]]</h3>
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
The GW Federated Ecosystems for Analytics and Standardized Technologies (GW-FEAST) project is part of the ARPA-H FEAST performer team initiative that includes academic and industry partners. The goal of the ARPA-H performer teams is “to create bridges across data silos to make health data more accessible and usable”.
The GW Federated Ecosystems for Analytics and Standardized Technologies (GW-FEAST) project is part of the ARPA-H FEAST performer team initiative that includes academic and industry partners. The goal of the ARPA-H performer teams is “to create bridges across data silos to make health data more accessible and usable”. <br><br>
[https://hivelab.biochemistry.gwu.edu/wiki/GW-FEAST GW-FEAST WIKI]
</div>
</div>
     </div>
     </div>
     <div style="flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC; padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;">
     <div style="flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC; padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;">
         <h3>[https://hivelab.tst.biochemistry.gwu.edu/biomarker-partnership Biomarker Partnership]</h3>
         <h3>[https://hivelab.tst.biochemistry.gwu.edu/biomarker-partnership Biomarker Knowledgebase]</h3>
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
         <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
The Biomarker Partnership is a CFDE sponsored project to develop a knowledgebase that will organize and integrate biomarker data from different public sources. The data will be connected to contextual information to show a novel systems-level view of biomarkers. The motivation for this project is to improve the harmonization and organization of biomarker data. This will be done by mapping biomarkers from public sources to, and across, CF data elements. This mapping will bridge knowledge across multiple DCCs and biomedical disciplines.
The Biomarker Partnership is a CFDE sponsored project to develop a knowledgebase that will organize and integrate biomarker data from different public sources. The data will be connected to contextual information to show a novel systems-level view of biomarkers. The motivation for this project is to improve the harmonization and organization of biomarker data. This will be done by mapping biomarkers from public sources to, and across, CF data elements. This mapping will bridge knowledge across multiple DCCs and biomedical disciplines.<br><br>
[https://wiki.biomarkerkb.org/Main_Page BioMarkerKB WIKI]
</div>
</div>
     </div>
     </div>
Line 88: Line 94:
     </div>
     </div>


</div>
{{DISPLAYTITLE:<span style="position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);">{{FULLPAGENAME}}</span>}}
__NOTOC__
<!-- BANNER ACROSS TOP OF PAGE -->
<div id="ggw-topbanner" style="clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;">
    <div style="margin:0.4em; text-align:center;">
        <div style="font-size:160%; padding:.1em;">RESOURCES</div>
        </div>
    </div>
</div>
<div style="clear: both;"></div>
<div style="clear: both;">
        <div style="border-top: 1px solid #CCC; padding-top: 0.5em;"><div id="ggw_row3" style="display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;">
    <div style="flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC; padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;">
        <h3>[[Tool Resources]]</h3>
        <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
&nbsp;&nbsp;&nbsp;&nbsp;'''''Main article:''' [[Tool Resources]]''<br>There are a variety of bioinformatic tool resources developed by our team.
<br />
</div>
    </div>
</div>
</div>
<div style="clear: both;">
        <div style="border-top: 1px solid #CCC; padding-top: 0.5em;"><div id="ggw_row3" style="display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;">
    <div style="flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC; padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;">
        <h3>[[Dataset Resources]]</h3>
        <div style="border-top: 1px solid #CCC; padding-top: 0.5em;">
&nbsp;&nbsp;&nbsp;&nbsp;'''''Main article:''' [[Dataset Resources]]''<br>There are a variety of bioinformatic dataset resources integrated by our team.
</div>
    </div>
</div>
</div>
</div>

Latest revision as of 19:16, 17 December 2024


Current Projects

The High-performance Integrated Virtual Environment (HIVE) platform

HIVE is a cloud-based environment optimized for the storage and analysis of extra-large data, such as biomedical data, clinical data, next-generation sequencing (NGS) data, mass spectrometry files, confocal microscopy images, post-market surveillance data, medical recall data, and many others. HIVE provides secure web access for authorized users to deposit, retrieve, annotate and compute on Big Data, and analyze the outcomes using web user interfaces. More here.

HIVE LAB WIKI

BioCompute Objects (BCO)

The BioCompute is FDA funded project to establish a framework for community-based development of standards for harmonization of High-throughput Sequencing (HTS), standardization of data formats, promotion of interoperability, and bioinformatics verification protocols. The BioCompute Object (BCO) was developed in the High-throughput Sequencing Computational Standards for Regulatory Sciences (HTS-CSRS) initiative in the BioCompute Objects Portal (BOP), a web portal to serve as a collaborative ground to encourage a dialogue to facilitate interoperability between different bioinformatic pipelines, industries, and developers. HIVE capabilities have been leveraged to support the development of the BCO. The BCO is versatile and adaptable to other common HTS analysis platforms. More here.

BIOCOMPUTE OBJECTS WIKI

GlyGen

GlyGen (gly-glycobiology; gen-information), [https://www.glygen.org/] is an advanced glycoinformatics resource developed to facilitate discovery in basic and translational glycobiology research along with enhancing the integration of multidisciplinary information from diverse resources. GlyGen includes knowledge about molecular, biophysical and functional properties of glycans, genes, and proteins organized in pathways and ontologies, plus a rapidly growing body of biological big data related to cancer mutation and expression. GlyGen adopts an innovative user-driven approach for implementing, prioritizing and knowledge disseminating tools to address the questions and needs of glycobiology community. GlyGen is funded by the National Institute of General Medical Sciences under the grant # 1R24GM146616 - 01 and the National Institutes of Health Office of Strategic Coordination - The Common Fund under the grant # 1OT2OD032092. More information about GlyGen - https://www.glygen.org/about/

GlyGen WIKI

PredictMod

PredictMod is an application designed to predict the outcome of an intervention prior to a patient initiating treatment. Our goal is to provide clinicians with a powerful decision making tool that enhances clinical understanding of patient-level data. The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition. While our primary condition of interest is Prediabetes, the tool is designed to be used for a variety of conditions, interventions, and data types.

PredictMod WIKI

GW-FEAST

The GW Federated Ecosystems for Analytics and Standardized Technologies (GW-FEAST) project is part of the ARPA-H FEAST performer team initiative that includes academic and industry partners. The goal of the ARPA-H performer teams is “to create bridges across data silos to make health data more accessible and usable”.

GW-FEAST WIKI

Biomarker Knowledgebase

The Biomarker Partnership is a CFDE sponsored project to develop a knowledgebase that will organize and integrate biomarker data from different public sources. The data will be connected to contextual information to show a novel systems-level view of biomarkers. The motivation for this project is to improve the harmonization and organization of biomarker data. This will be done by mapping biomarkers from public sources to, and across, CF data elements. This mapping will bridge knowledge across multiple DCCs and biomedical disciplines.

BioMarkerKB WIKI


Past Projects

Gut Microbiome Analytic System (Microbiome)

The HIVE team received NSF funding to develop a Gut Microbiome Monitoring System (GutFeeling) as a tool which when used over time will allow users to rectify their dietary (such as consumption of probiotics and prebiotics) and other lifestyle habits and to help restore their normal microbiome. Rapid analysis of the large amount of metagenomic data, a major bottleneck, has been resolved by our group through the development of a novel algorithm and accompanying software called CensuScope. Through analysis of healthy gut microbiome data, we are actively developing a Knowledge Base (GutFeelingKB) to provide a clearer picture of not only an ideal personalized microbiome but also establish baseline characteristics for each customer. The Mazumder Lab is collaborating with the Milken School of Public Health and Kamtek Sequencing Facility to investigate the relationship between bacterial species commonly present in the digestive tract, diet, physical activity, lifestyle habits, and metabolic risk factors. More here.

HIVE-EQAPOL Project on HIVE NGS Data Processing and Analysis

For this project, our group works closely with the External Quality Assurance Program Oversight Laboratory (EQAPOL) team to conduct HIV NGS data analysis and collaborate in terms of analyzing, storing, and tracking HIV NGS Data. Reliable identification of strains is critical for developing new assays, validating assay platforms, assisting regulators to evaluate test kits, monitoring HIV drug resistance, and informing vaccine development. The HIVE tools and platform are used for virus identification, recombination analysis, and clone discovery.

OncoMX

The OncoMX mission is to create an integrated cancer mutation and expression resource for exploring cancer biomarkers. OncoMX is a collaboration between the George Washington University (GW), NASA's Jet Propulsion Laboratory (JPL), the Swiss Institute of Bioinformatics (SIB), and the University of Delaware (UD). The core knowledgebase of OncoMX is derived from BioMuta and BioXpress integrated cancer mutation and expression databases. Normal expression data from Bgee and custom text mining software augment the cancer data to improve functional interpretation of the reported variants and expression profiles. All data are wrapped into the OncoMX database and web portal, mapped to additional functional information from NCI Early Detection Research Network (EDRN) and Reactome. It is expected that the large-scale integration of cancer data and supporting information, provided by OncoMX with direct community feedback, will benefit cancer research by improving synthesis of information and may make earlier detection a reality.

Glycoproteomics Characterization Workflow and Data-Analysis Pipeline for Vaccines and Biosimilars

In this FDA funded project we are extending High-performance Integrated Virtual Environment (HIVE) capabilities through the development and integration of software tools and datasets for comparative analysis of glycoproteins. Glycomic analysis has many angles and has been extensively reviewed in recent literature. We propose to rely on the independent development of the glycomics field and incorporate these approaches in the HIVE pipeline as they mature while we develop a standardized glycoinformatics pipeline that will benefit investigators and regulators at the FDA.


RESOURCES

Tool Resources

    Main article: Tool Resources
There are a variety of bioinformatic tool resources developed by our team.

Dataset Resources

    Main article: Dataset Resources
There are a variety of bioinformatic dataset resources integrated by our team.