<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://hivelab.biochemistry.gwu.edu/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Hivelabwikiadmin</id>
	<title>HIVE Lab - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://hivelab.biochemistry.gwu.edu/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Hivelabwikiadmin"/>
	<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/Special:Contributions/Hivelabwikiadmin"/>
	<updated>2026-04-17T12:41:03Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.42.1</generator>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1251</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1251"/>
		<updated>2026-04-14T16:52:31Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Spring 2026 Symposium */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Kate Warner, Urnisha Bhuiyan &lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding; however, the data contained within them remains highly valuable to the research community. Integrating these legacy datasets into modern databases or knowledgebases, such as GlyGen, presents a significant challenge because much of the associated metadata (e.g., species, tissue, disease, cell line) is recorded as free-text that does not conform to the standardized dictionaries and ontologies used by current resources.&lt;br /&gt;
&lt;br /&gt;
To address this challenge, this project will leverage large language models (LLMs) to automate the mapping of free-text metadata from legacy databases, specifically CarbBank and CFG, to standardized accessions in authoritative resources such as NCBI Taxonomy, Disease Ontology, and Cellosaurus. The LLM-based workflow will identify and normalize synonyms, abbreviations, and spelling variants (e.g., “human,” “man,” or “h. sapiens” mapped to Homo sapiens), enabling scalable and reproducible metadata harmonization that would otherwise require extensive manual curation. The LLM tasks will be performed using OpenAI resources integrated into the GlyGen curation pipeline. The project involves the development of Python scripts to read and write data, invoke the OpenAI API and compare results with manual curated data. Another aspect of the work is the development and finetunning of a prompt for ChatGPT to ensure reliable and accurate mapping is produced. &lt;br /&gt;
&lt;br /&gt;
While the mapping process will be largely automated, manual validation will be incorporated as a quality-control step to assess model performance, verify correctness, and identify edge cases requiring refinement. This hybrid approach significantly reduces curator burden while ensuring high-quality, ontology-aligned annotations.&lt;br /&gt;
&lt;br /&gt;
The goal of this effort is to migrate and modernize datasets from CarbBank and CFG, making them interoperable with GlyGen and other contemporary glycoinformatics resources through a scalable, AI-assisted curation strategy.&lt;br /&gt;
&lt;br /&gt;
For any questions, please contact Rene Ranzinger (rene@ccrc.uga.edu) or Kate Warner (k.warner1@email.gwu.edu). &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod; GlyGen&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang**&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside; Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi**&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;nowiki&amp;gt;**&amp;lt;/nowiki&amp;gt;Not directly involved in the semester curriculum; long-term volunteer.&lt;br /&gt;
&lt;br /&gt;
== Spring 2026 Symposium ==&lt;br /&gt;
The Spring symposium will be held virtually.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Date:&#039;&#039;&#039; April 15th, 2026 (Wednesday)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Time:&#039;&#039;&#039; 4 - 6 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Zoom Link:&#039;&#039;&#039; [https://gwu-edu.zoom.us/j/93790551366?pwd=C0aN4b95CUbxahO9By6pTj35D9lFIx.1&amp;amp;jst=2#success https://gwu-edu.zoom.us/j/93790551366?pwd=C0aN4b95CUbxahO9By6pTj35D9lFIx.1&amp;amp;jst=2]&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
!Time&lt;br /&gt;
!Project&lt;br /&gt;
!Presentation Title&lt;br /&gt;
!Presenter(s)&lt;br /&gt;
|-&lt;br /&gt;
|4:00-4:05 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Welcome &amp;amp; Introduction&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|-&lt;br /&gt;
|4:05-4:30 PM&lt;br /&gt;
|GlyGen&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Diya Kamalabharathy; Isaac Kim&lt;br /&gt;
|-&lt;br /&gt;
|4:30-4:45 PM&lt;br /&gt;
|ARGOS&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|-&lt;br /&gt;
|4:45-5:10 PM&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Vishal Muthusekaran; Conner Cognata&lt;br /&gt;
|-&lt;br /&gt;
|5:10-5:30 PM&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 15 + 5 mins QA - group presentation &lt;br /&gt;
|Diya Kamalabharathy; Sampurna Chakravorty; Ashley Tien&lt;br /&gt;
|-&lt;br /&gt;
|5:30-5:45PM&lt;br /&gt;
| PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|-&lt;br /&gt;
|5:45-6:00 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Remarks&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1250</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1250"/>
		<updated>2026-04-14T16:51:38Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Agenda (All times are in Eastern Standard Time) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Kate Warner, Urnisha Bhuiyan &lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding; however, the data contained within them remains highly valuable to the research community. Integrating these legacy datasets into modern databases or knowledgebases, such as GlyGen, presents a significant challenge because much of the associated metadata (e.g., species, tissue, disease, cell line) is recorded as free-text that does not conform to the standardized dictionaries and ontologies used by current resources.&lt;br /&gt;
&lt;br /&gt;
To address this challenge, this project will leverage large language models (LLMs) to automate the mapping of free-text metadata from legacy databases, specifically CarbBank and CFG, to standardized accessions in authoritative resources such as NCBI Taxonomy, Disease Ontology, and Cellosaurus. The LLM-based workflow will identify and normalize synonyms, abbreviations, and spelling variants (e.g., “human,” “man,” or “h. sapiens” mapped to Homo sapiens), enabling scalable and reproducible metadata harmonization that would otherwise require extensive manual curation. The LLM tasks will be performed using OpenAI resources integrated into the GlyGen curation pipeline. The project involves the development of Python scripts to read and write data, invoke the OpenAI API and compare results with manual curated data. Another aspect of the work is the development and finetunning of a prompt for ChatGPT to ensure reliable and accurate mapping is produced. &lt;br /&gt;
&lt;br /&gt;
While the mapping process will be largely automated, manual validation will be incorporated as a quality-control step to assess model performance, verify correctness, and identify edge cases requiring refinement. This hybrid approach significantly reduces curator burden while ensuring high-quality, ontology-aligned annotations.&lt;br /&gt;
&lt;br /&gt;
The goal of this effort is to migrate and modernize datasets from CarbBank and CFG, making them interoperable with GlyGen and other contemporary glycoinformatics resources through a scalable, AI-assisted curation strategy.&lt;br /&gt;
&lt;br /&gt;
For any questions, please contact Rene Ranzinger (rene@ccrc.uga.edu) or Kate Warner (k.warner1@email.gwu.edu). &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod; GlyGen&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang**&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside; Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi**&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;nowiki&amp;gt;**&amp;lt;/nowiki&amp;gt;Not directly involved in the semester curriculum; long-term volunteer.&lt;br /&gt;
&lt;br /&gt;
== Spring 2026 Symposium ==&lt;br /&gt;
The Spring symposium will be held virtually.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Date:&#039;&#039;&#039; April 15th, 2026 (Wednesday)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Time:&#039;&#039;&#039; 4 - 6 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Zoom Link:&#039;&#039;&#039; https://gwu-edu.zoom.us/j/93790551366?pwd=C0aN4b95CUbxahO9By6pTj35D9lFIx.1&amp;amp;jst=2#success&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
!Time&lt;br /&gt;
!Project&lt;br /&gt;
!Presentation Title&lt;br /&gt;
!Presenter(s)&lt;br /&gt;
|-&lt;br /&gt;
|4:00-4:05 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Welcome &amp;amp; Introduction&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|-&lt;br /&gt;
|4:05-4:30 PM&lt;br /&gt;
|GlyGen&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Diya Kamalabharathy; Isaac Kim&lt;br /&gt;
|-&lt;br /&gt;
|4:30-4:45 PM&lt;br /&gt;
|ARGOS&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|-&lt;br /&gt;
|4:45-5:10 PM&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Vishal Muthusekaran; Conner Cognata&lt;br /&gt;
|-&lt;br /&gt;
|5:10-5:30 PM&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 15 + 5 mins QA - group presentation &lt;br /&gt;
|Diya Kamalabharathy; Sampurna Chakravorty; Ashley Tien&lt;br /&gt;
|-&lt;br /&gt;
|5:30-5:45PM&lt;br /&gt;
| PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|-&lt;br /&gt;
|5:45-6:00 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Remarks&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1175</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1175"/>
		<updated>2026-03-10T17:24:25Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Kate Warner, Urnisha Bhuiyan &lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding; however, the data contained within them remains highly valuable to the research community. Integrating these legacy datasets into modern databases or knowledgebases, such as GlyGen, presents a significant challenge because much of the associated metadata (e.g., species, tissue, disease, cell line) is recorded as free-text that does not conform to the standardized dictionaries and ontologies used by current resources.&lt;br /&gt;
&lt;br /&gt;
To address this challenge, this project will leverage large language models (LLMs) to automate the mapping of free-text metadata from legacy databases, specifically CarbBank and CFG, to standardized accessions in authoritative resources such as NCBI Taxonomy, Disease Ontology, and Cellosaurus. The LLM-based workflow will identify and normalize synonyms, abbreviations, and spelling variants (e.g., “human,” “man,” or “h. sapiens” mapped to Homo sapiens), enabling scalable and reproducible metadata harmonization that would otherwise require extensive manual curation. The LLM tasks will be performed using OpenAI resources integrated into the GlyGen curation pipeline. The project involves the development of Python scripts to read and write data, invoke the OpenAI API and compare results with manual curated data. Another aspect of the work is the development and finetunning of a prompt for ChatGPT to ensure reliable and accurate mapping is produced. &lt;br /&gt;
&lt;br /&gt;
While the mapping process will be largely automated, manual validation will be incorporated as a quality-control step to assess model performance, verify correctness, and identify edge cases requiring refinement. This hybrid approach significantly reduces curator burden while ensuring high-quality, ontology-aligned annotations.&lt;br /&gt;
&lt;br /&gt;
The goal of this effort is to migrate and modernize datasets from CarbBank and CFG, making them interoperable with GlyGen and other contemporary glycoinformatics resources through a scalable, AI-assisted curation strategy.&lt;br /&gt;
&lt;br /&gt;
For any questions, please contact Rene Ranzinger (rene@ccrc.uga.edu) or Kate Warner (k.warner1@email.gwu.edu). &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod; GlyGen&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang**&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside; Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi**&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;nowiki&amp;gt;**&amp;lt;/nowiki&amp;gt;Not directly involved in the semester curriculum; long-term volunteer.&lt;br /&gt;
&lt;br /&gt;
== Spring 2026 Symposium ==&lt;br /&gt;
The Spring symposium will be held virtually.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Date:&#039;&#039;&#039; April 15th, 2026 (Wednesday)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Time:&#039;&#039;&#039; 4 - 6 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Zoom Link&#039;&#039;&#039; - TBA&lt;br /&gt;
&lt;br /&gt;
=== Agenda (All times are in Eastern Standard Time) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
!Time&lt;br /&gt;
!Project&lt;br /&gt;
!Presentation Title&lt;br /&gt;
!Presenter(s)&lt;br /&gt;
|-&lt;br /&gt;
|4:00-4:05 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Welcome &amp;amp; Introduction&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|-&lt;br /&gt;
|4:05-4:30 PM&lt;br /&gt;
|GlyGen&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Diya Kamalabharathy; Isaac Kim&lt;br /&gt;
|-&lt;br /&gt;
|4:30-4:45 PM&lt;br /&gt;
|ARGOS&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|-&lt;br /&gt;
|4:45-5:10 PM&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Vishal Muthusekaran; Conner Cognata&lt;br /&gt;
|-&lt;br /&gt;
|5:10-5:30 PM&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 15 + 5 mins QA - group presentation &lt;br /&gt;
|Diya Kamalabharathy; Sampurna Chakravorty; Ashley Tien&lt;br /&gt;
|-&lt;br /&gt;
|5:30-5:55PM&lt;br /&gt;
| -&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Vishal Bakshi; Miao Wang&lt;br /&gt;
|-&lt;br /&gt;
|5:55-6:00 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Remarks&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1168</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1168"/>
		<updated>2026-03-04T19:36:52Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Kate Warner, Urnisha Bhuiyan &lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding; however, the data contained within them remains highly valuable to the research community. Integrating these legacy datasets into modern databases or knowledgebases, such as GlyGen, presents a significant challenge because much of the associated metadata (e.g., species, tissue, disease, cell line) is recorded as free-text that does not conform to the standardized dictionaries and ontologies used by current resources.&lt;br /&gt;
&lt;br /&gt;
To address this challenge, this project will leverage large language models (LLMs) to automate the mapping of free-text metadata from legacy databases, specifically CarbBank and CFG, to standardized accessions in authoritative resources such as NCBI Taxonomy, Disease Ontology, and Cellosaurus. The LLM-based workflow will identify and normalize synonyms, abbreviations, and spelling variants (e.g., “human,” “man,” or “h. sapiens” mapped to Homo sapiens), enabling scalable and reproducible metadata harmonization that would otherwise require extensive manual curation. The LLM tasks will be performed using OpenAI resources integrated into the GlyGen curation pipeline. The project involves the development of Python scripts to read and write data, invoke the OpenAI API and compare results with manual curated data. Another aspect of the work is the development and finetunning of a prompt for ChatGPT to ensure reliable and accurate mapping is produced. &lt;br /&gt;
&lt;br /&gt;
While the mapping process will be largely automated, manual validation will be incorporated as a quality-control step to assess model performance, verify correctness, and identify edge cases requiring refinement. This hybrid approach significantly reduces curator burden while ensuring high-quality, ontology-aligned annotations.&lt;br /&gt;
&lt;br /&gt;
The goal of this effort is to migrate and modernize datasets from CarbBank and CFG, making them interoperable with GlyGen and other contemporary glycoinformatics resources through a scalable, AI-assisted curation strategy.&lt;br /&gt;
&lt;br /&gt;
For any questions, please contact Rene Ranzinger (rene@ccrc.uga.edu) or Kate Warner (k.warner1@email.gwu.edu). &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod; GlyGen&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang**&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside; Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi**&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;nowiki&amp;gt;**&amp;lt;/nowiki&amp;gt;Not directly involved in the semester curriculum; long-term volunteer.&lt;br /&gt;
&lt;br /&gt;
== Spring 2026 Symposium ==&lt;br /&gt;
The Spring symposium will be held virtually.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Date:&#039;&#039;&#039; April 8th, 2026 (Wednesday)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Time:&#039;&#039;&#039; 4 - 6 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Zoom Link&#039;&#039;&#039; - TBA&lt;br /&gt;
&lt;br /&gt;
=== Agenda (All times are in Eastern Standard Time) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
!Time&lt;br /&gt;
!Project&lt;br /&gt;
!Presentation Title&lt;br /&gt;
!Presenter(s)&lt;br /&gt;
|-&lt;br /&gt;
|4:00-4:05 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Welcome &amp;amp; Introduction&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|-&lt;br /&gt;
|4:05-4:30 PM&lt;br /&gt;
|GlyGen&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Diya Kamalabharathy; Isaac Kim&lt;br /&gt;
|-&lt;br /&gt;
|4:30-4:45 PM&lt;br /&gt;
|ARGOS&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|-&lt;br /&gt;
|4:45-5:10 PM&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Vishal Muthusekaran; Conner Cognata&lt;br /&gt;
|-&lt;br /&gt;
|5:10-5:30 PM&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 15 + 5 mins QA - group presentation &lt;br /&gt;
|Diya Kamalabharathy; Sampurna Chakravorty; Ashley Tien&lt;br /&gt;
|-&lt;br /&gt;
|5:30-5:55PM&lt;br /&gt;
| -&lt;br /&gt;
|&lt;br /&gt;
* 8 + 5 mins QA - presentation #1&lt;br /&gt;
* 8 + 5 mins QA - presentation #2&lt;br /&gt;
|Vishal Bakshi; Miao Wang&lt;br /&gt;
|-&lt;br /&gt;
|5:55-6:00 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Remarks&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1145</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1145"/>
		<updated>2026-02-10T19:16:59Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Kate Warner, Urnisha Bhuiyan &lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding; however, the data contained within them remains highly valuable to the research community. Integrating these legacy datasets into modern databases or knowledgebases, such as GlyGen, presents a significant challenge because much of the associated metadata (e.g., species, tissue, disease, cell line) is recorded as free-text that does not conform to the standardized dictionaries and ontologies used by current resources.&lt;br /&gt;
&lt;br /&gt;
To address this challenge, this project will leverage large language models (LLMs) to automate the mapping of free-text metadata from legacy databases, specifically CarbBank and CFG, to standardized accessions in authoritative resources such as NCBI Taxonomy, Disease Ontology, and Cellosaurus. The LLM-based workflow will identify and normalize synonyms, abbreviations, and spelling variants (e.g., “human,” “man,” or “h. sapiens” mapped to Homo sapiens), enabling scalable and reproducible metadata harmonization that would otherwise require extensive manual curation. The LLM tasks will be performed using OpenAI resources integrated into the GlyGen curation pipeline. The project involves the development of Python scripts to read and write data, invoke the OpenAI API and compare results with manual curated data. Another aspect of the work is the development and finetunning of a prompt for ChatGPT to ensure reliable and accurate mapping is produced. &lt;br /&gt;
&lt;br /&gt;
While the mapping process will be largely automated, manual validation will be incorporated as a quality-control step to assess model performance, verify correctness, and identify edge cases requiring refinement. This hybrid approach significantly reduces curator burden while ensuring high-quality, ontology-aligned annotations.&lt;br /&gt;
&lt;br /&gt;
The goal of this effort is to migrate and modernize datasets from CarbBank and CFG, making them interoperable with GlyGen and other contemporary glycoinformatics resources through a scalable, AI-assisted curation strategy.&lt;br /&gt;
&lt;br /&gt;
For any questions, please contact Rene Ranzinger (rene@ccrc.uga.edu) or Kate Warner (k.warner1@email.gwu.edu). &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1141</id>
		<title>Tradewinds Solutions Marketplace Awardable Status</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1141"/>
		<updated>2026-01-14T19:27:56Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[File:Awardable logo.png|thumb|227x227px|Mazumder Research Group at the George Washington University Designated “Awardable” Vendor for Department of Defense Chief Digital and Artificial Intelligence Office’s Tradewinds Solutions Marketplace]]&lt;br /&gt;
&#039;&#039;&#039;Washington, D.C. – July 11, 2025 –&#039;&#039;&#039; Mazumder Research Group at the George Washington University (GWU), a leading provider of scalable ML/AI technology for biomedical data analysis and intervention outcome prediction&#039;&#039;&#039;,&#039;&#039;&#039; today announced that it has achieved &#039;&#039;&#039;“Awardable” status&#039;&#039;&#039; through the Chief Digital and Artificial Intelligence Office’s (CDAO) Tradewinds Solutions Marketplace.&lt;br /&gt;
&lt;br /&gt;
The Tradewinds Solutions Marketplace is the premier offering of Tradewinds, the Department of Defense’s (DoD’s) suite of tools and services designed to accelerate the procurement and adoption of Artificial Intelligence (AI)/Machine Learning (ML), data, and analytics capabilities.&lt;br /&gt;
&lt;br /&gt;
&amp;quot;&#039;&#039;We are excited that [[GW-FEAST|Federated Ecosystems for Analytics and Standardized Technologies (FEAST)]] has achieved awardable status on the Tradewinds Marketplace&#039;&#039;,&amp;quot; said Raja Mazumder, Principal Investigator of FEAST and Professor at GWU. &amp;quot;&#039;&#039;This recognition highlights the potential of our ML/AI platform to transform how healthcare interventions are guided through predictive analytics. It also increases our visibility within the government ecosystem. Being part of the Tradewinds Marketplace also creates valuable opportunities to connect with teams across agencies, academia, and industry. We look forward to working with DoD and collaborating with other innovators in the ML and AI space&#039;&#039;.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Mazumder Group’s solutions are designed to enable secure, scalable intervention outcome modeling and the creation of federated data ecosystems for the DoD and beyond. They are used by a wide range of businesses, including academic institutions, biomedical researchers and clinicians.&lt;br /&gt;
&lt;br /&gt;
[https://cdao.appiancloud.us/suite/sites/tsm-submission-portal/record/lQBHZiMy1aRWfjCLrPMB2tlYAIndwC-PEG_POB-JBeM3S5M81ZNZcoZdN1Mdglj-3XfZ_xPqB2yw6QbyXedKBVNAtvS-o5iqAm5Ruky826PflWM0Ws/view/summary. Mazumder Group&#039;s video], &#039;&#039;&#039;AI/ML forecasting and federated data ecosystems: machine learning for improving warfighter readiness, health, and recovery&#039;&#039;&#039;, accessible only by government customers on the Tradewinds Solutions Marketplace, presents an actual use case in which the group demonstrates a cloud-based federated data ecosystem designed to predict clinical outcomes and generate interpretable machine learning models across secure, siloed datasets.&lt;br /&gt;
&lt;br /&gt;
Mazumder Group was recognized among a competitive field of applicants to the Tradewinds Solutions Marketplace whose solutions demonstrated innovation, scalability, and potential impact on DoD missions. Government customers interested in viewing the video solution can create a Tradewinds Solutions Marketplace account at [http://tradewindAI.com tradewindAI.com].&lt;br /&gt;
&lt;br /&gt;
== About the Tradewinds Solutions Marketplace ==&lt;br /&gt;
The Tradewinds Solutions Marketplace is a digital repository of post-competition, readily awardable pitch videos that address the Department of Defense’s (DoD) most significant challenges in the Artificial Intelligence/Machine Learning (AI/ML), data, and analytics space. All awardable solutions have been assessed through complex scoring rubrics and competitive procedures and are available to Government customers with a Marketplace account. Government customers can create an account at www.tradewindai.com. Tradewinds is housed in the DoD’s Chief Digital Artificial Intelligence Office.&lt;br /&gt;
&lt;br /&gt;
== About Mazumder Group ==&lt;br /&gt;
The Mazumder Research Group at The George Washington University (GWU) is involved in developing the High‑performance Integrated Virtual Environment (HIVE) which is a cloud‑based bioinformatics platform co‑created with the FDA for analyzing large omics and clinical datasets. The team also leads efforts in defining bioinformatics communication standards, and builds knowledgebases focused on glycoinformatics (GlyGen), cancer biomarkers (BiomarkerKB, OncoMX, BioMuta, BioXpress), and microbiome analysis (GutFeeling KB). The group uses knowledge graphs and advanced AI/ML methods to analyze data and uncover valuable insights from clinical records, omics research, and scientific literature. Additional information is available at [[Main Page|HIVE Lab]].&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1140</id>
		<title>Tradewinds Solutions Marketplace Awardable Status</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1140"/>
		<updated>2026-01-14T19:26:45Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[File:Awardable logo.png|thumb|227x227px|Mazumder Research Group at the George Washington University Designated “Awardable” Vendor for Department of Defense Chief Digital and Artificial Intelligence Office’s Tradewinds Solutions Marketplace]]&lt;br /&gt;
&#039;&#039;&#039;Washington, D.C. – July 11, 2025 –&#039;&#039;&#039; Mazumder Research Group at the George Washington University (GWU), a leading provider of scalable ML/AI technology for biomedical data analysis and intervention outcome prediction&#039;&#039;&#039;,&#039;&#039;&#039; today announced that it has achieved &#039;&#039;&#039;“Awardable” status&#039;&#039;&#039; through the Chief Digital and Artificial Intelligence Office’s (CDAO) Tradewinds Solutions Marketplace.&lt;br /&gt;
&lt;br /&gt;
The Tradewinds Solutions Marketplace is the premier offering of Tradewinds, the Department of Defense’s (DoD’s) suite of tools and services designed to accelerate the procurement and adoption of Artificial Intelligence (AI)/Machine Learning (ML), data, and analytics capabilities.&lt;br /&gt;
&lt;br /&gt;
&amp;quot;&#039;&#039;We are excited that Federated Ecosystems for Analytics and Standardized Technologies (FEAST) has achieved awardable status on the Tradewinds Marketplace&#039;&#039;,&amp;quot; said Raja Mazumder, Principal Investigator of FEAST and Professor at GWU. &amp;quot;&#039;&#039;This recognition highlights the potential of our ML/AI platform to transform how healthcare interventions are guided through predictive analytics. It also increases our visibility within the government ecosystem. Being part of the Tradewinds Marketplace also creates valuable opportunities to connect with teams across agencies, academia, and industry. We look forward to working with DoD and collaborating with other innovators in the ML and AI space&#039;&#039;.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Mazumder Group’s solutions are designed to enable secure, scalable intervention outcome modeling and the creation of federated data ecosystems for the DoD and beyond. They are used by a wide range of businesses, including academic institutions, biomedical researchers and clinicians.&lt;br /&gt;
&lt;br /&gt;
[https://cdao.appiancloud.us/suite/sites/tsm-submission-portal/record/lQBHZiMy1aRWfjCLrPMB2tlYAIndwC-PEG_POB-JBeM3S5M81ZNZcoZdN1Mdglj-3XfZ_xPqB2yw6QbyXedKBVNAtvS-o5iqAm5Ruky826PflWM0Ws/view/summary. Mazumder Group&#039;s video], &#039;&#039;&#039;AI/ML forecasting and federated data ecosystems: machine learning for improving warfighter readiness, health, and recovery&#039;&#039;&#039;, accessible only by government customers on the Tradewinds Solutions Marketplace, presents an actual use case in which the group demonstrates a cloud-based federated data ecosystem designed to predict clinical outcomes and generate interpretable machine learning models across secure, siloed datasets.&lt;br /&gt;
&lt;br /&gt;
Mazumder Group was recognized among a competitive field of applicants to the Tradewinds Solutions Marketplace whose solutions demonstrated innovation, scalability, and potential impact on DoD missions. Government customers interested in viewing the video solution can create a Tradewinds Solutions Marketplace account at [http://tradewindAI.com tradewindAI.com].&lt;br /&gt;
&lt;br /&gt;
== About the Tradewinds Solutions Marketplace ==&lt;br /&gt;
The Tradewinds Solutions Marketplace is a digital repository of post-competition, readily awardable pitch videos that address the Department of Defense’s (DoD) most significant challenges in the Artificial Intelligence/Machine Learning (AI/ML), data, and analytics space. All awardable solutions have been assessed through complex scoring rubrics and competitive procedures and are available to Government customers with a Marketplace account. Government customers can create an account at www.tradewindai.com. Tradewinds is housed in the DoD’s Chief Digital Artificial Intelligence Office.&lt;br /&gt;
&lt;br /&gt;
== About Mazumder Group ==&lt;br /&gt;
The Mazumder Research Group at The George Washington University (GWU) is involved in developing the High‑performance Integrated Virtual Environment (HIVE) which is a cloud‑based bioinformatics platform co‑created with the FDA for analyzing large omics and clinical datasets. The team also leads efforts in defining bioinformatics communication standards, and builds knowledgebases focused on glycoinformatics (GlyGen), cancer biomarkers (BiomarkerKB, OncoMX, BioMuta, BioXpress), and microbiome analysis (GutFeeling KB). The group uses knowledge graphs and advanced AI/ML methods to analyze data and uncover valuable insights from clinical records, omics research, and scientific literature. Additional information is available at [[Main Page|HIVE Lab]].&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1139</id>
		<title>Tradewinds Solutions Marketplace Awardable Status</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1139"/>
		<updated>2026-01-14T19:21:53Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[File:Awardable logo.png|thumb|227x227px|Mazumder Research Group at the George Washington University Designated “Awardable” Vendor for Department of Defense Chief Digital and Artificial Intelligence Office’s Tradewinds Solutions Marketplace]]&lt;br /&gt;
Washington, D.C. – July 11, 2025 – &#039;&#039;&#039;Mazumder Research Group at the George Washington University (GWU)&#039;&#039;&#039;, a &#039;&#039;&#039;leading provider&#039;&#039;&#039; of &#039;&#039;&#039;scalable ML/AI technology for biomedical data analysis and intervention outcome prediction,&#039;&#039;&#039; today announced that it has achieved “Awardable” status through the Chief Digital and Artificial Intelligence Office’s (CDAO) Tradewinds Solutions Marketplace.&lt;br /&gt;
&lt;br /&gt;
The Tradewinds Solutions Marketplace is the premier offering of Tradewinds, the Department of Defense’s (DoD’s) suite of tools and services designed to accelerate the procurement and adoption of Artificial Intelligence (AI)/Machine Learning (ML), data, and analytics capabilities.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&amp;quot;We are excited that Federated Ecosystems for Analytics and Standardized Technologies (FEAST) has achieved awardable status on the Tradewinds Marketplace,&amp;quot; said Raja Mazumder, Principal Investigator of FEAST and Professor at GWU. &amp;quot;This recognition highlights the potential of our ML/AI platform to transform how healthcare interventions are guided through predictive analytics. It also increases our visibility within the government ecosystem. Being part of the Tradewinds Marketplace also creates valuable opportunities to connect with teams across agencies, academia, and industry. We look forward to working with DoD and collaborating with other innovators in the ML and AI space.&amp;quot;&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Mazumder Group’s&#039;&#039;&#039; solutions are designed to &#039;&#039;&#039;enable secure, scalable intervention outcome modeling and the creation of federated data ecosystems for the DoD and beyond&#039;&#039;&#039;. They are used by a wide range of businesses, including academic institutions, biomedical researchers and clinicians.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Mazumder Group&#039;s video&#039;&#039;&#039;, &#039;&#039;&#039;AI/ML forecasting and federated data ecosystems: machine learning for improving warfighter readiness, health, and recovery&#039;&#039;&#039;, accessible only by government customers on the Tradewinds Solutions Marketplace, presents an actual use case in which the group &#039;&#039;&#039;demonstrates a cloud-based federated data ecosystem designed to predict clinical outcomes and generate interpretable machine learning models across secure, siloed datasets&#039;&#039;&#039;.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Mazumder Group&#039;&#039;&#039; was recognized among a competitive field of applicants to the Tradewinds Solutions Marketplace whose solutions demonstrated innovation, scalability, and potential impact on DoD missions. Government customers interested in viewing the video solution can create a Tradewinds Solutions Marketplace account at tradewindAI.com.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=File:Awardable_logo.png&amp;diff=1138</id>
		<title>File:Awardable logo.png</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=File:Awardable_logo.png&amp;diff=1138"/>
		<updated>2026-01-14T19:18:12Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Awardable logo&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1137</id>
		<title>Tradewinds Solutions Marketplace Awardable Status</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Tradewinds_Solutions_Marketplace_Awardable_Status&amp;diff=1137"/>
		<updated>2026-01-14T19:15:23Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: Created blank page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1136</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1136"/>
		<updated>2026-01-14T19:11:55Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Yashitha Pobbareddy&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS; GlyGen biocuration; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1135</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1135"/>
		<updated>2026-01-14T19:10:52Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isaac Kim&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod; GlyGen biocuration; ARGOS&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1133</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1133"/>
		<updated>2026-01-13T16:37:58Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|-&lt;br /&gt;
|Venya Gulati&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside&lt;br /&gt;
|ARGOS; PredictMod; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1132</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1132"/>
		<updated>2026-01-13T16:36:35Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/conner-cognata/ Conner Cognata]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB; PredictMod; GlyGen biocuration&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1129</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1129"/>
		<updated>2026-01-12T14:04:41Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 11:00 AM to 12:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning (ML) Modeling Project ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Volunteers will conduct ML modeling using publicly-available -omics datasets that were previously identified (see [[Recommended Publications for Intervention Outcome Prediction Models|https://hivelab.biochemistry.gwu.edu/wiki/Recommended_Publications_for_Intervention_Outcome_Prediction_Models]]). This volunteership will involve data harmonization, model training, and pipeline documentation.&lt;br /&gt;
&lt;br /&gt;
Tasks associated with this project include:&lt;br /&gt;
&lt;br /&gt;
# Exploring and understanding the data found in relevant PMIDs that can be used to train intervention outcome prediction models.&lt;br /&gt;
# Preparing the data for model training and model performance evaluation&lt;br /&gt;
# Testing the modeling tutorial, PredictMod platform, and associated project tools&lt;br /&gt;
# Documentation of the ML pipeline and testing results&lt;br /&gt;
Deliverables for this project include:&lt;br /&gt;
&lt;br /&gt;
# ML-ready datasets&lt;br /&gt;
# Trained model scripts&lt;br /&gt;
# Pipeline documentation captured in BioCompute Objects (BCOs) and testing reports&lt;br /&gt;
# Volunteership documentation (final report or weekly progress reports)&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and a final presentation of your work.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Maria Kim; Cyrus Yeung; Jeet Vora&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1111</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1111"/>
		<updated>2026-01-05T15:11:56Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* 1. BiomarkerKB Biocuration Project Ideas */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
::: The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and weekly 1-2 paragraph reports.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1110</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1110"/>
		<updated>2026-01-05T15:09:53Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* 1. BiomarkerKB Biocuration Project Ideas */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease or for a treatment&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on NLP/LLM methods.&lt;br /&gt;
# Continue working on LLM methods started by volunteers in Fall 2025.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and weekly 1-2 paragraph reports.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer; Urnisha Bhuiyan; Rene Ranzinger&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1107</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1107"/>
		<updated>2025-12-09T15:31:33Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Maria Kim, Cyrus Yeung, Jeet Vora&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and weekly 1-2 paragraph reports.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1106</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1106"/>
		<updated>2025-12-05T19:21:56Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and weekly 1-2 paragraph reports.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to the top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1105</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1105"/>
		<updated>2025-12-05T19:15:51Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# 30-minute Zoom meetings (during regular work hours) once a week or every other week with the assigned project point of contact (POC).&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen. &amp;lt;u&amp;gt;We are also looking for individuals who have previously worked with us to take on a coordinator role&amp;lt;/u&amp;gt;.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu. Please note that this project requires attendance at biweekly meetings and weekly 1-2 paragraph reports.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note:&#039;&#039; For anyone interested in ARGOS, you may be assigned to another project of your choice. This project is contingent on a contract extension. Please complete your project selection in order of preference.&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
Qualifications: basic/medium programming skills, knowledgeable of basic bioinformatics platforms and skills.&lt;br /&gt;
&lt;br /&gt;
# Curate and report on currently circulating pathogens to upload to ARGOS&lt;br /&gt;
## The student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
# Report Results&lt;br /&gt;
## Defend your pathogens you have selected to be added to the database. Explain their importance and what value they would hold to the scientific community if they were added.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Spring.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program. Additional recognition will be given to top three volunteers with exceptional presentations at the end of the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Bakshi&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Sampurna Chakravorty&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer&lt;br /&gt;
|PredictMod; ARGOS; BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1093</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1093"/>
		<updated>2025-11-25T18:09:07Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Agenda (All times are in Eastern Standard Time) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/farah-kamila/ Farah Kamila]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Namrata Oruganti&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
== Fall Symposium ==&lt;br /&gt;
The Fall symposium will be held virtually.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Date:&#039;&#039;&#039; Nov 26th, 2025 (Wednesday)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Time:&#039;&#039;&#039; 3 - 5 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Zoom Link&#039;&#039;&#039; - https://gwu-edu.zoom.us/j/96518488501?jst=2&lt;br /&gt;
&lt;br /&gt;
=== Agenda (All times are in Eastern Standard Time) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Time&lt;br /&gt;
!Project&lt;br /&gt;
!Presentation Title&lt;br /&gt;
!Presenter(s)&lt;br /&gt;
|-&lt;br /&gt;
|3 - 3:10 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Welcome &amp;amp; Introduction&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|-&lt;br /&gt;
|3:10 - 3:35 PM&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC (Tianyi &amp;amp; Lori) intro&lt;br /&gt;
* 15 mins - PredictMod: PMID Curation for Intervention Outcome Prediction Models (IOPMs)&lt;br /&gt;
* 5 min QA&lt;br /&gt;
|Diya Kamalabharathy; Anika Sikka; Ashley Tien; Farah Kamila&lt;br /&gt;
|-&lt;br /&gt;
|3:35 - 4:00 PM&lt;br /&gt;
|GlyGen&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC intro&lt;br /&gt;
* 15 mins - Curation of species metadata using LLM &amp;amp; Visualizing glycomics databases and their features&lt;br /&gt;
* 5 min QA&lt;br /&gt;
|Diya Kamalabharathy; Harivinay P. Gujjula&lt;br /&gt;
|-&lt;br /&gt;
|4:00 - 4:25 PM&lt;br /&gt;
|Argos&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC intro&lt;br /&gt;
* 15 mins -Curation of Pathogens and QC Analysis for the Argos Project QC analysis, representative genome selection Curation of genomes 1 &amp;amp; 2&lt;br /&gt;
* 5 mins QA&lt;br /&gt;
|Miao Wang; Arhamur Rauf&lt;br /&gt;
|-&lt;br /&gt;
|4:25 - 4:50 PM&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC (Daniall and Maria) intro &lt;br /&gt;
* 15 mins - Leveraging Large Language Models to collect Biomarker data from PubMed Abstracts&lt;br /&gt;
* 5 mins QA&lt;br /&gt;
|Namrata Oruganti; Vishal Muthusekaran; Sparsh Gupta&lt;br /&gt;
|-&lt;br /&gt;
|4: 50 - 5 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Remarks&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1087</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1087"/>
		<updated>2025-11-13T17:56:03Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/farah-kamila/ Farah Kamila]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Namrata Oruganti&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
== Fall Symposium ==&lt;br /&gt;
The Fall symposium will be held virtually.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Date:&#039;&#039;&#039; Nov 26th, 2025 (Wednesday)&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Time:&#039;&#039;&#039; 3 - 5 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Zoom Link&#039;&#039;&#039; - https://gwu-edu.zoom.us/j/96518488501?jst=2&lt;br /&gt;
&lt;br /&gt;
=== Agenda (All times are in Eastern Standard Time) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Time&lt;br /&gt;
!Project&lt;br /&gt;
!Presentation Title&lt;br /&gt;
!Presenter(s)&lt;br /&gt;
|-&lt;br /&gt;
|3 - 3:10 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Welcome &amp;amp; Introduction&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|-&lt;br /&gt;
|3:10 - 3:35 PM&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC (Tianyi &amp;amp; Lori) intro&lt;br /&gt;
* 15 mins - PredictMod: PubMed Curation for Training an LLM for Recommendation&lt;br /&gt;
* 5 min QA&lt;br /&gt;
|Diya Kamalabharathy; Anika Sikka; Ashley Tien; Farah Kamila&lt;br /&gt;
|-&lt;br /&gt;
|3:35 - 4:00 PM&lt;br /&gt;
|GlyGen&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC intro&lt;br /&gt;
* 15 mins - Curation of species metadata using LLM &amp;amp; Visualizing glycomics databases and their features&lt;br /&gt;
* 5 min QA&lt;br /&gt;
|Diya Kamalabharathy; Harivinay P. Gujjula&lt;br /&gt;
|-&lt;br /&gt;
|4:00 - 4:25 PM&lt;br /&gt;
|Argos&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC intro&lt;br /&gt;
* 15 mins -Curation of Pathogens and QC Analysis for the Argos Project QC analysis, representative genome selection Curation of genomes 1 &amp;amp; 2&lt;br /&gt;
* 5 mins QA&lt;br /&gt;
|Miao Wang; Arhamur Rauf; Linford&lt;br /&gt;
|-&lt;br /&gt;
|4:25 - 4:50 PM&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
* 5 min POC (Daniall and Maria) intro &lt;br /&gt;
* 15 mins - Leveraging Large Language Models to collect Biomarker data from PubMed Abstracts&lt;br /&gt;
* 5 mins QA&lt;br /&gt;
|Namrata Oruganti; Vishal Muthusekaran&lt;br /&gt;
|-&lt;br /&gt;
|4: 50 - 5 PM&lt;br /&gt;
| colspan=&amp;quot;2&amp;quot; |Remarks&lt;br /&gt;
|Raja Mazumder&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1084</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1084"/>
		<updated>2025-11-11T19:54:28Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== 2026 Spring Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 9, 2026, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
January 12, 2026 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: January, 2026 –  April, 2026&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[[Volunteership Fall 2025|Fall 2025 Volunteership]] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;br /&gt;
&lt;br /&gt;
== Spring Symposium (TBD) ==&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1083</id>
		<title>Volunteership Spring 2026</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Spring_2026&amp;diff=1083"/>
		<updated>2025-11-11T19:47:12Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: Created page with &amp;quot;vdv&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;vdv&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1080</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1080"/>
		<updated>2025-11-03T19:04:45Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/isil-erbasol-serbes/ Isil Erbasol Serbes]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/farah-kamila/ Farah Kamila]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Namrata Oruganti&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1079</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1079"/>
		<updated>2025-10-31T19:58:26Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/farah-kamila/ Farah Kamila]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Namrata Oruganti&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1078</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=1078"/>
		<updated>2025-10-31T13:17:19Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/farah-kamila/ Farah Kamila]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Robert Ziebich&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ashley-tien/ Ashley Tien]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Namrata Oruganti&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=984</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=984"/>
		<updated>2025-09-05T14:10:11Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy*]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Farah Kamila&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Robert Ziebich&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Ashley Tien&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Namrata Oruganti&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=981</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=981"/>
		<updated>2025-09-02T17:34:07Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy*&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Farah Kamila&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Robert Ziebich&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Ashley Tien&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS, PredictMod&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_2025&amp;diff=980</id>
		<title>Volunteership 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_2025&amp;diff=980"/>
		<updated>2025-08-29T15:34:23Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;For Fall opportunities, [[Volunteership Fall 2025|click here to view our Fall 2025 Volunteership Program]].&amp;lt;h2&amp;gt;2025 Volunteer Program Details&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h3&amp;gt;Dates&amp;lt;/h3&amp;gt;&lt;br /&gt;
&amp;lt;strong&amp;gt;Volunteer Zoom Kick-Off Meeting&amp;lt;/strong&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
May 27, 2025 | 3:30 to 4:30 PM&lt;br /&gt;
&lt;br /&gt;
&amp;lt;strong&amp;gt;Program Dates: June 2nd, 2025 – July 25th, 2025&amp;lt;/strong&amp;gt; (8 weeks)&amp;lt;br&amp;gt;&lt;br /&gt;
Monday to Friday | Remote | No breaks&lt;br /&gt;
&lt;br /&gt;
&amp;lt;hr&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h3&amp;gt;Volunteer Expectations&amp;lt;/h3&amp;gt;&lt;br /&gt;
&amp;lt;ol&amp;gt;&lt;br /&gt;
  &amp;lt;li&amp;gt;Daily progress updates via Slack (scrum).&amp;lt;/li&amp;gt;&lt;br /&gt;
  &amp;lt;li&amp;gt;Regular Zoom meetings with the assigned project point of contact.&amp;lt;/li&amp;gt;&amp;lt;li&amp;gt;Expected to dedicate 5–6 hours per day to project work, with the remaining time focused on skill development or reading. &amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ol&amp;gt;&lt;br /&gt;
&amp;lt;p style=&amp;quot;color: red;&amp;quot;&amp;gt;&amp;lt;strong&amp;gt;Important:&amp;lt;/strong&amp;gt; If the scrum is not updated for 2 consecutive days, the candidate will be &amp;lt;u&amp;gt;automatically dropped&amp;lt;/u&amp;gt; from the program.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;hr&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h3&amp;gt;Potential Projects&amp;lt;/h3&amp;gt;&lt;br /&gt;
&amp;lt;ol&amp;gt;&lt;br /&gt;
  &amp;lt;li&amp;gt;BiomarkerKB ([https://biomarkerkb.org biomarkerkb.org]) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&amp;lt;/li&amp;gt;&lt;br /&gt;
  &amp;lt;li&amp;gt;GlyGen ([https://glygen.org glygen.org]) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information. &amp;lt;/li&amp;gt;&amp;lt;li&amp;gt;ARGOS ([https://argosdb.org argosdb.org]) project: Analyze genomics data using HIVE to identify reference genome assemblies. &amp;lt;/li&amp;gt;&amp;lt;li&amp;gt;PredictMod ([https://hivelab.biochemistry.gwu.edu/predictmod hivelab.biochemistry.gwu.edu/predictmod]) project. Identifying datasets and harmonizing them so that they can be used to generate ML models.  &amp;lt;/li&amp;gt;&amp;lt;/ol&amp;gt;&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&amp;lt;hr&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h4&amp;gt;1. BiomarkerKB Biocuration Project Ideas&amp;lt;/h4&amp;gt;POC: Daniall Masood, Maria Kim&lt;br /&gt;
# Curate biomarkers for a specific disease (Alzheimers)&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer&lt;br /&gt;
&lt;br /&gt;
Data Identification &amp;amp; Curation: &lt;br /&gt;
&lt;br /&gt;
# Identify publicly-available datasets from scientific literature that can be used for intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
&lt;br /&gt;
Modeling &amp;amp; Integration (for those with experience in programming/ML)&lt;br /&gt;
&lt;br /&gt;
# Conduct data harmonization and pre-processing following established project pipelines to make ML-ready dataset and data dictionary.&lt;br /&gt;
# Perform model training and document ML pipeline in a BioCompute Object (BCO).&lt;br /&gt;
# Integrate model into PredictMod platform.&lt;br /&gt;
&lt;br /&gt;
Individuals with a background or interest in machine learning should reach out to lorikrammer@gwu.edu with a potential dataset to determine if it is a feasible project for the summer.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail. ~1 week&#039;s worth of work&lt;br /&gt;
## Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found. ~4-10 weeks worth of work&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&amp;lt;hr&amp;gt;&lt;br /&gt;
&amp;lt;h3&amp;gt;Requirements for Completion&amp;lt;/h3&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&amp;lt;strong&amp;gt;Note:&amp;lt;/strong&amp;gt; The following are &amp;lt;u&amp;gt;mandatory&amp;lt;/u&amp;gt;. Failure to complete any will result in an incomplete volunteer record.&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h4&amp;gt;Documentation&amp;lt;/h4&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h4&amp;gt;Written Report&amp;lt;/h4&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h4&amp;gt;Presentation &amp;amp; Slide Submission&amp;lt;/h4&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;Present your work last week of the 8-week period.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;Slides must be submitted to the Admin Team and should include:&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
  &amp;lt;li&amp;gt;See Symposium Slides Guidelines below&amp;lt;/li&amp;gt;&amp;lt;/ul&amp;gt;&lt;br /&gt;
Contact the Admin Team to access previously submitted slides.&lt;br /&gt;
&amp;lt;hr&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
&amp;lt;hr&amp;gt;&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
&amp;lt;hr&amp;gt;&lt;br /&gt;
=== Volunteers ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
|-&lt;br /&gt;
! Name&lt;br /&gt;
!Project&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.linkedin.com/in/gracesjchong/ Grace Chong]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
# PredictMod&lt;br /&gt;
# BiomarkerKB Biocuration&lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/alma-ogunsina-4959072b1/ Alma Ogunsina]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB&lt;br /&gt;
# ARGOS&lt;br /&gt;
# PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/diya-kamalabharathy-62557935a/ Diya Kamalabharathy]&lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB Biocuration&lt;br /&gt;
# PredictMod Machine Learning&lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/harivinay-prasad-reddy-gujjula-a06ba71bb/ Harivinay P. Gujjula]&lt;br /&gt;
|GlyGen curation&lt;br /&gt;
|&lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
# BioMarkerKB Biocuration&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/miao-wang-88b602290/Miao&amp;amp;#x20;Wang Miao Wang]&lt;br /&gt;
|ARGOS&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB Biocuration Project Ideas&lt;br /&gt;
# FDA-ARGOS Computation and Pathogen Curation Project&lt;br /&gt;
# PredictMod Machine Learning Project Ideas&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/nahom-gebreselassie-1545ab336/ Nahom Abel]&lt;br /&gt;
|GlyGen curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB Biocuration&lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
# PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/kajal-patel-cs/ Kajal Sanjaykumar Patel]&lt;br /&gt;
|GlyGen and PubMed project&lt;br /&gt;
|&lt;br /&gt;
#PredictMod&lt;br /&gt;
#BiomarkerKB&lt;br /&gt;
#GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/john-mccaffrey-b8850930a/ John McCaffrey]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# PredictMod&lt;br /&gt;
# BiomarkerKB&lt;br /&gt;
# GlyGen Biocuration &lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/nathan-ressom/ Nathan Ressom]&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|&lt;br /&gt;
# PredictMod &lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
# BiomarkerKB Biocuration&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/aaron-ressom/ Aaron Ressom] &lt;br /&gt;
|PredictMod&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB &lt;br /&gt;
# PredictMod &lt;br /&gt;
# GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/akale-kinfe/ Akale Kinfe]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB Biocuration&lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
# ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/aise-arpinar-a8bb9b373/?original_referer= Aise Arpinar]&lt;br /&gt;
|GlyGen curation&lt;br /&gt;
|&lt;br /&gt;
# GlyGen Biocuration&lt;br /&gt;
# BiomarkerKB Biocuration&lt;br /&gt;
# GlyGen Publication Analysis&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/piyush-pandey-906b582b5/ Piyush Pandey]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB Biocuration &lt;br /&gt;
# PredictMod &lt;br /&gt;
# GlyGen Biocuration &lt;br /&gt;
|-&lt;br /&gt;
|[http://www.linkedin.com/in/filmawit-zeru-203272363 Filmawit Zeru]&lt;br /&gt;
|GlycoSiteMiner project&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB&lt;br /&gt;
# GlyGen&lt;br /&gt;
# ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/mathias-belay-03b51a2a3/ Mathias Belay]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# GlyGen&lt;br /&gt;
# PredictMod&lt;br /&gt;
# BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/isaac-kim-b644bb231/ Isaac Kim]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB&lt;br /&gt;
# PredictMod&lt;br /&gt;
# GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/sohana-bahl-6549a2376/ Sohana Bahl]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.linkedin.com/in/ana-vohralikova-794a4433a?utm_source=share&amp;amp;utm_campaign=share_via&amp;amp;utm_content=profile&amp;amp;utm_medium=ios_app Ana Vohralikova]&lt;br /&gt;
|Biomarker curation&lt;br /&gt;
|&lt;br /&gt;
# BiomarkerKB Biocuration Project&lt;br /&gt;
# GlyGen Biocuration Project&lt;br /&gt;
# FDA-ARGOS Computation and Pathogen&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;hr&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Symposium Slide Guidelines ===&lt;br /&gt;
&#039;&#039;&#039;Content Clarity&#039;&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
   •       &#039;&#039;&#039;Keep It Simple:&#039;&#039;&#039; Use concise bullet points instead of long paragraphs. Aim for no more than 6 bullet points per slide. &lt;br /&gt;
&lt;br /&gt;
  •        &#039;&#039;&#039;Focus on Key Points:&#039;&#039;&#039; Highlight the main ideas or data you want your audience to remember. &lt;br /&gt;
&lt;br /&gt;
  •        &#039;&#039;&#039;Consistent Layout:&#039;&#039;&#039; Use a consistent layout for each slide, including fonts, colors, and background. This helps maintain a professional look. &lt;br /&gt;
&lt;br /&gt;
  •        &#039;&#039;&#039;High-Quality Images:&#039;&#039;&#039; Use high-resolution images and graphics to illustrate your points. Avoid using clip art. &lt;br /&gt;
&lt;br /&gt;
  •        &#039;&#039;&#039;Readable Fonts:&#039;&#039;&#039; Use easy-to-read fonts (e.g., Arial, Calibri) and ensure font sizes are large enough to be seen from a distance (24 pt or larger for main text). &lt;br /&gt;
&lt;br /&gt;
  •        &#039;&#039;&#039;Contrast:&#039;&#039;&#039; Ensure there is high contrast between text and background (e.g., dark text on a light background). &lt;br /&gt;
&lt;br /&gt;
  •        &#039;&#039;&#039;Citation:&#039;&#039;&#039; Cite a publication to support the information presented in proper citation format. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Outline for Symposium presentation&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
1.       Introduction: &lt;br /&gt;
&lt;br /&gt;
2.       Project Descriptions &lt;br /&gt;
&lt;br /&gt;
3.       Objectives and Goals: &lt;br /&gt;
&lt;br /&gt;
4.       Methods, Results, Achievements and Contributions: &lt;br /&gt;
&lt;br /&gt;
5.       Future Plans: &lt;br /&gt;
&lt;br /&gt;
6.       Skills and Knowledge Gained: &lt;br /&gt;
&lt;br /&gt;
7.       Acknowledgments: &lt;br /&gt;
&lt;br /&gt;
8.       Q&amp;amp;A Session: &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Outline&#039;&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;1. Introduction:&#039;&#039;&#039;  (1 slide)&lt;br /&gt;
&lt;br /&gt;
  - Briefly introduce yourself.  &lt;br /&gt;
&lt;br /&gt;
  - Add your picture and name on the introduction slide.  If it is group add the group picture.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;2. Project Descriptions:&#039;&#039;&#039;  (1 slide)&lt;br /&gt;
&lt;br /&gt;
  - Provide context and background information about the project.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. Project Objectives and Goals:&#039;&#039;&#039;  (1 slide)&lt;br /&gt;
&lt;br /&gt;
  - Describe the main objectives of the project or initiative.  &lt;br /&gt;
&lt;br /&gt;
  - Discuss any additional goals or desired outcomes.  &lt;br /&gt;
&lt;br /&gt;
  - Explain why these objectives and goals are important.  &lt;br /&gt;
&lt;br /&gt;
  &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;4. Methods, Results, Achievements and Contributions:&#039;&#039;&#039;  &lt;br /&gt;
&lt;br /&gt;
  -  Highlight the methods/tools used in the project.  &lt;br /&gt;
&lt;br /&gt;
  - Highlight the key results and outcomes of the project.  &lt;br /&gt;
&lt;br /&gt;
  - Discuss the most significant achievements and milestones reached.  &lt;br /&gt;
&lt;br /&gt;
  - Explain how each member of the team project contributed to the project (for group project) &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039; &#039;&#039;&#039; &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. Future Plans&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
  - Next steps or future plans for the project&lt;br /&gt;
&lt;br /&gt;
  &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;6. Skills and Knowledge Gained:&#039;&#039;&#039;  (1 slide)&lt;br /&gt;
&lt;br /&gt;
  -   Detail any technical skills acquired or improved.  &lt;br /&gt;
&lt;br /&gt;
  - Highlight any soft skills, such as communication or teamwork, that were developed.  &lt;br /&gt;
&lt;br /&gt;
  - Discuss new knowledge gained in specific areas or subjects.  &lt;br /&gt;
&lt;br /&gt;
  -  Share any personal reflections on the experience and what was learned.  &lt;br /&gt;
&lt;br /&gt;
  - Discuss any challenges or obstacles encountered and how they were overcome.  &lt;br /&gt;
&lt;br /&gt;
  - Provide key insights or lessons learned from the project.  &lt;br /&gt;
&lt;br /&gt;
  &lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;7. Acknowledgments:&#039;&#039;&#039;  &#039;&#039;&#039;:&#039;&#039;&#039;  (1 slide)&lt;br /&gt;
&lt;br /&gt;
  - Acknowledge the contributions of team members and collaborators.  &lt;br /&gt;
&lt;br /&gt;
- Recognize the guidance and support of mentors and advisors.  &lt;br /&gt;
&lt;br /&gt;
  - Acknowledge the Project Funding.  Eg. CFDE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;8. Q&amp;amp;A Session:&#039;&#039;&#039;  &lt;br /&gt;
&lt;br /&gt;
  - Invite the audience to ask questions and engage in discussion.  &lt;br /&gt;
&lt;br /&gt;
  - Provide clear and thoughtful responses to audience questions.  &lt;br /&gt;
&lt;br /&gt;
  - Offer closing remarks and thank the audience for their participation. &lt;br /&gt;
&lt;br /&gt;
Note – If you have limited presentation time you can also merge few topics into one.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=979</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=979"/>
		<updated>2025-08-29T15:23:10Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy*&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Farah Kamila&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Robert Ziebich&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|ARGOS&lt;br /&gt;
|Christie Woodside, Jonathon Keeney&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=951</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=951"/>
		<updated>2025-08-26T19:00:36Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy*&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Farah Kamila&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Robert Ziebich&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=950</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=950"/>
		<updated>2025-08-25T20:43:15Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy*&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Farah Kamila&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, ARGOS, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Robert Ziebich&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, BioMarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Arhamur Rauf&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS, GlyGen, PredictMod&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=946</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=946"/>
		<updated>2025-08-22T16:12:28Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast cancers, biomarkers and glycans, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
# Prepare a Wikipage to showcase the validated PMIDs.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy*&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula*&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Anagha Kalle&lt;br /&gt;
|PredictMod&lt;br /&gt;
|Lori Krammer, Tianyi Wang&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|GlyGen&lt;br /&gt;
|Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Farah Kamila&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, ARGOS, BioMarkerKB&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=939</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=939"/>
		<updated>2025-08-20T18:19:17Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast, and liver cancer, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the Fall.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy*&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay*&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Anagha Kalle&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|Daniall Masood, Maria Kim&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Miao Wang*&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|ARGOS&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;nowiki&amp;gt;*&amp;lt;/nowiki&amp;gt;Returning volunteer.&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=935</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=935"/>
		<updated>2025-08-20T18:15:15Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: /* Volunteers (TBD) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast, and liver cancer, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!POC Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Isil Erbasol Serbes&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod, BiomarkerKB, ARGOS&lt;br /&gt;
|-&lt;br /&gt;
|Ramtin Mashhoon&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Anagha Kalle&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|PredictMod&lt;br /&gt;
|-&lt;br /&gt;
|Vishal Muthusekaran&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Adonay Awet&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=933</id>
		<title>Volunteership Fall 2025</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Volunteership_Fall_2025&amp;diff=933"/>
		<updated>2025-08-18T15:08:58Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: Added Mathias&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== 2025 Volunteer Program Details ==&lt;br /&gt;
&lt;br /&gt;
=== Dates ===&lt;br /&gt;
&#039;&#039;&#039;Application Deadline&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 22, 2025, Noon (email your updated resume and projects in order of preference). Acceptance letter/email will be sent to candidates latest the day after the kick-off meeting.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Volunteer Zoom Kick-Off Meeting&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
August 25, 2025 | 4:00 to 5:00 PM&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Program Dates: September 1st, 2025 – November 30th, 2025&#039;&#039;&#039; (13 weeks)&lt;br /&gt;
&lt;br /&gt;
Remote | Hybrid for GW employees and students (Ross Hall 5th floor)&lt;br /&gt;
&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/wiki/Volunteership_2025 Summer 2025 Volunteership] (Closed)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteer Expectations ===&lt;br /&gt;
&lt;br /&gt;
# Minimum commitment of 10 hours per week.&lt;br /&gt;
# Progress updates via Slack at least 3 days per week (scrum).&lt;br /&gt;
# Regular Zoom meetings with the assigned project point of contact.&lt;br /&gt;
# Attend some lectures or seminars remotely (max 4-5).&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;&#039;&#039;Important:&#039;&#039;&#039; If the scrum is not updated for 2 consecutive working days, the candidate will be automatically dropped from the program.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Potential Projects ===&lt;br /&gt;
We are excited to continue our bioinformatics volunteership program in Fall 2025. This program offers students the opportunity to work on bioinformatics projects supported by agencies such as the NIH, ARPA-H, and FDA. Participants will gain exposure to a variety of activities within a bioinformatics lab, including data analysis, computational biology, and genomics. If you are interested, please email mazumder_lab@gwu.edu your resume and a ranked list of the projects that interest you most. You can also indicate if you want to focus on specific areas that are of interest to you.&lt;br /&gt;
# BiomarkerKB (biomarkerkb.org) project: Biomarker curation project. Involves reading papers and collecting biomarkers.&lt;br /&gt;
# GlyGen (glygen.org) project: Review glycomics and glycoproteomics data and curate tissue, disease, and other related information.&lt;br /&gt;
# ARGOS (argosdb.org) project: Analyze genomics data using HIVE to identify reference genome assemblies.&lt;br /&gt;
# PredictMod (hivelab.biochemistry.gwu.edu/predictmod) project. Curating PMIDs for intervention outcome prediction dataset LLM recommendation training.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;Note: Individuals involved in the above projects with a background in programming and/or machine learning may also undertake additional tasks to support the development of ML models, which can be integrated into PredictMod or used to enhance AI/ML-ready datasets within GlyGen.&#039;&#039;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==== 1. BiomarkerKB Biocuration Project Ideas ====&lt;br /&gt;
POC: Daniall Masood, Maria Kim&lt;br /&gt;
&lt;br /&gt;
# Curate biomarkers for a specific disease&lt;br /&gt;
## The student would be doing manual curation for about 4 weeks, with regular check-ins with me to ensure it is being done correctly.&lt;br /&gt;
## The next 4 weeks can be dedicated to developing an LLM or an automated process to extract biomarker details with data collected in the first 4 weeks as training data/example data.&lt;br /&gt;
# Top 50 biomarkers&lt;br /&gt;
## Curate the top 50 biomarkers for biomarkerkb.org.&lt;br /&gt;
## Define what constitutes a top 50 biomarker.&lt;br /&gt;
## Begin curating biomarkers from different sources and papers by collecting fields mentioned in the data model, as well as collecting cross-references.&lt;br /&gt;
# Biocuration of biomarkers from NLP/LLM work&lt;br /&gt;
## Use the biomarkers collected from NLP work.&lt;br /&gt;
## Curate biomarkers. Data provided was not provided in the biomarker data model.&lt;br /&gt;
## While curating the biomarkers, check if data collected from NLP is correct.&lt;br /&gt;
## After completion, the student can start using curated data to work on the NLP/LLM method.&lt;br /&gt;
# Curate biomarkers for a treatment&lt;br /&gt;
## See #1 above.&lt;br /&gt;
# Continue working on LLM methods started by volunteers over the summer.&lt;br /&gt;
## The data is available as well as some preliminary research and work done by previous volunteers in this area.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas, diseases, treatments, or methods they want to focus on, please reach out to daniallmasood@gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
&lt;br /&gt;
==== 2. GlyGen Biocuration Project Ideas ====&lt;br /&gt;
POC: Rene Ranzinger, Urnisha Bhuiyan, Kate Warner&lt;br /&gt;
&lt;br /&gt;
Over the last three decades, numerous glycomics database projects have been initiated to collect valuable information about glycans, proteins, and their interactions. Some of these databases have been discontinued due to the end of project funding. However, the data within these databases remains highly valuable to the community. Integrating these datasets into modern databases or knowledgebases, such as GlyGen, presents a challenge because much of the valuable metadata (e.g., species, tissue, disease, cell line) annotations are free-text terms that do not align with established standard dictionaries and ontologies used in modern resources. Automated matching of this information with dictionaries or ontologies is often not possible due to the use of synonyms, spelling errors, or abbreviations. For example, &amp;quot;human,&amp;quot; &amp;quot;man,&amp;quot; and &amp;quot;h. sapiens&amp;quot; all map to the scientific species name &amp;quot;Homo sapiens.&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The GlyGen project aims to make datasets from two older databases (CarbBank, CFG) accessible by migrating the data and metadata into our database. For this project, we are seeking curators with a medical or biology background who are interested in helping map metadata terms from these old databases to standard dictionaries and ontologies.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using internet resources (e.g., Google, Wikipedia) to identify terms used in the old database.&lt;br /&gt;
# Mapping identified terms to corresponding dictionaries and ontologies using the webpages and search interfaces of these projects.&lt;br /&gt;
# Finding papers based on titles and author lists that may contain spelling errors.&lt;br /&gt;
# Interacting and discussing with other curators in case terms are mapped differently.&lt;br /&gt;
&lt;br /&gt;
If you have any other ideas or methods you would like to focus on, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;3. GlyGen Publication Analysis Project Ideas&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Rene Ranzinger and Urnisha Bhuiyan&lt;br /&gt;
&lt;br /&gt;
One of the challenges for any bioinformatics project is understanding the size of its community, how well the project serves this community, and how widely its software/database is used. A potential solution is to analyze PubMed publication data. We are seeking applicants with programming skills (in Python or Java) to perform this analysis.&lt;br /&gt;
&lt;br /&gt;
The project involves:&lt;br /&gt;
&lt;br /&gt;
# Using the PubMed web API to filter publications based on keywords.&lt;br /&gt;
# Analyzing paper abstracts to identify research institutions and groups that form the community.&lt;br /&gt;
# Filtering the community list to exclude unrelated co-authors.&lt;br /&gt;
# Prioritize papers identified by GlycoSiteMiner for curation via TableMaker&lt;br /&gt;
&lt;br /&gt;
A subproject will involve analyzing the full text of papers (when available) for keywords or resource and database names. The results of the analysis will be discussed with GlyGen project member who will suggest changes and improvements to the analysis and data presentation. Source code developed as part of this project will be documented and shared in a public GitHub repository. If you have any other ideas or methods you would like to explore, please reach out to rene@ccrc.uga.edu to discuss them.&lt;br /&gt;
&lt;br /&gt;
==== 4. PredictMod Machine Learning Project Ideas ====&lt;br /&gt;
POC: Lori Krammer, Tianyi Wang, Pat McNeely (optional)&lt;br /&gt;
&lt;br /&gt;
Identifying relevant and useful publicly-available datasets for machine learning is currently a resource-intensive task. This curation project aims to develop a corpus for training an AI model to recommend PMIDs with publicly-available datasets useful for intervention outcome prediction models. The corpus will include an annotation spreadsheet + annotated PDFs for PubMed articles relevant to prostate, lung, breast, and liver cancer, and focus on indicators such as condition, intervention, and response.&lt;br /&gt;
&lt;br /&gt;
PMID curation involves:&lt;br /&gt;
&lt;br /&gt;
# Identify potentially relevant PMIDs that may have publicly-available datasets for training intervention outcome prediction models.&lt;br /&gt;
# Curate indicators of useful ML publications that could be used to train an LLM to recommend relevant publications for cancer modeling.&lt;br /&gt;
# Review peer curations and resolve annotation conflicts.&lt;br /&gt;
&lt;br /&gt;
Interested individuals should reach out to lorikrammer@gwu.edu.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;5. FDA-ARGOS Computation and Pathogen Curation Project&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
POC: Christie Woodside, Jonathon Keeney&lt;br /&gt;
&lt;br /&gt;
# Update data tables for more efficient computations&lt;br /&gt;
## Student would review and input additional data and IDs in the tables/sheets used to perform computations. This would be manual work (but super important), but would require high attention to detail.&lt;br /&gt;
## Additional Work: Requires Python/shell coding background. Student would run scripts that prepare and format data tables that are pushed to data.argosdb.org. Coding knowledge is needed in case of errors, bugs, or other mishaps in the code. Ongoing work as computations are performed.&lt;br /&gt;
# Curate and report on current pathogens to upload to ARGOS&lt;br /&gt;
## Student would work on manual curation of circulating pathogens to be added to data.argosdb.org. Regular check-ins and reports of what was found.&lt;br /&gt;
## Locate assembly IDs, reads, and metagenomic information for these pathogens to be used in computations and deposited into data.argosdb.org.&lt;br /&gt;
## Provide documentation on why they were curated, why they are important, how they were selected, and how data was collected.&lt;br /&gt;
# QC Analysis using HIVE&lt;br /&gt;
## Analyze the curated pathogens using our QC ARGOS one-click pipeline.&lt;br /&gt;
## The results will be added to our ARGOS database.&lt;br /&gt;
&lt;br /&gt;
If the student has any other ideas or methods they want to focus on, please reach out to christie.woodside@email.gwu.edu to discuss your idea and check if it will be feasible as a project for the summer.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Requirements for Completion ===&lt;br /&gt;
&#039;&#039;&#039;Note:&#039;&#039;&#039; The following are mandatory. Failure to complete any will result in an incomplete volunteer record.&lt;br /&gt;
&lt;br /&gt;
==== Documentation ====&lt;br /&gt;
All volunteers must maintain adequate documentation of their work, including written protocols and scripts submitted to GitHub.&lt;br /&gt;
&lt;br /&gt;
==== Written Report ====&lt;br /&gt;
Submit a 1–2 page summary of your tasks and accomplishments to the Admin during the final week of your program.&lt;br /&gt;
&lt;br /&gt;
==== Presentation &amp;amp; Slide Submission ====&lt;br /&gt;
Present your work last week of the 13-week period.&lt;br /&gt;
&lt;br /&gt;
Slides must be submitted to the POCs.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Completion Certificate ===&lt;br /&gt;
A certificate of completion and a letter of recommendation will be provided to all participants who successfully complete the program.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Contact ===&lt;br /&gt;
mazumder_lab@gwu.edu.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Volunteers (TBD) ===&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
!Name&lt;br /&gt;
!Project Assigned&lt;br /&gt;
!Projects Interested&lt;br /&gt;
|-&lt;br /&gt;
|Diya Kamalabharathy&lt;br /&gt;
|&lt;br /&gt;
|PredictMod; Glyco web development&lt;br /&gt;
|-&lt;br /&gt;
|Anika Sikka&lt;br /&gt;
|&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Akale Kinfe&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Nahom Abel&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Harivinay P. Gujjula&lt;br /&gt;
|&lt;br /&gt;
|GlyGen&lt;br /&gt;
|-&lt;br /&gt;
|Sparsh Gupta&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|-&lt;br /&gt;
|Mathias Belay&lt;br /&gt;
|&lt;br /&gt;
|BiomarkerKB&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Projects&amp;diff=203</id>
		<title>Projects</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Projects&amp;diff=203"/>
		<updated>2024-12-03T14:47:23Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Current Projects&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main The High-performance Integrated Virtual Environment (HIVE) platform]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
HIVE is a cloud-based environment optimized for the storage and analysis of extra-large data, such as biomedical data, clinical data, next-generation sequencing (NGS) data, mass spectrometry files, confocal microscopy images, post-market surveillance data, medical recall data, and many others. HIVE provides secure web access for authorized users to deposit, retrieve, annotate and compute on Big Data, and analyze the outcomes using web user interfaces. [https://docs.google.com/document/d/1F5iq00uKkJfdSsbwanvKOy-nPnwijH56mwbwa_HhzfY/edit?tab=t.0#heading=h.7dlfmngwfzih More here].&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.biocomputeobject.org/ BioCompute Objects (BCO)]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The BioCompute is FDA funded project to establish a framework for community-based development of standards for harmonization of High-throughput Sequencing (HTS), standardization of data formats, promotion of interoperability, and bioinformatics verification protocols. The BioCompute Object (BCO) was developed in the High-throughput Sequencing Computational Standards for Regulatory Sciences (HTS-CSRS) initiative in the BioCompute Objects Portal (BOP), a web portal to serve as a collaborative ground to encourage a dialogue to facilitate interoperability between different bioinformatic pipelines, industries, and developers. HIVE capabilities have been leveraged to support the development of the BCO. The BCO is versatile and adaptable to other common HTS analysis platforms. [https://docs.google.com/document/d/1WQFZm_PFiQXob4NyOKq6y-2ywnbmNoFHSS27fYf3l4Y/edit?tab=t.0#heading=h.bs8eki17tykx More here].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.glygen.org/ GlyGen]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
GlyGen (gly-glycobiology; gen-information), is an advanced glycoinformatics resource developed to facilitate discovery in basic and translational glycobiology research along with enhancing the integration of multidisciplinary information from diverse resources. GlyGen includes knowledge about molecular, biophysical and functional properties of glycans, genes, and proteins organized in pathways and ontologies, plus a rapidly growing body of biological big data related to cancer mutation and expression. GlyGen adopts an innovative user-driven approach for implementing, prioritizing and knowledge disseminating tools to address the questions and needs of glycobiology community.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.biochemistry.gwu.edu/predictmod PredictMod]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
PredictMod is an application designed to predict the outcome of an intervention prior to a patient initiating treatment. Our goal is to provide clinicians with a powerful decision making tool that enhances clinical understanding of patient-level data. The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition. While our primary condition of interest is Prediabetes, the tool is designed to be used for a variety of conditions, interventions, and data types.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[[GW-FEAST]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The GW Federated Ecosystems for Analytics and Standardized Technologies (GW-FEAST) project is part of the ARPA-H FEAST performer team initiative that includes academic and industry partners. The goal of the ARPA-H performer teams is “to create bridges across data silos to make health data more accessible and usable”.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.tst.biochemistry.gwu.edu/biomarker-partnership Biomarker Partnership]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The Biomarker Partnership is a CFDE sponsored project to develop a knowledgebase that will organize and integrate biomarker data from different public sources. The data will be connected to contextual information to show a novel systems-level view of biomarkers. The motivation for this project is to improve the harmonization and organization of biomarker data. This will be done by mapping biomarkers from public sources to, and across, CF data elements. This mapping will bridge knowledge across multiple DCCs and biomedical disciplines.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Past Projects&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.tst.biochemistry.gwu.edu/gfkb Gut Microbiome Analytic System (Microbiome)]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The HIVE team received NSF funding to develop a Gut Microbiome Monitoring System (GutFeeling) as a tool which when used over time will allow users to rectify their dietary (such as consumption of probiotics and prebiotics) and other lifestyle habits and to help restore their normal microbiome. Rapid analysis of the large amount of metagenomic data, a major bottleneck, has been resolved by our group through the development of a novel algorithm and accompanying software called CensuScope. Through analysis of healthy gut microbiome data, we are actively developing a Knowledge Base (GutFeelingKB) to provide a clearer picture of not only an ideal personalized microbiome but also establish baseline characteristics for each customer. The Mazumder Lab is collaborating with the Milken School of Public Health and Kamtek Sequencing Facility to investigate the relationship between bacterial species commonly present in the digestive tract, diet, physical activity, lifestyle habits, and metabolic risk factors. [https://docs.google.com/document/d/18WyVTJrrf-FR0sHt634vO8Lwel-4OQxP9sNar7gYYro/edit?tab=t.0#heading=h.7qbm3f7lky31 More here].&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;HIVE-EQAPOL Project on HIVE NGS Data Processing and Analysis&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
For this project, our group works closely with the External Quality Assurance Program Oversight Laboratory (EQAPOL) team to conduct HIV NGS data analysis and collaborate in terms of analyzing, storing, and tracking HIV NGS Data. Reliable identification of strains is critical for developing new assays, validating assay platforms, assisting regulators to evaluate test kits, monitoring HIV drug resistance, and informing vaccine development. The HIVE tools and platform are used for virus identification, recombination analysis, and clone discovery.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.oncomx.org/ OncoMX]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The OncoMX mission is to create an integrated cancer mutation and expression resource for exploring cancer biomarkers. OncoMX is a collaboration between the George Washington University (GW), NASA&#039;s Jet Propulsion Laboratory (JPL), the Swiss Institute of Bioinformatics (SIB), and the University of Delaware (UD). The core knowledgebase of OncoMX is derived from BioMuta and BioXpress integrated cancer mutation and expression databases. Normal expression data from Bgee and custom text mining software augment the cancer data to improve functional interpretation of the reported variants and expression profiles. All data are wrapped into the OncoMX database and web portal, mapped to additional functional information from NCI Early Detection Research Network (EDRN) and Reactome. It is expected that the large-scale integration of cancer data and supporting information, provided by OncoMX with direct community feedback, will benefit cancer research by improving synthesis of information and may make earlier detection a reality.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main Glycoproteomics Characterization Workflow and Data-Analysis Pipeline for Vaccines and Biosimilars]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
In this FDA funded project we are extending High-performance Integrated Virtual Environment (HIVE) capabilities through the development and integration of software tools and datasets for comparative analysis of glycoproteins. Glycomic analysis has many angles and has been extensively reviewed in recent literature. We propose to rely on the independent development of the glycomics field and incorporate these approaches in the HIVE pipeline as they mature while we develop a standardized glycoinformatics pipeline that will benefit investigators and regulators at the FDA.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Projects&amp;diff=202</id>
		<title>Projects</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Projects&amp;diff=202"/>
		<updated>2024-12-03T14:44:01Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Current Projects&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main The High-performance Integrated Virtual Environment (HIVE) platform]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
HIVE is a cloud-based environment optimized for the storage and analysis of extra-large data, such as biomedical data, clinical data, next-generation sequencing (NGS) data, mass spectrometry files, confocal microscopy images, post-market surveillance data, medical recall data, and many others. HIVE provides secure web access for authorized users to deposit, retrieve, annotate and compute on Big Data, and analyze the outcomes using web user interfaces. [https://docs.google.com/document/d/1F5iq00uKkJfdSsbwanvKOy-nPnwijH56mwbwa_HhzfY/edit?tab=t.0#heading=h.7dlfmngwfzih More here].&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.biocomputeobject.org/ BioCompute Objects (BCO)]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The BioCompute is FDA funded project to establish a framework for community-based development of standards for harmonization of High-throughput Sequencing (HTS), standardization of data formats, promotion of interoperability, and bioinformatics verification protocols. The BioCompute Object (BCO) was developed in the High-throughput Sequencing Computational Standards for Regulatory Sciences (HTS-CSRS) initiative in the BioCompute Objects Portal (BOP), a web portal to serve as a collaborative ground to encourage a dialogue to facilitate interoperability between different bioinformatic pipelines, industries, and developers. HIVE capabilities have been leveraged to support the development of the BCO. The BCO is versatile and adaptable to other common HTS analysis platforms. [https://docs.google.com/document/d/1WQFZm_PFiQXob4NyOKq6y-2ywnbmNoFHSS27fYf3l4Y/edit?tab=t.0#heading=h.bs8eki17tykx More here].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.glygen.org/ GlyGen]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
GlyGen (gly-glycobiology; gen-information), is an advanced glycoinformatics resource developed to facilitate discovery in basic and translational glycobiology research along with enhancing the integration of multidisciplinary information from diverse resources. GlyGen includes knowledge about molecular, biophysical and functional properties of glycans, genes, and proteins organized in pathways and ontologies, plus a rapidly growing body of biological big data related to cancer mutation and expression. GlyGen adopts an innovative user-driven approach for implementing, prioritizing and knowledge disseminating tools to address the questions and needs of glycobiology community.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.biochemistry.gwu.edu/predictmod PredictMod]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
PredictMod is an application designed to predict the outcome of an intervention prior to a patient initiating treatment. Our goal is to provide clinicians with a powerful decision making tool that enhances clinical understanding of patient-level data. The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition. While our primary condition of interest is Prediabetes, the tool is designed to be used for a variety of conditions, interventions, and data types.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[[GW-FEAST]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The GW Federated Ecosystems for Analytics and Standardized Technologies (GW-FEAST) project is part of the ARPA-H FEAST performer team initiative that includes academic and industry partners. The goal of the ARPA-H performer teams is “to create bridges across data silos to make health data more accessible and usable”.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.tst.biochemistry.gwu.edu/biomarker-partnership Biomarker Partnership]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The Biomarker Partnership is a CFDE sponsored project to develop a knowledgebase that will organize and integrate biomarker data from different public sources. The data will be connected to contextual information to show a novel systems-level view of biomarkers. The motivation for this project is to improve the harmonization and organization of biomarker data. This will be done by mapping biomarkers from public sources to, and across, CF data elements. This mapping will bridge knowledge across multiple DCCs and biomedical disciplines.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Past Projects&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.tst.biochemistry.gwu.edu/gfkb Gut Microbiome Analytic System (Microbiome)]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The HIVE team received NSF funding to develop a Gut Microbiome Monitoring System (GutFeeling) as a tool which when used over time will allow users to rectify their dietary (such as consumption of probiotics and prebiotics) and other lifestyle habits and to help restore their normal microbiome. Rapid analysis of the large amount of metagenomic data, a major bottleneck, has been resolved by our group through the development of a novel algorithm and accompanying software called CensuScope. Through analysis of healthy gut microbiome data, we are actively developing a Knowledge Base (GutFeelingKB) to provide a clearer picture of not only an ideal personalized microbiome but also establish baseline characteristics for each customer. The Mazumder Lab is collaborating with the Milken School of Public Health and Kamtek Sequencing Facility to investigate the relationship between bacterial species commonly present in the digestive tract, diet, physical activity, lifestyle habits, and metabolic risk factors. [https://docs.google.com/document/d/18WyVTJrrf-FR0sHt634vO8Lwel-4OQxP9sNar7gYYro/edit?tab=t.0#heading=h.7qbm3f7lky31 More here].&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;HIVE-EQAPOL Project on HIVE NGS Data Processing and Analysis&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
For this project, our group works closely with the External Quality Assurance Program Oversight Laboratory (EQAPOL) team to conduct HIV NGS data analysis and collaborate in terms of analyzing, storing, and tracking HIV NGS Data. Reliable identification of strains is critical for developing new assays, validating assay platforms, assisting regulators to evaluate test kits, monitoring HIV drug resistance, and informing vaccine development. The HIVE tools and platform are used for virus identification, recombination analysis, and clone discovery.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.oncomx.org/ OncoMX]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The OncoMX mission is to create an integrated cancer mutation and expression resource for exploring cancer biomarkers. OncoMX is a collaboration between the George Washington University (GW), NASA&#039;s Jet Propulsion Laboratory (JPL), the Swiss Institute of Bioinformatics (SIB), and the University of Delaware (UD). The core knowledgebase of OncoMX is derived from BioMuta and BioXpress integrated cancer mutation and expression databases. Normal expression data from Bgee and custom text mining software augment the cancer data to improve functional interpretation of the reported variants and expression profiles. All data are wrapped into the OncoMX database and web portal, mapped to additional functional information from NCI Early Detection Research Network (EDRN) and Reactome. It is expected that the large-scale integration of cancer data and supporting information, provided by OncoMX with direct community feedback, will benefit cancer research by improving synthesis of information and may make earlier detection a reality.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Projects&amp;diff=201</id>
		<title>Projects</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Projects&amp;diff=201"/>
		<updated>2024-12-03T14:41:09Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Current Projects&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main The High-performance Integrated Virtual Environment (HIVE) platform]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
HIVE is a cloud-based environment optimized for the storage and analysis of extra-large data, such as biomedical data, clinical data, next-generation sequencing (NGS) data, mass spectrometry files, confocal microscopy images, post-market surveillance data, medical recall data, and many others. HIVE provides secure web access for authorized users to deposit, retrieve, annotate and compute on Big Data, and analyze the outcomes using web user interfaces. [https://docs.google.com/document/d/1F5iq00uKkJfdSsbwanvKOy-nPnwijH56mwbwa_HhzfY/edit?tab=t.0#heading=h.7dlfmngwfzih More here].&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.biocomputeobject.org/ BioCompute Objects (BCO)]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The BioCompute is FDA funded project to establish a framework for community-based development of standards for harmonization of High-throughput Sequencing (HTS), standardization of data formats, promotion of interoperability, and bioinformatics verification protocols. The BioCompute Object (BCO) was developed in the High-throughput Sequencing Computational Standards for Regulatory Sciences (HTS-CSRS) initiative in the BioCompute Objects Portal (BOP), a web portal to serve as a collaborative ground to encourage a dialogue to facilitate interoperability between different bioinformatic pipelines, industries, and developers. HIVE capabilities have been leveraged to support the development of the BCO. The BCO is versatile and adaptable to other common HTS analysis platforms. [https://docs.google.com/document/d/1WQFZm_PFiQXob4NyOKq6y-2ywnbmNoFHSS27fYf3l4Y/edit?tab=t.0#heading=h.bs8eki17tykx More here].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.glygen.org/ GlyGen]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
GlyGen (gly-glycobiology; gen-information), is an advanced glycoinformatics resource developed to facilitate discovery in basic and translational glycobiology research along with enhancing the integration of multidisciplinary information from diverse resources. GlyGen includes knowledge about molecular, biophysical and functional properties of glycans, genes, and proteins organized in pathways and ontologies, plus a rapidly growing body of biological big data related to cancer mutation and expression. GlyGen adopts an innovative user-driven approach for implementing, prioritizing and knowledge disseminating tools to address the questions and needs of glycobiology community.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.biochemistry.gwu.edu/predictmod PredictMod]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
PredictMod is an application designed to predict the outcome of an intervention prior to a patient initiating treatment. Our goal is to provide clinicians with a powerful decision making tool that enhances clinical understanding of patient-level data. The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition. While our primary condition of interest is Prediabetes, the tool is designed to be used for a variety of conditions, interventions, and data types.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[[GW-FEAST]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The GW Federated Ecosystems for Analytics and Standardized Technologies (GW-FEAST) project is part of the ARPA-H FEAST performer team initiative that includes academic and industry partners. The goal of the ARPA-H performer teams is “to create bridges across data silos to make health data more accessible and usable”.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.tst.biochemistry.gwu.edu/biomarker-partnership Biomarker Partnership]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The Biomarker Partnership is a CFDE sponsored project to develop a knowledgebase that will organize and integrate biomarker data from different public sources. The data will be connected to contextual information to show a novel systems-level view of biomarkers. The motivation for this project is to improve the harmonization and organization of biomarker data. This will be done by mapping biomarkers from public sources to, and across, CF data elements. This mapping will bridge knowledge across multiple DCCs and biomedical disciplines.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Past Projects&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hivelab.tst.biochemistry.gwu.edu/gfkb Gut Microbiome Analytic System (Microbiome)]&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The HIVE team received NSF funding to develop a Gut Microbiome Monitoring System (GutFeeling) as a tool which when used over time will allow users to rectify their dietary (such as consumption of probiotics and prebiotics) and other lifestyle habits and to help restore their normal microbiome. Rapid analysis of the large amount of metagenomic data, a major bottleneck, has been resolved by our group through the development of a novel algorithm and accompanying software called CensuScope. Through analysis of healthy gut microbiome data, we are actively developing a Knowledge Base (GutFeelingKB) to provide a clearer picture of not only an ideal personalized microbiome but also establish baseline characteristics for each customer. The Mazumder Lab is collaborating with the Milken School of Public Health and Kamtek Sequencing Facility to investigate the relationship between bacterial species commonly present in the digestive tract, diet, physical activity, lifestyle habits, and metabolic risk factors. [https://docs.google.com/document/d/18WyVTJrrf-FR0sHt634vO8Lwel-4OQxP9sNar7gYYro/edit?tab=t.0#heading=h.7qbm3f7lky31 More here].&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;HIVE-EQAPOL Project on HIVE NGS Data Processing and Analysis&amp;lt;/h3&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
For this project, our group works closely with the External Quality Assurance Program Oversight Laboratory (EQAPOL) team to conduct HIV NGS data analysis and collaborate in terms of analyzing, storing, and tracking HIV NGS Data. Reliable identification of strains is critical for developing new assays, validating assay platforms, assisting regulators to evaluate test kits, monitoring HIV drug resistance, and informing vaccine development. The HIVE tools and platform are used for virus identification, recombination analysis, and clone discovery.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://www.oncomx.org/ OncoMX]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
The OncoMX mission is to create an integrated cancer mutation and expression resource for exploring cancer biomarkers. OncoMX is a collaboration between the George Washington University (GW), NASA&#039;s Jet Propulsion Laboratory (JPL), the Swiss Institute of Bioinformatics (SIB), and the University of Delaware (UD). The core knowledgebase of OncoMX is derived from BioMuta and BioXpress integrated cancer mutation and expression databases. Normal expression data from Bgee and custom text mining software augment the cancer data to improve functional interpretation of the reported variants and expression profiles. All data are wrapped into the OncoMX database and web portal, mapped to additional functional information from NCI Early Detection Research Network (EDRN) and Reactome. It is expected that the large-scale integration of cancer data and supporting information, provided by OncoMX with direct community feedback, will benefit cancer research by improving synthesis of information and may make earlier detection a reality.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;h3&amp;gt;[https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main Glycoproteomics Characterization Workflow and Data-Analysis Pipeline for Vaccines and Biosimilars]&amp;lt;/h3&amp;gt;&lt;br /&gt;
	&amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
In this FDA funded project we are extending High-performance Integrated Virtual Environment (HIVE) capabilities through the development and integration of software tools and datasets for comparative analysis of glycoproteins. Glycomic analysis has many angles and has been extensively reviewed in recent literature. We propose to rely on the independent development of the glycomics field and incorporate these approaches in the HIVE pipeline as they mature while we develop a standardized glycoinformatics pipeline that will benefit investigators and regulators at the FDA.&lt;br /&gt;
        &amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=People&amp;diff=192</id>
		<title>People</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=People&amp;diff=192"/>
		<updated>2024-12-02T15:34:59Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;h2&amp;gt;GW Faculty&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;table&amp;gt;&lt;br /&gt;
    &amp;lt;tr&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;text-align: center; padding: 25px&amp;quot;&amp;gt;&lt;br /&gt;
            [[File:Raja-Mazumder.png|150px|thumb|&amp;lt;span style=&amp;quot;font-weight:bold;&amp;quot;&amp;gt;[https://apps.smhs.gwu.edu/smhs/facultydirectory/profile.cfm?empName=Raja%20Mazumder&amp;amp;FacID=2067473740| Raja Mazumder]&amp;lt;/span&amp;gt;, Professor]]&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
&amp;lt;td style=&amp;quot;text-align: center; padding: 25px&amp;quot;&amp;gt;&lt;br /&gt;
            [[File:Robel.Kahsay.jpg|150px|thumb|&amp;lt;span style=&amp;quot;font-weight:bold;&amp;quot;&amp;gt;[https://apps.smhs.gwu.edu/smhs/facultydirectory/profile.cfm?empName=Robel%20Kahsay&amp;amp;FacID=2051216059| Robel Kashay]&amp;lt;/span&amp;gt;, Asst Professor]]&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
&amp;lt;td style=&amp;quot;text-align: center; padding: 25px&amp;quot;&amp;gt;&lt;br /&gt;
            [[File:McNeely.Patrick.jpg|130px|thumb|&amp;lt;span style=&amp;quot;font-weight:bold;&amp;quot;&amp;gt;[https://apps.smhs.gwu.edu/smhs/facultydirectory/profile.cfm?empName=Patrick%20McNeely&amp;amp;FacID=2065037504&amp;amp;show=1| Pat McNeely]&amp;lt;/span&amp;gt;, Asst Professor]]&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
&amp;lt;td style=&amp;quot;text-align: center; padding: 25px&amp;quot;&amp;gt;&lt;br /&gt;
            [[File: Jonathon.Keeney.png|150px|thumb|&amp;lt;span style=&amp;quot;font-weight:bold;&amp;quot;&amp;gt;[https://apps.smhs.gwu.edu/smhs/facultydirectory/profile.cfm?empName=Jonathon%20Keeney&amp;amp;FacID=2056964816| Jonathon Keeney]&amp;lt;/span&amp;gt;, Asst Professor]]&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
    &amp;lt;/tr&amp;gt;&lt;br /&gt;
&amp;lt;/table&amp;gt;&lt;br /&gt;
&amp;lt;h2&amp;gt;HIVE Project Leads&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
    [https://orcid.org/0000-0001-8823-9945 Raja Mazumder] (HIVE Lab) - &lt;br /&gt;
    [https://apps.smhs.gwu.edu/smhs/facultydirectory/profile.cfm?empName=Raja%20Mazumder&amp;amp;FacID=2067473740 Faculty Bio]&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;Past HIVE Project Lead/s&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
    Vahan Simonyan (FDA-HIVE)&amp;lt;br&amp;gt;&lt;br /&gt;
    Konstantinos Karagiannis (FDA-HIVE)&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;Current GW HIVE Lab Members&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;table&amp;gt;&lt;br /&gt;
    &amp;lt;tr&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://github.com/seankim658 Sean Kim]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/daniall-masood Daniall Masood]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/kate-warner Kate Warner]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/tianyi-wang Tianyi Wang]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/emily-pennington Emily Pennington]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/jeet-vora Jeet Kiran Vora]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/lori-krammer Lori Krammer]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/raja-mazumder Raja Mazumder]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/urnisha-bhuiyan Urnisha Bhuiyan]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/christie-woodside Christie Woodside]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/jonathon-keeney Jonathon Keeney]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/patrick-mcneely Patrick McNeely]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/robel-kahsay Robel Kahsay]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
    &amp;lt;/tr&amp;gt;&lt;br /&gt;
&amp;lt;/table&amp;gt;&lt;br /&gt;
&amp;lt;h2&amp;gt;Current Volunteers and Part-time Members&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;table&amp;gt;&lt;br /&gt;
    &amp;lt;tr&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/vishal-bakshi Vishal Bakshi]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/miguel-mazumder Miguel Mazumder]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/maria-kim Maria Kim]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/reeya-gupta Reeya Gupta]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/cyrus-au-yeung Cyrus Chun Hong AU YEUNG]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/nikhil-aritheya Nikhil Aritheya]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
    &amp;lt;/tr&amp;gt;&lt;br /&gt;
&amp;lt;/table&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;Current FDA HIVE Members&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;table&amp;gt;&lt;br /&gt;
    &amp;lt;tr&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/alexander-lukyanov Alexander Lukyanov]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/anton-golikov Anton Golikov]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/ilya-mazo Ilya Mazo]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/luis-santana Luis Santana-Quintero]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/tigran-ghazanchyan Tigran Ghazanchyan]&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;[https://example.com/viswanadham-sridhara Viswanadham Sridhara]&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
    &amp;lt;/tr&amp;gt;&lt;br /&gt;
&amp;lt;/table&amp;gt;&lt;br /&gt;
&amp;lt;h2&amp;gt;Past Members&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;table&amp;gt;&lt;br /&gt;
    &amp;lt;tr&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Karina Martinez&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Joe Gergely&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Stephanie Singleton&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Jingyue Wu&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Daniel Lyman&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Millicent Quartey&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;April Yang&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Arya Eskandarian&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Chris Armstrong&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Deepika Prasad&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Dipankar Chattopadhyay&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Hasmik Manukyan&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Hayley Dingerdissen&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Jenna Murrow&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;John Dougherty&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Kamil Kural&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Krista Smith&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Lindsay Hopson&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Marianna Faradzheva&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Marla Surette&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Nagarajan Pattabiraman&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Nikhita Gogate&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Rahi Navelkar&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Sean Smith&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
        &amp;lt;td style=&amp;quot;padding-left: 20px; vertical-align: top;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;ul&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Sergey Ivanovsky&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Sydney Fenstermaker&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Vyacheslav Furtak&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Wei-Lun Alterovitz&amp;lt;/li&amp;gt;&lt;br /&gt;
                &amp;lt;li&amp;gt;Xiying Ding&amp;lt;/li&amp;gt;&lt;br /&gt;
            &amp;lt;/ul&amp;gt;&lt;br /&gt;
        &amp;lt;/td&amp;gt;&lt;br /&gt;
    &amp;lt;/tr&amp;gt;&lt;br /&gt;
&amp;lt;/table&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page/test&amp;diff=124</id>
		<title>Main Page/test</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page/test&amp;diff=124"/>
		<updated>2024-11-22T16:32:17Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;HIVE Test&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page/test&amp;diff=123</id>
		<title>Main Page/test</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page/test&amp;diff=123"/>
		<updated>2024-11-22T16:32:08Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: Created page with &amp;quot;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}} __NOTOC__ &amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt; &amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;     &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;         &amp;lt;div style=&amp;quot;font-size:160%; padding:.1e...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;HIVE Test,&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=122</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=122"/>
		<updated>2024-11-22T15:59:41Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/ HIVE Lab] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=121</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=121"/>
		<updated>2024-11-22T15:48:13Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/ HIVE Lab] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;br /&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
         &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;h3&amp;gt;[[GlyGen Webinar Series]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
            &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
               {{main|GlyGen Webinar Series}}The GlyGen project organizes public webinars on diverse topics, ranging from bioinformatics databases to new glycomics analysis techniques producing interesting data. Recordings of the talks are released on the GlyGen YouTube channel.&amp;lt;br /&amp;gt;&lt;br /&gt;
            &amp;lt;/div&amp;gt;&lt;br /&gt;
         &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=120</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=120"/>
		<updated>2024-11-22T15:47:56Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/ HIVE Lab] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;br /&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
         &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;h3&amp;gt;[[GlyGen Webinar Series]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
            &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
               {{main|GlyGen Webinar Series}}The GlyGen project organizes public webinars on diverse topics, ranging from bioinformatics databases to new glycomics analysis techniques producing interesting data. Recordings of the talks are released on the GlyGen YouTube channel.&amp;lt;br /&amp;gt;&lt;br /&gt;
            &amp;lt;/div&amp;gt;&lt;br /&gt;
         &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=119</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=119"/>
		<updated>2024-11-22T15:46:44Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/ HIVE Lab] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;br /&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
         &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;h3&amp;gt;[[GlyGen Webinar Series]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
            &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
               {{main|GlyGen Webinar Series}}The GlyGen project organizes public webinars on diverse topics, ranging from bioinformatics databases to new glycomics analysis techniques producing interesting data. Recordings of the talks are released on the GlyGen YouTube channel.&amp;lt;br /&amp;gt;&lt;br /&gt;
            &amp;lt;/div&amp;gt;&lt;br /&gt;
         &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         &amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
               &amp;lt;h3&amp;gt;[[GlyGen Workshops]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
               &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;{{main|GlyGen Workshops}}The GlyGen project organizes public workshops to learn more about GlyGen and glycoinformatics. The workshops are organised virtually and in-person. Recordings and material of the workshops are released on the GlyGen YouTube channel and also via the wiki page.&amp;lt;/div&amp;gt;&lt;br /&gt;
            &amp;lt;/div&amp;gt;&lt;br /&gt;
         &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
         &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;h3&amp;gt;[[GlyGen Internships]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
            &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
               {{main|GlyGen Internships}}The GlyGen project offers internships to undergraduate and graduate students in bioinformatics and in biology/biochemistry. These internships expose students to a wide variety of new concepts in bioinformatics and in biology/biochemistry while working on tasks and projects. The internship enable interns to develop their soft-skills and prepares them for the next stage of their career.&lt;br /&gt;
            &amp;lt;/div&amp;gt;&lt;br /&gt;
         &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[GlyGen Interns]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|GlyGen Internships}}The GlyGen project offers internships to undergraduate and graduate students in bioinformatics and in biology/biochemistry. Please refer to this for Interns FAQ and more information on GW Access and requirements.&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row2&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[GlyGen portal|GlyGen Portal]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|GlyGen portal}}&lt;br /&gt;
         The [[GlyGen portal]] is the web interface allowing browser based access to the GlyGen data. The portal supports searching for [[Glycan search|glycans]] and [[Protein search|proteins]], display of detailed [[Glycan details|glycan data]] and [[Protein details|protein data]] as well as export features to save the data to the local system. The portal also provides access to the other resources of the GlyGen project and digital resources such as posters, presentations and videos. &lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[GlyGen data repository|GlyGen Data Repository]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|GlyGen data repository}}GlyGen uses unique processing and integration workflow to retrieve and extract data from various resources. After standardization and harmonization, the high-quality datasets are created. The process of dataset creation is fully documented in BioCompute Objects.These datasets are further processed to create JSON objects and RDF triples that populate MongoDB docstore and Virtuoso triplestore backend databases respectively. The docstore is used by various GlyGen web services while the triplestore is accessed through the GlyGen SPARQL endpoint.&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[Frequently Asked Questions]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|Frequently Asked Questions}}&lt;br /&gt;
         The FAQ section contains a list of questions asked by users regarding using the portal, API and SPARQL endpoint as well as questions related to the data, data integration and quality control.&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[GlyGen web service API|GlyGen Web Service API]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|GlyGen web service API}}The GlyGen APIs allow programmatic access of the GlyGen data objects for glycans, proteins, and glycoproteins in the MongoDB docstore. The GlyGen APIs have been documented using the Swagger framework (https://swagger.io/). Some of these web services are generic and provide searching, listing and detailed record access functionalities for GlyGen data objects, while others are custom designed to respond to specific biological questions or use cases collected from the user community. &lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[GlyGen triplestore|GlyGen Triplestore]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|GlyGen triplestore}}The GlyGen knowledgebase uses a Virtuoso triplestore to store GlyGen triples data. The triplestore can be accessed through the GlyGen SPARQL endpoint.&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;h3&amp;gt;[[Glycan structure dictionary|Glycan Structure Dictionary]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
      &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
         {{main|Glycan structure dictionary}}The Glycan Structure Dictionary has been developed as a reference dictionary to provide a standardized list of widely used glycan terms that can help in the curation and mapping of glycan structures described in publications.&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=118</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=118"/>
		<updated>2024-11-22T15:45:17Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/ HIVE Lab] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;lt;br /&amp;gt;&lt;br /&gt;
   &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
      &amp;lt;div id=&amp;quot;ggw_row3&amp;quot; style=&amp;quot;display: flex; flex-flow: row wrap; justify-content: space-between; padding: 0; margin: 0 -5px 0 -5px;&amp;quot;&amp;gt;&lt;br /&gt;
         &amp;lt;div style=&amp;quot;flex: 1; margin: 5px; min-width: 210px; border: 1px solid #CCC;	padding: 0 10px 10px 10px; box-shadow: 0 2px 2px rgba(0,0,0,0.1); background: #f5faff;&amp;quot;&amp;gt;&lt;br /&gt;
            &amp;lt;h3&amp;gt;[[GlyGen Webinar Series]]&amp;lt;/h3&amp;gt;&lt;br /&gt;
            &amp;lt;div style=&amp;quot;border-top: 1px solid #CCC; padding-top: 0.5em;&amp;quot;&amp;gt;&lt;br /&gt;
               {{main|GlyGen Webinar Series}}The GlyGen project organizes public webinars on diverse topics, ranging from bioinformatics databases to new glycomics analysis techniques producing interesting data. Recordings of the talks are released on the GlyGen YouTube channel.&amp;lt;br /&amp;gt;&lt;br /&gt;
            &amp;lt;/div&amp;gt;&lt;br /&gt;
         &amp;lt;/div&amp;gt;&lt;br /&gt;
      &amp;lt;/div&amp;gt;&lt;br /&gt;
   &amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=117</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=117"/>
		<updated>2024-11-22T15:40:28Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/ HIVE Lab] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
The [https://hivelab.biochemistry.gwu.edu/ HIVE LAB wiki] provides information and links to resources developed by our group. Here are some key sections.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
	<entry>
		<id>https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=116</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://hivelab.biochemistry.gwu.edu/wiki/index.php?title=Main_Page&amp;diff=116"/>
		<updated>2024-11-22T15:39:37Z</updated>

		<summary type="html">&lt;p&gt;Hivelabwikiadmin: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:&amp;lt;span style=&amp;quot;position: absolute; clip: rect(1px 1px 1px 1px); clip: rect(1px, 1px, 1px, 1px);&amp;quot;&amp;gt;{{FULLPAGENAME}}&amp;lt;/span&amp;gt;}}&lt;br /&gt;
__NOTOC__&lt;br /&gt;
&amp;lt;!-- BANNER ACROSS TOP OF PAGE --&amp;gt;&lt;br /&gt;
&amp;lt;div id=&amp;quot;ggw-topbanner&amp;quot; style=&amp;quot;clear:both; position:relative; box-sizing:border-box; width:100%; margin:1.2em 0 6px; min-width:47em; border:1px solid #ddd; background-color:#f9f9f9; color:#000;&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;div style=&amp;quot;margin:0.4em; text-align:center;&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:160%; padding:.1em;&amp;quot;&amp;gt;Welcome to HIVE Wiki,&amp;lt;/div&amp;gt;&lt;br /&gt;
        &amp;lt;div style=&amp;quot;font-size:100%;&amp;quot;&amp;gt;The [https://www.mediawiki.org/wiki/MediaWiki MediaWiki] for the HIVE project. This wiki system provides complementary information to the [https://hivelab.biochemistry.gwu.edu/people] and is divided into the following main sections:&amp;lt;/div&amp;gt;&lt;br /&gt;
    &amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;clear: both;&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
The [https://hivelab.biochemistry.gwu.edu/ HIVE LAB wiki] provides information and links to resources developed by our group. Here are some key sections.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/projects Projects]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/publications Publications]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/people People]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Resources&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/tools Tools]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/datasets Datasets]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
[https://hivelab.biochemistry.gwu.edu/gallery Gallery]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;border: 1px solid #ccc; padding: 10px; background-color: #f9f9f9; border-radius: 5px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;More&#039;&#039;&#039;&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/opportunities Opportunities]&lt;br /&gt;
* [https://hivelab.biochemistry.gwu.edu/events Events]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hivelabwikiadmin</name></author>
	</entry>
</feed>