PredictMod Machine-Learning Pipeline Tutorial: Difference between revisions

Revision as of 20:53, 5 March 2025

What is PredictMod?

The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition.

Additional Resources:

<a href="https://hivelab.biochemistry.gwu.edu/predictmod/about">About us</a>
<a href="https://hivelab.biochemistry.gwu.edu/predictmod/faq">Frequently Asked Questions (FAQ)</a>
<a href="https://hivelab.biochemistry.gwu.edu/predictmod/contact">Contact us</a>

Login & Registration

How to register with PredictMod

Individuals interested in creating a PredictMod account should do so through the Login page. If you have any questions, please contact us at mazumder_lab@gwu.edu.

Login & registration FAQs

Who can register for a PredictMod account?

Any clinicians, and researchers, or other individuals interested in using the tool are invited to register.

How long does registration last?

Registered accounts do not expire. You will be logged out after 24 hours of each login.

Will the system store patient data?

The data will not be saved in the system. PredictMod will use uploaded patient data to make a one-time prediction.

Can I view a history of my predictions?

Prediction results are not saved, so a prediction history is not available.

'Try it Out' Example Query Builder

How the example query builder works

The example query builder is distinct from the standard query builder in that only the example prediction file is accepted. You cannot upload your own single-patient files into the example query builder and obtain a prediction.

You can build an example query by selecting the condition, intervention, and data type of interest to you. Based on these selections, an example data file will be available for download.

You should use example data files as a template for your own data upload.

Download example data

Example data files for each model are provided in the query builder.

Current Models

Current and anticipated models are shown on the Models page.

Query Builder

How the query builder works

The query builder determines the appropriate model to use for a prediction, based on the desired condition, intervention, and data type. Please follow the prompts on the Query Builder page to make your selections. Descriptions of the conditions, interventions, and data types are documented within each Model's BioCompute Object (BCO). You will then be able to upload your own file or download an example file. The uploaded file must meet the formatting requirements associated with the chosen model. For information on formatting, please review the sample data, FAQs, or contact our team.

Formatting FAQs

What file types are accepted?

Comma separated values (csv) or excel workbook (xlsx) files are accepted.

How do I know if my file is not formatted correctly?

An incorrectly formatted file will return an error message when the 'Run a Prediction' option is selected.

If you receive a formatting error message and you are unsure why, please contact our team and we can work with you to resolve the issue.

Run a prediction

Once a correctly formatted file has been uploaded, select ‘Run Prediction’ to view your results.

Interpreting a prediction result

Responder vs. Non-Responder outcomes

PredictMod will provide a prediction categorized as either Responder or Non-Responder. The outcomes associated with the response status vary for each model, though a Responder result is generally associated with a positive health outcome, and the Non-Responder result is generally associated with a negative health outcome.

Data visualization examples and interpretations

The primary data visualization tool is a SHAP force plot. Shapley Value originates from game theory and involves the fair distribution of reward based on the degree of contribution of each player. This can be utilized in precision medicine to identify the key “players” or features that contribute to a given prediction. The SHAP (SHapley Additive exPlanations) Force Plot leverages this ideology to provide Explainable AI with respect to the single-patient predictions made by PredictMod. Each plot for a given prediction not only indicates the most influential features but highlights whether that feature pushes the prediction higher (in red) or lower (in blue). The consideration of the features and their values leads to a score, where higher scores indicate a prediction of 1, or NR and lower scores a prediction of 0, or R. SHAP Force Plots also indicate the degree of feature impact based on proximity to the boundary line where the red and blue bars meet. The closer to the dividing boundary, the more impact that feature had on the patient’s prediction.

Run another prediction

Selecting 'Run Another Prediction' will return you to the query builder page, where you can complete steps 1 through 4 to run a new prediction. As a reminder, no patient data is stored in the PredictMod server, so running another prediction will erase any currently displayed results.

Upload a Model

PredictMod is a collaborative space for researchers to upload their intervention-based models and performance metrics. These models are freely available to users and commercial entities under the CC BY 4.0 license. While our current focus is Prediabetes, the platform allows for multiple models to overlap among conditions and interventions. Researchers can upload their model and relevant documentation directly to PredictMod to make it freely available to users.

@@ Line 1: / Line 1: @@
 <small> Go Back to [[PredictMod|PredictMod Project]]. </small>
-<h2>ML Tutorial </h2>
+<h3>Table of Contents</h3>
+<br>
+<h2>What is PredictMod?</h2>
+<p>The PredictMod platform utilizes machine learning tools and complex datasets based on electronic health records, gut microbiome, and -omics data to forecast patient outcomes, often in response to treatment for a particular condition.</p>
+<br>
+<h4>Additional Resources:</h4>
+<ul>
+    <li><a href="https://hivelab.biochemistry.gwu.edu/predictmod/about">About us</a></li>
+    <li><a href="https://hivelab.biochemistry.gwu.edu/predictmod/faq">Frequently Asked Questions (FAQ)</a></li>
+    <li><a href="https://hivelab.biochemistry.gwu.edu/predictmod/contact">Contact us</a></li>
+</ul>
+<br>
+<h2>Login & Registration</h2>
+<h3>How to register with PredictMod</h3>
+<p>Individuals interested in creating a PredictMod account should do so through the Login page. If you have any questions, please contact us at mazumder_lab@gwu.edu.</p>
+<br>
+<h3>Login & registration FAQs</h3>
+<br>
+<h4>Who can register for a PredictMod account?</h4>
+<p>Any clinicians, and researchers, or other individuals interested in using the tool are invited to register.</p>
+<h4>How long does registration last?</h4>
+<p>Registered accounts do not expire. You will be logged out after 24 hours of each login.</p>
+<h4>Will the system store patient data?</h4>
+<p>The data will not be saved in the system. PredictMod will use uploaded patient data to make a one-time prediction.</p>
+<h4>Can I view a history of my predictions?</h4>
+<p>Prediction results are not saved, so a prediction history is not available.</p>
+<br>
+<h2>'Try it Out' Example Query Builder</h2>
+<h3>How the example query builder works</h3>
+<p>The example query builder is distinct from the standard query builder in that only the example prediction file is accepted. You cannot upload your own single-patient files into the example query builder and obtain a prediction.</p>
+<p>You can build an example query by selecting the condition, intervention, and data type of interest to you. Based on these selections, an example data file will be available for download.</p>
+<p>You should use example data files as a template for your own data upload.</p>
+<br>
+<h3>Download example data</h3>
+<p>Example data files for each model are provided in the query builder.</p>
+<br>
+<h2>Current Models</h2>
+<p>Current and anticipated models are shown on the Models page.</p>
+<br>
+<h2>Query Builder</h2>
+<br>
+<h3>How the query builder works</h3>
+<p>The query builder determines the appropriate model to use for a prediction, based on the desired condition, intervention, and data type. Please follow the prompts on the Query Builder page to make your selections. Descriptions of the conditions, interventions, and data types are documented within each Model's BioCompute Object (BCO). You will then be able to upload your own file or download an example file. The uploaded file must meet the formatting requirements associated with the chosen model. For information on formatting, please review the sample data, FAQs, or contact our team.</p>
+<br>
+<h3>Formatting FAQs</h3>
+<br>
+<h4>What file types are accepted?</h4>
+<p>Comma separated values (csv) or excel workbook (xlsx) files are accepted.</p>
+<h4>How do I know if my file is not formatted correctly?</h4>
+<p>An incorrectly formatted file will return an error message when the 'Run a Prediction' option is selected.</p>
+<p>If you receive a formatting error message and you are unsure why, please contact our team and we can work with you to resolve the issue.</p>
+<br>
+<h3>Run a prediction</h3>
+<p>Once a correctly formatted file has been uploaded, select ‘Run Prediction’ to view your results.</p>
+<h3>Interpreting a prediction result</h3>
+<h4>Responder vs. Non-Responder outcomes</h4>
+<p>PredictMod will provide a prediction categorized as either Responder or Non-Responder. The outcomes associated with the response status vary for each model, though a Responder result is generally associated with a positive health outcome, and the Non-Responder result is generally associated with a negative health outcome.</p>
+<h4>Data visualization examples and interpretations</h4>
+<p>The primary data visualization tool is a SHAP force plot. Shapley Value originates from game theory and involves the fair distribution of reward based on the degree of contribution of each player. This can be utilized in precision medicine to identify the key “players” or features that contribute to a given prediction. The SHAP (SHapley Additive exPlanations) Force Plot leverages this ideology to provide Explainable AI with respect to the single-patient predictions made by PredictMod. Each plot for a given prediction not only indicates the most influential features but highlights whether that feature pushes the prediction higher (in red) or lower (in blue). The consideration of the features and their values leads to a score, where higher scores indicate a prediction of 1, or NR and lower scores a prediction of 0, or R. SHAP Force Plots also indicate the degree of feature impact based on proximity to the boundary line where the red and blue bars meet. The closer to the dividing boundary, the more impact that feature had on the patient’s prediction.</p>
+<br>
+<h3>Run another prediction</h3>
+<p>Selecting 'Run Another Prediction' will return you to the query builder page, where you can complete steps 1 through 4 to run a new prediction. As a reminder, no patient data is stored in the PredictMod server, so running another prediction will erase any currently displayed results.</p>
+<br>
+<h2>Upload a Model</h2>
+<p>PredictMod is a collaborative space for researchers to upload their intervention-based models and performance metrics. These models are freely available to users and commercial entities under the CC BY 4.0 license. While our current focus is Prediabetes, the platform allows for multiple models to overlap among conditions and interventions. Researchers can upload their model and relevant documentation directly to PredictMod to make it freely available to users.</p>
+<br>