How To Use The Prophane Webserver
Discover a Comprehensive Tutorial on Using the Prophane Web Service Hands-On Tutorial: Hands-On Tutorial: Standard Analysis Expert Mode Default Settings for standard Analysis: Hands-On Tutorial: Step by Step Instruction: Setting Up Your Prophane Job Report file FASTA File Exclude Hands-On Tutorial: Setting Up Your Prophane Job Step by Step Instruction: Creating Sample Groups in Prophane Hands-On Tutorial: Creating Sample Groups in Prophane Step by Step Instruction: Quantification Method Hands-On Tutorial: Quantification Method Step by Step Instruction: Taxonomic Annotation Taxonomic Annotation in Prophane Taxonomic annotation in Prophane involves a search within the selected reference database to find proteins that bear similarity to those present in your analysis. Subsequently, Prophane retrieves the associated taxon ID and outlines the entire taxonomic lineage of the protein that exhibits the closest resemblance to the query. Parameters for Taxonomic Annotation in Prophane E-Value: The E-value parameter defines the threshold for similarity for reported proteins. Smaller E-values indicate more stringent similarity criteria, resulting in reporting only very closely matching proteins. Conversely, larger E-values, referred to as the "relaxed" setting, expand the reporting to include more matches with varying degrees of similarity. Advanced Options: For comprehensive information on additional parameters and their functionalities, please refer to the DIAMOND documentation. This resource provides in-depth insights into the various options available for optimizing your taxonomic annotation analysis in Prophane. Hands-On Tutorial:: Taxonomic Annotation Step by Step Instruction: Functional Annotation Functional Annotation in Prophane Functional annotation in Prophane involves a search within the selected functional database to find proteins that bear similarity to those present in your analysis. Subsequently, Prophane retrieves the annotated functions and outlines different functional annotation levels of the protein that exhibits the closest resemblance to the query. Parameters for Functional Annotation in Prophane E-Value: The E-value parameter defines the threshold for similarity for reported proteins. Smaller E-values indicate more stringent similarity criteria, resulting in reporting only very closely matching proteins. Conversely, larger E-values, referred to as the "relaxed" setting, expand the reporting to include more matches with varying degrees of similarity. Advanced Options: For comprehensive information on additional parameters and their functionalities, please refer to the EggNog documentation and HAMMr documentation This resources provide in-depth insights into the various options available for optimizing your functional annotation analysis in Prophane. Hands-On Tutorial:: Taxonomic Annotation Step by Step Instruction: Custom Map Annotation Custom Map Annotation in Prophane Prophane allows users to upload custom maps for taxonomic or functional annotations. These user-specified custom maps enable the mapping of identified protein accessions to specific taxonomic or functional information. The "acc2annot_mapper" within Prophane matches annotations based on protein accessions. Hands-On Tutorial: Custom Map Annotation Step by Step Instruction: Lowest Common Ancestor LCA in Prophane The LCA approach searches hierarchical data to find the lowest common node shared by all members of a group. For every functional or taxonomical annotation at each annotation level, Prophane determines an LCA to represent all protein group members. Often, proteins within a group or metaproteins have different annotations. To obtain a unified representative for all members, Prophane determines the lowest common ancestor among them. If this is not possible the assigned LCA-value is referred to as "various". Two different methods are available for LCA determination: Advanced Options ignore_unclassified The advanced option "ignore_unclassified" only takes into account for LCA-determination proteins for which an annotation was found (no "unclassified" annotations). minimum_number_of_annotations The "minimum_number_of_annotations" advanced option in Prophane empowers users to specify a minimum count of annotations within the annotation lineage that will be taken into account by the LCA methods. This feature proves especially useful for effectively managing annotations that might be considered extraneous or less informative. Hands-On Tutorial:: Lowest Common Ancestor Starting Your Prophane Analysis The summary allows you to ensure that your input files and task settings are accurately configured to meet your analysis requirements. Once you've verified your inputs and settings, proceed by clicking the 'Submit' button to initiate the analysis. Hands-On Tutorial: Accessing Your Prophane Analysis Results Hands-On Tutorial:
Additionally, delve into the wealth of knowledge found in the publication titled A complete and flexible workflow for metaproteomics data analysis based on MetaProteomeAnalyzer and Prophane. This publication serves as a comprehensive resource for in-depth understanding.
The initial segment of each step's description begins with a concise overview, outlining the actions to be undertaken, followed by a meticulous, step-by-step guide on defining your analysis. Subsequently, a distinct pink box presents the specific steps employed for the tutorial test analysis.
You have the option to either log in and create a user account using 'Login with Google,' or you can proceed without logging in.
The benefits of setting up a user account via 'Login with Google' include unrestricted access to all analyses you've ever performed.
If you choose to continue without logging in, you can provide your email address to receive notifications with a direct link to the analysis results. Alternatively, if you prefer not to log in and don't wish to provide an email address, you can manually copy the result URL for downloading and access to status information.
The minimum required input for a Prophane analysis includes a proteomic search result file and the corresponding matching FASTA file. To specify the format of the proteomic search result file, use the 'Source' option. The proteomic research file should be uploaded under 'Report file' and the protein sequence database under 'FASTA File'.
Once all the necessary input files have been selected, and you intend to perform a standard analysis exclusively, simply proceed to the next step and initiate the analysis by clicking the 'Submit' button.
It also accommodates data produced by search software such as:
Descriptions on how to generate report files can be found in "About Prophane", "Input Files".
If you intend to use a different input format regularly, please contact us via email at support@prophane.de.
For example, if you have contamination proteins with accessions starting with "CON_," select "Accessions starting with" in the exclude field and enter "CON_."
The mean quantification values for each sample group will be included in the final output, which consists of the "summary.csv" table and the "lca_summary.mztab" file. These values are also visualized in the Krona-Plots. Additionally, the quantification values for each individual sample will be reported. However, if no sample groups are specified, quantification will be performed separately for each individual sample.
This flexibility in sample grouping and quantification allows you to tailor your analysis to your specific experimental design and research goals.
Please note that Prophane extracts sample names from various protein reports, so it's essential to use the exact same sample names as they appear in these reports to ensure accurate grouping and analysis. The "Sample Group" part of the Prophane documentation describes how Prophane extract sample names from different protein reports.
Default setting: NSAF (normalized to longest metaprotein sequence). Proteome Discoverer Protein Group Reports: Special Consideration
It's important to note that the quantification in Proteome Discoverer protein group reports is not based on spectral counts, and the abundances are already normalized within these reports. Therefore, when working with Proteome Discoverer protein group reports, select the "raw" option to preserve the existing normalization.
For more detailed information about the different normalization methods and when to use them, you can refer to the "Quantification" part of the Prophane documentation describing these methods in greater detail.
The search algorithm employed in Prophane is DIAMOND BLASTP. This powerful algorithm is responsible for identifying sequence homologies within the specified protein database.
Prophane supports following protein sequence databases:
For more detailed information about the DIAMOND BLASTP algorithms please crefer to the Prophane About page. You can explore different protein databases and their unique characteristics by consulting the "Annotation Databases & Algorithms & Parameters" part of the Prophane Documentation.
/home/jules/Documents/Metaproteomics/Prophane/prophane_tutorial_img/Custom_map_2.png /home/jules/Documents/Metaproteomics/Prophane/prophane_tutorial_img/Custom_map_1.png
The search algorithm employed in Prophane is hmmscan, hmmsearch or emapper.
Prophane supports following protein functional databases:
For more detailed information about the different functional databases please crefer to the Prophane About page. You can explore different protein databases and their unique characteristics by consulting the "Annotation Databases & Algorithms & Parameters" part of the Prophane Documentation.
One common use case for custom maps is assigning taxa to protein groups in experiments with a known sample composition. Custom maps containing protein accessions and species lineages allow for tailored annotation, even excluding species with similar sequences.
Annotations are extracted from a user-provided tab-separated table (TSV) file. The names of the annotation levels are derived from the column names in the user-provided table, which serves as the basis for annotation. It's important to avoid using spaces, commas, tabulators, and semicolons in the column names.
For more detailed information about the format specifications for custom maps, please refer to the "Custom Maps" part of the Prophane Documentation.
If you want to use your custom map, first add a new custom map task:
Name your tasks and choose custom map file for upload:
The "LCA per group" method determines the LCA based on annotations of proteins within a protein group. It allows users to set a threshold value for each group, ranging from 0 to 1.A threshold value of 1 returns an LCA only if all annotations of all protein group members match; otherwise, "various" is returned. For example, with a threshold value of 0.51, Prophane returns an LCA if more than half of the annotations are the same.
The "democratic LCA" method selects the annotation that occurs most frequently across all protein groups from each protein group.
For instance, when dealing with annotations like vector annotation that typically consist of lineage with only one-level annotations, this option allows users to filter out these less informative annotations, ensuring a more meaningful and accurate LCA determination process.
For more detailed information about LCA determination and further explanations, please refer to the provided "LCA" part of the Prophane documentation.
Your files are uploaded to the Prophane server, and the analysis process begins. Please note that a typical Prophane run may take several hours.
All submitted Prophane jobs are queued. The time it takes to start your Prophane job depends on your position in the queue. Please be patient.
Once your Prophane job is successfully completed, you will receive an email notification if you are logged in to your account or if you've provided an email address. If you are not logged in, you can utilize the provided link to check the Prophane job status.
To access and download the analysis results, you have two options: you can either log in to your user account and use the Job Control panel, or you can use the download link you stored when submiiting the prophane job (same link is provided in the email notification ).
In the Job Control panel, accessible to registered users, you can view all your submitted jobs. This panel provides real-time updates on the status of your jobs, which may include being in a queue, running, completed, or marked as failed.
Once your Prophane job is ready, you'll find a download link under the "Results" section, allowing you to retrieve your analysis results. Additionally, a link to view the resulting Krona plots in your web browser is provided.
The Job Control panel offers an organized and efficient way to manage your Prophane jobs and access the outcomes of your analyses.
For more detailed information about Prophane results and further explanations, please refer to the provided "Result" part of the Prophane documentation.
These plots can be found in the "plots" folder.
The Krona plots provide an interactive visualization of the annotation results, more precisely from the lca results of the metaproteins . They specifical desined for the vizualization of metabiome analyses. The size of the boxes corresponds to the percentage of the summed quantification results of all groups sharing the same LCA of the corresponding level. In the middle you find the top level annotation. For taxonomic annotation this corresponds to the superkingdom level. The further out you go, the more specific the annotations level and the more often you see various. You can zoom into the Krona Plot and zoom out. And you can change the showed results between sample groups and samples.
Load the summary.txt into excel and examine the results:
Check the Krona Plot: