A modular tool to aggregate results from bioinformatics analyses across many samples into a single report.
/home/runner/work/pmultiqc/pmultiqc/data_temp
pmultiqc
pmultiqc is a MultiQC module to show the pipeline performance of mass spectrometry based quantification pipelines such as nf-core/quantms, MaxQuant, DIA-NN, and FragPipe.https://github.com/bigbio/pmultiqc
Experimental Design and Metadata
FragPipe Parameters
| No. | Parameter | Value |
|---|---|---|
| 1 | Enzyme | trypsin |
| 2 | Enzyme Cut Site | KR |
| 3 | Max Missed Cleavages | 2 |
| 4 | Precursor Mass Tolerance (Lower) | -20.0 |
| 5 | Precursor Mass Tolerance (Upper) | 20 |
| 6 | Precursor Mass Units | 1 |
| 7 | Fragment Mass Tolerance | 20 |
| 8 | Fragment Mass Units | 1 |
| 9 | Variable Modification 1 | 15.9949 M 3 |
| 10 | Variable Modification 2 | 42.0106 [^ 1 |
| 11 | Database Path | 2025-03-28-decoys-contam-mouse_modified.fasta.fas |
| 12 | Match Between Runs (MBR) | 1 |
| 13 | Normalization | 1 |
| 14 | Requantify | 1 |
| 15 | TMT Channels | TMT-6 |
| 16 | TMT Reference Tag | Bridge |
| 17 | Number of Threads | 4 # Number of CPU threads to use. |
| 18 | Decoy Prefix | rev_ # Prefix of the decoy protein entries. Used for parameter optimization only. |
| 19 | Isotope Error | -1/0/1/2/3 # Also search for MS/MS events triggered on specified isotopic peaks. |
| 20 | Mass Offsets | 0.0 # Creates multiple precursor tolerance windows with specified mass offsets. |
| 21 | Precursor True Tolerance | 20 # True precursor mass tolerance (window is +/- this value). |
| 22 | Precursor True Units | 1 # True precursor mass tolerance units (0 for Da 1 for ppm). |
| 23 | Calibrate Mass | 2 # Perform mass calibration (0 for OFF 1 for ON 2 for ON and find optimal parameters 4 for ON and find the optimal fragment mass tolerance). |
| 24 | Clip N-term Met | 1 # Specifies the trimming of a protein N-terminal methionine as a variable modification (0 or 1). |
| 25 | Min Peptide Length | 7 # Minimum length of peptides to be generated during in-silico digestion. |
| 26 | Max Peptide Length | 50 # Maximum length of peptides to be generated during in-silico digestion. |
Experimental Design
| No. | File Name | Experiment | BioReplicate | Data Type |
|---|---|---|---|---|
| 1 | mc38_neg1.mzML | mc38_neg | 1 | DDA |
| 2 | mc38_neg2.mzML | mc38_neg | 2 | DDA |
| 3 | mc38_neg3.mzML | mc38_neg | 3 | DDA |
| 4 | mc38_p1.mzML | mc38_p | 1 | DDA |
| 5 | mc38_p2.mzML | mc38_p | 2 | DDA |
| 6 | mc38_p3.mzML | mc38_p | 3 | DDA |
| 7 | mc38_pos1.mzML | mc38_pos | 1 | DDA |
| 8 | mc38_pos2.mzML | mc38_pos | 2 | DDA |
| 9 | mc38_pos3.mzML | mc38_pos | 3 | DDA |
Results Overview
Summary Table
| #Peptides Quantified | #Proteins Quantified |
|---|---|
| 7472 | 1089 |
HeatMap
Pipeline Result Statistics
| Sample Name | #Peptide IDs | #Unambiguous Peptide IDs | #Modified Peptide IDs | #Protein (group) IDs |
|---|---|---|---|---|
| mc38_neg1 | 2524 | 1147 | 424 | 487 |
| mc38_neg2 | 2710 | 1217 | 471 | 500 |
| mc38_neg3 | 2576 | 1173 | 408 | 468 |
| mc38_p1 | 1686 | 745 | 284 | 333 |
| mc38_p2 | 1492 | 656 | 259 | 292 |
| mc38_p3 | 1739 | 730 | 301 | 357 |
| mc38_pos1 | 3662 | 1754 | 744 | 628 |
| mc38_pos2 | 3634 | 1739 | 708 | 610 |
| mc38_pos3 | 3836 | 1834 | 768 | 639 |
Identification Summary
Number of Peptides identified Per Protein
ProteinGroups Count
Peptide ID Count [MBR gain: +72.99%]
Missed Cleavages
FragPipe: [Number of Missed Cleavages] number of potential enzymatic cleavage sites within the identified sequence.
Modifications
The plot will show percentages, i.e. is normalized by the total number of peptide sequences (where different charge state counts as a separate peptide) per Raw file. The sum of frequencies may exceed 100% per Raw file, since a peptide can have multiple modifications.
E.g. given three peptides in a single Raw file1. _M(Oxidation (M))LVLDEADEM(Oxidation (M))LNK_
2. _(Acetyl (Protein N-term))M(Oxidation (M))YGLLLENLSEYIK_
3. DPFIANGER
, the following frequencies arise:
* 33% of 'Acetyl (Protein N-term)'* 33% of 'Oxidation (M)'
* 33% of '2 Oxidation (M)'
* 33% of 'Unmodified'
Thus, 33% of sequences are unmodified, implying 66% are modified at least once. If a modification, e.g. Oxidation(M), occurs multiple times in a single peptide it's listed as a separate modification (e.g. '2 Oxidation (M)' for double oxidation of a single peptide).
FragPipe: Extracting information from the 'Assigned Modifications' column in psm.tsv. [Assigned Modifications] variable modifications (listed by mass in Da) with modified residue and location within the peptide.Peptide Length Distribution
FragPipe: psm.tsv ('Peptide Length': number of residues in the peptide sequence).
MaxQuant: evidence.txt ('Length': the length of the sequence stored in the column 'Sequence').
DIA-NN: report.tsv (the length of the 'Stripped.Sequence').
quantms: *.mzTab (the length of sequence).
Search Engine Scores
Summary of Hyperscore
[Hyperscore] Similarity score between observed and theoretical spectra, higher values indicate greater similarity.
Quantification Analysis
Protein Intensity Distribution
Peptide Intensity Distribution
FragPipe: Use the 'Intensity' column from psm.tsv and apply a log2 transformation.
MS2 and Spectral Stats
Charge-state
MS/MS Counts Per 3D-peak
[FragPipe: combined_ion.tsv] This plot shows the distribution of MS/MS spectral counts per ion/peak for each sample.
Mass Error Trends
Delta Mass [Da]
FragPipe: [Delta Mass] difference between calibrated observed peptide mass and calculated peptide mass (in Da).
RT Quality Control
IDs over RT
This plot allows to judge column occupancy over retention time. Ideally, the LC gradient is chosen such that the number of identifications (here, after FDR filtering) is uniform over time, to ensure consistent instrument duty cycles. Sharp peaks and uneven distribution of identifications over time indicate potential for LC gradient optimization. See [Moruz 2014, DOI: 10.1002/pmic.201400036](https://pubmed.ncbi.nlm.nih.gov/24700534/) for details.