# HG changeset patch # User galaxyp # Date 1516910608 18000 # Node ID 41c0c75301b3b44f2998710ae9212d1f15667500 # Parent d042eabcd6ec0c2633de711414351a4ad6b2300c planemo upload for repository https://github.com/galaxyproteomics/tools-galaxyp/tree/master/tools/flashlfq commit 59fca11aabb90cd875d93e7da8791a49e1e2c01a-dirty diff -r d042eabcd6ec -r 41c0c75301b3 flashlfq.xml --- a/flashlfq.xml Wed Dec 06 13:46:13 2017 -0500 +++ b/flashlfq.xml Thu Jan 25 15:03:28 2018 -0500 @@ -5,14 +5,14 @@ - + @@ -48,15 +49,35 @@ - - - - + + + + + + + + + + + + + + + + + + + + - + @@ -91,6 +112,35 @@ --nis [int|number of isotopes required to be observed] +**Tab-Delimited Identification Text File** + +The first line of the text file should contain column headers identifying what each column is. Note that MetaMorpheus (.psmtsv), Morpheus, MaxQuant (msms.txt), and TDPortal tab-delimited column headers are supported natively and such files can be read without modification. For search software that lists decoys and PSMs above 1% FDR (e.g., MetaMorpheus), you may want to remove these prior to FlashLFQ analysis. FlashLFQ will probably crash if ambiguous PSMs are passed into it (e.g., a PSM with more than 2 peptides listed in one line). + +The following headers are required in the list of MS/MS identifications: + + - **File Name** - File extensions should be tolerated, but no extension is tested more extensively (e.g. use MyFile and not MyFile.mzML) + - **Base Sequence** - Should only contain amino acid sequences, or it will likely result in a crash + - **Full Sequence** - Modified sequence. Can contain any letters, but must be consistent between the same peptidoform to get accurate results + - **Peptide Monoisotopic Mass** - Theoretical monoisotopic mass, including modification mass + - **Scan Retention Time** - MS/MS identification scan retention time + - **Precursor Charge** - Charge of the ion selected for MS/MS resulting in the identification + - **Protein Accession** - Protein accession(s) for the peptide; protein quantification is still preliminary + + +**Outputs**: + + - **QuantifiedProteins.tsv** - Protein intensities are summed here within a run. + + - **QuantifiedPeaks.tsv** - Each chromatographic peak is shown here, even peaks that were not quantifiable (peak intensity = 0). Details about each peak, such as number of PSMs mapped, start/apex/end retention times, ppm error, etc are contained in this file. A peptide can have multiple peaks over the course of a run (e.g., oxidized peptidoforms elute at different times, etc). Ambiguous peaks are displayed with a | (pipe) delimiter to indicate more than one peptide mapped to that peak. + + - **QuantifiedModifiedSequences.tsv** - Similar to QuantifiedBaseSequences, but instead of being summed by Base Sequence, peptide intensities are summed by modified sequence; this makes it convenient to compare modified peptidoform intensities across runs. + + - **QuantifiedBaseSequences.tsv** - Peptide intensities are summed here within a run (including differently-modified forms of the same amino acid sequence) and displayed in a convenient format for comparing across runs. The identification type (MS/MS or MBR) is also indicated. A peptide with more than 30% of its intensity coming from ambiguous peak(s) is considered not quantifiable and is given an intensity of -1. + + + - **Log.txt** - Log of the FlashLFQ run. Includes timestamps and quantification time for each file, total analysis time, directories used, and settings. + + ]]> 10.1021/acs.jproteome.7b00608 diff -r d042eabcd6ec -r 41c0c75301b3 test-data/._aggregatePSMs_5ppmAroundZero.psmtsv Binary file test-data/._aggregatePSMs_5ppmAroundZero.psmtsv has changed diff -r d042eabcd6ec -r 41c0c75301b3 test-data/._sliced-mzml.mzML Binary file test-data/._sliced-mzml.mzML has changed