# HG changeset patch # User bgruening # Date 1478720193 18000 # Node ID 383e9f2eaa6cf69d169e2ab8975d2b3e082c3b52 # Parent d5dbba73786bb74194cd03f23d98044c6072925b planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/trim_galore commit 93babf3018c7291d11206999ba82750f8def33c6 diff -r d5dbba73786b -r 383e9f2eaa6c trim_galore.xml --- a/trim_galore.xml Tue Mar 29 12:20:39 2016 -0400 +++ b/trim_galore.xml Wed Nov 09 14:36:33 2016 -0500 @@ -1,15 +1,15 @@ - adaptive quality and adapter trimmer + Quality and adapter trimmer of reads - + - + @@ -24,6 +24,7 @@ + ^[ACTGNactgn]*$ @@ -35,10 +36,10 @@ This may remove some unwanted bias from the 3' end that is not directly related to adapter sequence or basecall quality. (--three_prime_clip_R1) - + Instructs Trim Galore! to remove N bp from the 3' end of read 2 after adapter/quality trimming has been performed. This may remove some unwanted bias from - the 3' end that is not directly related to adapter sequence or basecall quality. (--three_prime_clip_R2) + the 3' end that is not directly related to adapter sequence or basecall quality. @@ -52,6 +53,7 @@ $report_file; #end if + ]]> @@ -217,6 +222,7 @@ + Instructs Trim Galore! to remove N bp from the 3' end of read 1 after adapter/quality trimming has been performed. This may remove some unwanted bias from the 3' end that is not directly related to adapter sequence or basecall quality. @@ -234,6 +240,7 @@ + @@ -242,14 +249,15 @@ - - + - - + + + @@ -264,6 +272,7 @@ + @@ -278,7 +287,7 @@ + label="Screen quality-trimmed sequences for 'CAA' or 'CGA' at the start of the read and, if found, removes the first two basepairs" /> @@ -287,11 +296,13 @@ singlePaired['sPaired'] == "single" + singlePaired['sPaired'] == "paired_collection" + @@ -299,20 +310,24 @@ params['retain_unpaired']['retain_unpaired_select'] == "retain_unpaired_output" singlePaired['sPaired'] == "paired_collection" + singlePaired['sPaired'] == "paired" + singlePaired['sPaired'] == "paired" + params['settingsType'] == "custom" params['retain_unpaired']['retain_unpaired_select'] == "retain_unpaired_output" singlePaired['sPaired'] == "paired" + params['settingsType'] == "custom" @@ -323,6 +338,7 @@ params['settingsType'] == "custom" params['report'] == True + @@ -402,7 +418,142 @@ .. _Trim Galore!: http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/ It is developed by Felix Krueger at the Babraham Institute. -]]> + +---- + +**Main Settings** + +* **Adapter sequence to be trimmed** + + * **Automatic detection** + + | Adapter sequence to be trimmed. Trim Galore will try to auto-detect whether the Illumina universal, Nextera transposase or Illumina small RNA adapter sequence was used. + + * **Illumina universal** + + | Adapter sequence to be trimmed is the first 13bp of the Illumina universal adapter 'AGATCGGAAGAGC' instead of the default auto-detection of adapter sequence. + | + | *option --illumina* + + * **Nextera transposase** + + | Adapter sequence to be trimmed is the first 12bp of the Nextera adapter 'CTGTCTCTTATA' instead of the default auto-detection of adapter sequence. + | + | *option --nextera* + + * **Illumina small RNA adapters** + + | Adapter sequence to be trimmed is the first 12bp of the Illumina Small RNA 3' Adapter 'TGGAATTCTCGG' instead of the default auto-detection of adapter sequence. Selecting to trim smallRNA adapters will also lower the --length value to 18bp. If the smallRNA libraries are paired-end then -a2 will be set to the Illumina small RNA 5' adapter automatically (‘GATCGTCGGACT’) unless -a 2 had been defined explicitly. + | + | *option --small_rna* + + * **User defined adapter trimming** + + | Adapter sequence to be trimmed is the sequence entered by the user instead of the default auto-detection of adapter sequence. + | + | *option -a* + +* **If Single-End Reads** + + * **Remove bp from the 3' end** + + | Instructs Trim Galore to remove bp from the 3' end of read 1 (or single-end reads) AFTER adapter/quality trimming has been performed. This may remove some unwanted bias from the 3' end that is not directly related to adapter sequence or basecall quality. Default: OFF. + | + | *option --three_prime_clip_R1* + +* **If Paired-End Reads** + + * **Trims 1 bp off every read from its 3' end** + + | This may be needed for FastQ files that are to be aligned as paired-end data with Bowtie. This is because Bowtie (1) regards alignments like this: + | + | R1 ---------------------------> + | R2 <--------------------------- + | + | or this: + | + | R1 -----------------------> + | R2 <----------------- + | + | as invalid (whenever a start/end coordinate is contained within the other read). + | + | *option --t* + + * **Remove bp from the 3' end of read 1** + + | Instructs Trim Galore to remove bp from the 3' end of read 1 (or single-end reads) AFTER adapter/quality trimming has been performed. This may remove some unwanted bias from the 3' end that is not directly related to adapter sequence or basecall quality. Default: OFF. + | + | *option --three_prime_clip_R1* + + * **Remove bp from the 3' end of read 2** + + | Instructs Trim Galore to remove bp from the 3' end of read 2 AFTER adapter/quality trimming has been performed. This may remove some unwanted bias from the 3' end that is not directly related to adapter sequence or basecall quality. Default: OFF. + | + | *option --three_prime_clip_R2* + +---- + +**Advanced Settings** + +* **Trim low-quality ends from reads in addition to adapter removal** + + | For RRBS samples, quality trimming will be performed first, and adapter trimming is carried in a second round. Other files are quality and adapter trimmed in a single pass. The algorithm is the same as the one used by BWA (Subtract from all qualities; compute partial sums from all indices to the end of the sequence; cut sequence at the index at which the sum is minimal). Default Phred score: 20. + | + | *option -q* + +* **Overlap with adapter sequence required to trim a sequence** + + | Defaults to a very stringent setting of '1', i.e. even a single bp of overlapping sequence will be trimmed of the 3' end of any read. + | + | *option -s* + +* **Maximum allowed error rate** + + | (no. of errors divided by the length of the matching region) (default: 0.1). + | + | *option -e* + +* **Discard reads that became shorter than length ** + + | because of either quality or adapter trimming. A value of '0' effectively disables this behaviour. Default: 20 bp. + | + | For paired-end files, both reads of a read-pair need to be longer than bp to be printed out to validated paired-end files (see option --paired). If only one read became too short there is the possibility of keeping such unpaired single-end reads (see --retain_unpaired). Default pair-cutoff: 20 bp. + | + | *option --length* + +* **Instructs Trim Galore! to remove INT bp from the 5' end of read 1** + + | Instructs Trim Galore to remove bp from the 5' end of read 1 (or single-end reads). This may be useful if the qualities were very poor, or if there is some sort of unwanted bias at the 5' end. Default: OFF. + | + | *option --clip_R1* + +* **Instructs Trim Galore! to remove INT bp from the 5' end of read 2** + + | Instructs Trim Galore to remove bp from the 5' end of read 2 (paired-end reads only). This may be useful if the qualities were very poor, or if there is some sort of unwanted bias at the 5' end. For paired-end BS-Seq, it is recommended to remove the first few bp because the end-repair reaction may introduce a bias towards low methylation. Please refer to the M-bias plot section in the Bismark User Guide for some examples. Default: OFF. + | + | *option --clip_R2* + +* **Specify if you would like to retain unpaired reads** + + | If only one of the two paired-end reads became too short, the longer read will be written to either '.unpaired_1.fq' or '.unpaired_2.fq' output files. The length cutoff for unpaired single-end reads is governed by the parameters -r1/--length_1 and -r2/--length_2. Default: OFF. + | + | *option --retained_unpaired* + +---- + +**RRBS specific settings** + +* **Specifies that the input file was an MspI digested RRBS sample (recognition site: CCGG)** + + | Sequences which were adapter-trimmed will have a further 2 bp removed from their 3' end. This is to avoid that the filled-in C close to the second MspI site in a sequence is used for methylation calls. Sequences which were merely trimmed because of poor quality will not be shortened further. + | + | *option -rrbs* + +* **Screen quality-trimmed sequences for 'CAA' or 'CGA' at the start of the read and, if found, removes the first two basepairs** + + | Selecting this option for non-directional RRBS libraries will screen quality-trimmed sequences for 'CAA' or 'CGA' at the start of the read and, if found, removes the first two basepairs. Like with the option '--rrbs' this avoids using cytosine positions that were filled-in during the end-repair step. '--non_directional' requires '--rrbs' to be specified as well. + | + | *option --non_directional*]]>