# HG changeset patch # User matthias # Date 1557239434 14400 # Node ID 37305bbf8f44aeaa557a76105b45bbd3718258de # Parent ea57169fd0d02301fde25bc6861d2f6f28a2e1df planemo upload for repository https://github.com/bernt-matthias/mb-galaxy-tools/tree/topic/dada2/tools/dada2 commit 72e787011111e0262b8161c3834a6258d7b667bc diff -r ea57169fd0d0 -r 37305bbf8f44 dada2_filterAndTrim.xml --- a/dada2_filterAndTrim.xml Sun May 05 12:28:14 2019 -0400 +++ b/dada2_filterAndTrim.xml Tue May 07 10:30:34 2019 -0400 @@ -28,7 +28,7 @@ #else: maxEE <- c($filter.maxEE) #end if -#if $paired_cond.paired_select == "TRUE" and $seprev_cond.seprev_select == "yes" +#if $paired_cond.paired_select != "single" and $seprev_cond.seprev_select == "yes" truncQ <- c(truncQ,$seprev_cond.trim.truncQ) truncLen <- c(truncLen,$seprev_cond.trim.truncLen) trimLeft <- c(trimLeft,$seprev_cond.trim.trimLeft) @@ -52,14 +52,19 @@ rev <- NULL filt.fwd <- NULL filt.rev <- NULL -#if $paired_cond.paired_select == "TRUE" +#if $paired_cond.paired_select == "paired" fwd <- c(fwd, '$paired_cond.reads.forward') rev <- c(rev, '$paired_cond.reads.reverse') filt.fwd <- c(filt.fwd, '$paired_output.forward') filt.rev <- c(filt.rev, '$paired_output.reverse') +#else if $paired_cond.paired_select == "separate" + fwd <- c(fwd, '$paired_cond.forward') + rev <- c(rev, '$paired_cond.reverse') + filt.fwd <- c(filt.fwd, '$output_fwd') + filt.rev <- c(filt.rev, '$output_rev') #else fwd <- c(fwd, '$paired_cond.reads') - filt.fwd <- c(filt.fwd, '$output') + filt.fwd <- c(filt.fwd, '$output_single') #end if library(dada2, quietly=T) @@ -68,20 +73,29 @@ truncQ = truncQ, truncLen = truncLen, trimLeft = trimLeft, trimRight = trimRight, maxLen = maxLen, minLen = minLen, maxN = maxN, minQ = minQ, maxEE = maxEE, rm.phix = $rmPhiX, orient.fwd = '$orientFwd') -rownames(ftout) <- c( '$paired_cond.reads.element_identifier' ) +#if $paired_cond.paired_select == "separate": + rownames(ftout) <- c( '$paired_cond.forward.element_identifier' ) +#else: + rownames(ftout) <- c( '$paired_cond.reads.element_identifier' ) +#end if write.table(ftout, "$outtab", quote=F, sep="\t", col.names=NA) ]]> - - + + + - + - + + + + + @@ -106,10 +120,16 @@ - paired_cond['paired_select'] == "TRUE" + paired_cond['paired_select'] == "paired" - - paired_cond['paired_select'] == "FALSE" + + paired_cond['paired_select'] == "single" + + + paired_cond['paired_select'] == "separate" + + + paired_cond['paired_select'] == "separate" output_statistics @@ -120,7 +140,7 @@ currently removed `expect_num_outputs="2"` because of https://github.com/galaxyproject/galaxy/pull/7894--> - + @@ -134,19 +154,30 @@ + + + + + + + + + + + - + - + - + @@ -154,12 +185,12 @@ - + - + @@ -168,13 +199,13 @@ - + - + @@ -201,12 +232,14 @@ Description ........... -Filters and trims an input FASTQ dataset (can be compressed) based on several user-definable criteria, and outputs a compressed FASTQ data set containing those trimmed reads which passed the filters. For paired end data forward and reverse FASTQ datasets can be provided as pair of FASTQ datasets, in which case filtering is performed on the forward and reverse reads independently, and both reads must pass for the read pair to be in the output. +Filters and trims a FASTQ dataset (can be compressed) based on several user-definable criteria, and outputs a compressed FASTQ data set containing those trimmed reads which passed the filters. For paired end data forward and reverse FASTQ datasets can be provided as pair of FASTQ datasets (or two separate data sets), in which case filtering is performed on the forward and reverse reads independently, and both reads must pass for the read pair to be in the output. Usage ..... -**Input** is a FASTQ dataset (or a pair on case of paired end data) containing all reads of a sample. If you have multiple samples it is suggested to organize them in a (paired) collection. The **output** is a (paired) collection of filtered and trimmed paired FASTQ datasets (again one data set or pair per sample). +**Input** is a FASTQ dataset (or a pair in case of paired end data) containing all reads of a sample. It is suggested to organize them in a (paired) collection (in particular if you have multiple samples). + +**Output** is a (paired) collection of filtered and trimmed paired FASTQ datasets (again one data set or pair per sample). Upstream dada2 tools are *dada2: derepFastq* and *dada2: learnErrorRates*. Note that these tools do not work on paired end data. So, if you have paired end data you need to split the generated paired collection into one containing the forward reads and one containing the reverse reads. This can be done by the *unzip collection* tool.