# HG changeset patch # User matthias # Date 1557073950 14400 # Node ID e76f2213f0ccd1aa228ce8df3cdad78fca1034a6 # Parent e727d31bcb6016d4ca8c692a9ac5f3fe51697d6c planemo upload for repository https://github.com/bernt-matthias/mb-galaxy-tools/tree/topic/dada2/tools/dada2 commit 990192685955e9cda0282e348c28ef6462d88a38 diff -r e727d31bcb60 -r e76f2213f0cc README.md --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/README.md Sun May 05 12:32:30 2019 -0400 @@ -0,0 +1,39 @@ +Wrappers for the core functionality of the dada2 package https://benjjneb.github.io/dada2/index.html. + +- filterAndTrim +- derep +- learnErrors +- dada +- mergePairs +- makeSequenceTable +- removeBimeraDenovo + +Datatypes +========= + +The dada2 Galaxy wrappers use a few extra data types to ensure that only inputs of the correct type can be used. + +For the outputs of derep, dada, learnErrors, and mergePairs the following datatypes are used that derive from Rdata (which contains the named list that is returned from the corresponding dada function): + +- dada2_derep (Rdata: named list see docs for derep-class) +- dada2_dada (Rdata: named list, see docs for dada-class) +- dada2_errorrates (Rdata: named list, see docs for learnErrors) +- dada2_mergepairs (Rdata: named list, see docs for mergePairs) + +For the outputs of makeSequenceTable and removeBimeraDenovo the following data types are used which derive from tabular: + +- dada2_uniques +-- in R a named integer vector (names are the unique sequences) +-- in Galaxy written as a table (each row corresponding to a unique sequence, column 1: the sequence, column 2: the count) +- dada2_sequencetable +-- in R a named integer matrix (rows = samples, columns = unique sequences) +-- in Galaxy written as a table (rows = unique sequences, columns = samples) + +Note the difference between the R and Galaxy representations! The main motivation is that the dada2_sequencetable is analogous to OTU tables as produced for instance by qiime (and it seemed natural to extend this to the uniques which are essentially a sequencetables of single samples). + + +TODOs +===== + +- implememt getUniques tool to view intermediate results? +- implement tests for cached reference data diff -r e727d31bcb60 -r e76f2213f0cc dada2_mergePairs.xml --- a/dada2_mergePairs.xml Mon Apr 29 09:51:19 2019 -0400 +++ b/dada2_mergePairs.xml Sun May 05 12:32:30 2019 -0400 @@ -12,13 +12,13 @@ - - - - + + + + - - + + - - - - + + + + + + + + + + + + + + + + 0 by default) mismatches in the overlap region. +This function attempts to merge each denoised pair of forward and reverse reads, rejecting any pairs which do not sufficiently overlap (at least 12bp by default) or which contain too many (>0 by default) mismatches in the overlap region. Usage ..... @@ -60,7 +72,7 @@ **Input** - dereplicated reads and learned error rates of the forward reads -- dereplicated reads and learned error rates of the reversereads +- dereplicated reads and learned error rates of the reverse reads **Output** diff -r e727d31bcb60 -r e76f2213f0cc macros.xml --- a/macros.xml Mon Apr 29 09:51:19 2019 -0400 +++ b/macros.xml Sun May 05 12:32:30 2019 -0400 @@ -25,23 +25,44 @@ dada2_derep,dada2_dada,dada2_mergepairs + + + +
@@ -63,9 +84,9 @@ - - - + + +