# HG changeset patch # User matthias # Date 1559060025 14400 # Node ID ef3ebaa7003207ac685faaa164c46cde684646dd # Parent af8d1ccbd1535ba32a061b663c3d440f1b5a9a76 planemo upload for repository https://github.com/bernt-matthias/mb-galaxy-tools/tree/topic/dada2/tools/dada2 commit a54770771e567c7ad8a9dd75cc4689c3935ef11c diff -r af8d1ccbd153 -r ef3ebaa70032 README.md --- a/README.md Mon May 27 13:21:48 2019 -0400 +++ b/README.md Tue May 28 12:13:45 2019 -0400 @@ -11,7 +11,7 @@ Datatypes ========= -The dada2 Galaxy wrappers use a few extra data types to ensure that only inputs of the correct type can be used. +The dada2 Galaxy wrappers use a few extra data types to ensure that only inputs of the correct type can be used, these datatypes are available from Galaxy release 19.05, for earlier releases they need to be added manually. For the outputs of derep, dada, learnErrors, and mergePairs the following datatypes are used that derive from Rdata (which contains the named list that is returned from the corresponding dada function): @@ -35,5 +35,4 @@ TODOs ===== -- implememt getUniques tool to view intermediate results? -- implement tests for cached reference data +- implement getUniques tool to view intermediate results? diff -r af8d1ccbd153 -r ef3ebaa70032 dada2_learnErrors.xml --- a/dada2_learnErrors.xml Mon May 27 13:21:48 2019 -0400 +++ b/dada2_learnErrors.xml Tue May 28 12:13:45 2019 -0400 @@ -90,7 +90,7 @@ Error rates are learned by alternating between sample inference and error rate estimation until convergence. Additionally a plot is generated that shows the observed frequency of each transition (eg. A->C) as a function of the associated quality score, the final estimated error rates (if they exist), the initial input rates, and the expected error rates under the nominal definition of quality scores. -In addition a plot is generated (with plotErrors) that shows the observed frequency of each transition (eg. A->C) as a function of the associated quality score. Also the final estimated error rates (if they exist) are shown. Optionally also the initial input rates and the expected error rates under the nominal definition of quality scores can be added to the plot. +In addition a plot is generated (with plotErrors) that shows the observed frequency of each transition (eg. A->C) as a function of the associated quality score. Also the final estimated error rates (if they exist) are shown. Optionally also the initial input rates and the expected error rates under the nominal definition of quality scores can be added to the plot. Usage ..... @@ -112,7 +112,7 @@ It is expected that the estimated error rates (black lines in the plot) are in a good fit to the observed rates (points in the plot), and that the error rates drop with increased quality. Try to increase the **number of bases to use for learning** if this is not the case. -Error functions: +Error functions: - loessErrfun: accepts a matrix of observed transitions, with each transition corresponding to a row (eg. row 2 = A->C) and each column to a quality score (eg. col 31 = Q30). It returns a matrix of estimated error rates of the same shape. Error rates are estimates by a loess fit of the observed rates of each transition as a function of the quality score. Self-transitions (i.e. A->A) are taken to be the left-over probability. - noqualErrfun: accepts a matrix of observed transitions, groups together all observed transitions regardless of quality scores, and estimates the error rate for that transition as the observed fraction of those transitions. The effect is that quality scores will be effectively ignored. diff -r af8d1ccbd153 -r ef3ebaa70032 macros.xml --- a/macros.xml Mon May 27 13:21:48 2019 -0400 +++ b/macros.xml Tue May 28 12:13:45 2019 -0400 @@ -26,7 +26,7 @@ dada2_derep,dada2_dada,dada2_mergepairs + ]]> + + + + + + + + + + + + + + + + + + + + @@ -69,7 +89,7 @@ - + diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/dada2_species.loc --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/dada2_species.loc Tue May 28 12:13:45 2019 -0400 @@ -0,0 +1,10 @@ +# This is a sample file distributed with Galaxy that is used to define a +# list of dada2 reference data sets for species assignment, using three +# tab separated columns: +# +# +# +# Datasets can be retrieved from http://busco.ezlab.org/frame_wget.html +# +# Datasets can be retrieved from https://benjjneb.github.io/dada2/training.html +test_buildid test_displayname ${__HERE__}/reference_species.fa diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/dada2_taxonomy.loc --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/dada2_taxonomy.loc Tue May 28 12:13:45 2019 -0400 @@ -0,0 +1,10 @@ +# This is a sample file distributed with Galaxy that is used to define a +# list of dada2 reference data sets for taxonomy assignment, using three +# tab separated columns: +# +# +# +# Datasets can be retrieved from https://benjjneb.github.io/dada2/training.html +# +# taxlevels is a comma separated list of taxonomy levels +test_buildid test_displayname ${__HERE__}/reference.fa Level1,Level2,Level3,Level4,Level5 diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/qualityProfile.pdf Binary file test-data/qualityProfile.pdf has changed diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/qualityProfileMultiple.pdf Binary file test-data/qualityProfileMultiple.pdf has changed diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/qualityProfileMultiple_rev.pdf Binary file test-data/qualityProfileMultiple_rev.pdf has changed diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/qualityProfileSmallSample.pdf Binary file test-data/qualityProfileSmallSample.pdf has changed diff -r af8d1ccbd153 -r ef3ebaa70032 test-data/qualityProfile_rev.pdf Binary file test-data/qualityProfile_rev.pdf has changed diff -r af8d1ccbd153 -r ef3ebaa70032 tool_data_table_conf.xml.test --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/tool_data_table_conf.xml.test Tue May 28 12:13:45 2019 -0400 @@ -0,0 +1,11 @@ + + + + value, name, path + +
+ + value, name, path, taxlevels + +
+