# HG changeset patch # User iuc # Date 1424922154 18000 # Node ID 361d2cb38860b709bd2cdd1c41154d6d0d86eb76 # Parent b0e3a2df7897c7e2f8ca3e0d04102805f7e454ea Uploaded diff -r b0e3a2df7897 -r 361d2cb38860 .shed.yml --- a/.shed.yml Wed Feb 11 07:52:43 2015 -0500 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,3 +0,0 @@ -# repository published to https://toolshed.g2.bx.psu.edu/repos/iuc/pear -owner: iuc -name: pear diff -r b0e3a2df7897 -r 361d2cb38860 pear.xml --- a/pear.xml Wed Feb 11 07:52:43 2015 -0500 +++ b/pear.xml Wed Feb 25 22:42:34 2015 -0500 @@ -13,8 +13,15 @@ - - + + + + + + + + + + + + + - - - - - - - + @@ -74,19 +94,17 @@ - + - + - - @@ -118,82 +136,20 @@ Forward paired-end FASTQ file. - -r, --reverse-fastq Reverse paired-end FASTQ file. +PEAR_ is an ultrafast, memory-efficient and highly accurate pair-end read merger. +PEAR evaluates all possible paired-end read overlaps and without requiring the target fragment +size as input. In addition, it implements a statistical test for minimizing false-positive results. +Together with a highly optimized implementation, it can merge millions of paired end reads within a couple of minutes +on a standard desktop computer. -Optional: - -p, --p-value Specify a p-value for the statistical test. If the computed - p-value of a possible assembly exceeds the specified p-value - then paired-end read will not be assembled. Valid options - are: 0.0001, 0.001, 0.01, 0.05 and 1.0. Setting 1.0 disables - the test. (default: 0.01) - -v, --min-overlap Specify the minimum overlap size. The minimum overlap may be - set to 1 when the statistical test is used. However, further - restricting the minimum overlap size to a proper value may - reduce false-positive assembles. (default: 10) - -m, --max-assembly-length Specify the maximum possible length of the assembled - sequences. Setting this value to 0 disables the restriction - and assembled sequences may be arbitrary long. (default: 0) - -n, --min-assembly-length Specify the minimum possible length of the assembled - sequences. Setting this value to 0 disables the restriction - and assembled sequences may be arbitrary short. (default: - 50) - -t, --min-trim-length Specify the minimum length of reads after trimming the low - quality part (see option -q). (default: 1) - -q, --quality-threshold Specify the quality score threshold for trimming the low - quality part of a read. If the quality scores of two - consecutive bases are strictly less than the specified - threshold, the rest of the read will be trimmed. (default: - 0) - -u, --max-uncalled-base Specify the maximal proportion of uncalled bases in a read. - Setting this value to 0 will cause PEAR to discard all reads - containing uncalled bases. The other extreme setting is 1 - which causes PEAR to process all reads independent on the - number of uncalled bases. (default: 1) - -g, --test-method Specify the type of statistical test. Two options are - available. (default: 1) - 1: Given the minimum allowed overlap, test using the highest - OES. Note that due to its discrete nature, this test usually - yields a lower p-value for the assembled read than the cut- - off (specified by -p). For example, setting the cut-off to - 0.05 using this test, the assembled reads might have an - actual p-value of 0.02. +For more information please look at the documentation_ and `github repository`_. - 2. Use the acceptance probability (m.a.p). This test methods - computes the same probability as test method 1. However, it - assumes that the minimal overlap is the observed overlap - with the highest OES, instead of the one specified by -v. - Therefore, this is not a valid statistical test and the - 'p-value' is in fact the maximal probability for accepting - the assembly. Nevertheless, we observed in practice that for - the case the actual overlap sizes are relatively small, test - 2 can correctly assemble more reads with only slightly - higher false-positive rate. - -e, --empirical-freqs Disable empirical base frequencies. (default: use empirical - base frequencies) - -s, --score-method Specify the scoring method. (default: 2) - 1. OES with +1 for match and -1 for mismatch. - 2: Assembly score (AS). Use +1 for match and -1 for mismatch - multiplied by base quality scores. - 3: Ignore quality scores and use +1 for a match and -1 for a - mismatch. - -c, --cap Specify the upper bound for the resulting quality score. If - set to zero, capping is disabled. (default: 40) - -z, --nbase When merging a base-pair that consists of two non-equal - bases out of which none is degenerate, set the merged base - to N and use the highest quality score of the two bases - - +.. _PEAR: http://sco.h-its.org/exelixis/web/software/pear/ +.. _documentation: http://sco.h-its.org/exelixis/web/software/pear/doc.html +.. _github repository: https://github.com/xflouris/PEAR ]]>