# HG changeset patch
# User devteam
# Date 1390523568 18000
# Node ID 3abc76f7fa1292053ca89021564bbb0497bc53f6
# Parent 3a18b0c7aa7d831057125dbd70d783c5ee6e015a
Uploaded

diff -r 3a18b0c7aa7d -r 3abc76f7fa12 picard_DownsampleSam.xml
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/picard_DownsampleSam.xml Thu Jan 23 19:32:48 2014 -0500
@@ -0,0 +1,63 @@
+
+
+ picard
+
+ picard_wrapper.py
+ --input=$inputFile
+ --output=$outFile
+ --probability=$probability
+ --seed=$seed
+ -j "\$JAVA_JAR_PATH/DownsampleSam.jar"
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+.. class:: infomark
+
+**Purpose**
+
+Randomly down-sample a SAM or BAM file to retain a random subset of the reads. Mate-pairs are either both kept or both discarded. Reads marked as not primary alignments are all discarded. Each read is given a probability P of being retained - results with the exact same input in the same order and with the same value for RANDOM_SEED will produce the same results.
+
+**Picard documentation**
+
+This is a Galaxy wrapper for DownsampleSam, a part of the external package Picard-tools_.
+
+ .. _Picard-tools: http://www.google.com/search?q=picard+samtools
+
+------
+
+.. class:: warningmark
+
+**Warning on SAM/BAM quality**
+
+Many SAM/BAM files produced externally and uploaded to Galaxy do not fully conform to SAM/BAM specifications. Galaxy deals with this by using the **LENIENT**
+flag when it runs Picard, which allows reads to be discarded if they're empty or don't map. This appears
+to be the only way to deal with SAM/BAM that cannot be parsed.
+
+
+
+
\ No newline at end of file
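
The retention rule described under **Purpose** in the help text above can be illustrated with a small sketch. This is not Picard's implementation and not part of the patch: the `Read` record and the `downsample` function are hypothetical names introduced here, and keying the keep/discard decision on the read name is an assumption made only to show how mates can share one fate under a seeded random generator.

```python
import random
from collections import namedtuple

# Hypothetical minimal read record; real SAM/BAM records carry many more fields.
Read = namedtuple("Read", ["name", "is_secondary"])


def downsample(reads, probability, seed):
    """Illustrative sketch: keep each read name with the given probability."""
    rng = random.Random(seed)  # same seed + same input order -> same output
    decisions = {}             # read name -> keep/discard, shared by both mates
    kept = []
    for read in reads:
        if read.is_secondary:
            continue  # reads that are not primary alignments are always dropped
        if read.name not in decisions:
            # One draw per read name, so a mate pair is kept or discarded together.
            decisions[read.name] = rng.random() < probability
        if decisions[read.name]:
            kept.append(read)
    return kept
```

Calling `downsample(reads, 0.5, 42)` twice on the same list returns the same subset, which mirrors the reproducibility the help text attributes to RANDOM_SEED. In this sketch a probability of 1.0 keeps every primary read and 0.0 keeps none.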