annotate picard_DownsampleSam.xml @ 146:767bcb0a06df draft

fixed downsamplesam
author Rayan Chikhi <chikhi@psu.edu>
date Mon, 21 Jul 2014 15:24:40 -0400
parents b2ca2d181fb4
children b6662c2c8d64
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
1 <tool name="Downsample SAM/BAM" id="picard_DownsampleSam" version="1.106.0">
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
2 <!-- found on https://bitbucket.org/bwlang/galaxy-dist/src/ca5ded2e18a9ef802c31429e3cb861e8775b24d0/tools/picard/picard_DownsampleSam.xml -->
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
3 <description>Down-sample a file to retain a subset of the reads</description>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
4 <requirements><requirement type="package" version="1.106.0">picard</requirement></requirements>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
5 <command interpreter="python">
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
6 picard_wrapper.py
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
7 --input "$inputFile"
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
8 --output "$outFile"
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
9 --probability=$probability
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
10 --output-format $outputFormat
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
11 --seed=$seed
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
12 -j "\$JAVA_JAR_PATH/DownsampleSam.jar"
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
13 </command>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
14 <inputs>
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
15 <param format="sam,bam" name="inputFile" type="data" label="SAM/BAM dataset to be downsampled"
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
16 help="If empty, upload or import a SAM or BAM dataset." />
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
17 <param name="probability" type="float" size="5" label="Probability (between 0 and 1) that any given read will be kept" help="specify 1 to keep all reads, 0.1 to keep 10% of the reads" value="1" />
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
18 <param name="seed" type="integer" size="5" label="Random seed value" help="(same seed + same probability = same set of reads kept)" value="1" />
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
19 <param name="outputFormat" type="boolean" checked="True" truevalue="bam" falsevalue="sam" label="Output BAM instead of SAM" help="Uncheck for SAM output" />
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
20 </inputs>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
21 <outputs>
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
22 <data name="outFile" format="bam" label="${tool.name} on ${on_string}: downsampled ${outputFormat}">
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
23 <change_format>
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
24 <when input="outputFormat" value="sam" format="sam" />
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
25 </change_format>
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
26 </data>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
27 </outputs>
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
28
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
29 <tests>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
30 <test>
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
31 <param name="inputFile" value="cleansamtestinput.sam" ftype="sam" />
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
32 <param name="probability" value="0.1" />
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
33 <param name="seed" value="2" />
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
34 <param name="outputFormat" value="False" />
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
35 <output name="outFile" file="downsamplesamoutput.sam" ftype="sam" />
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
36 </test>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
37 </tests>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
38 <help>
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
39
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
40 .. class:: infomark
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
41
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
42 **Purpose**
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
43
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
44 Randomly down-sample a SAM or BAM file to retain a random subset of the reads. Mate-pairs are either both kept or both discarded. Reads marked as not primary alignments are all discarded. Each read is given a probability P of being retained - results with the exact same input in the same order and with the same value for RANDOM_SEED will produce the same results.
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
45
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
46 **Picard documentation**
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
47
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
48 This is a Galaxy wrapper for DownsampleSam, a part of the external package Picard-tools_.
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
49
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
50 .. _Picard-tools: http://www.google.com/search?q=picard+samtools
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
51
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
52 ------
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
53
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
54 .. class:: warningmark
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
55
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
56 **Warning on SAM/BAM quality**
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
57
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
58 Many SAM/BAM files produced externally and uploaded to Galaxy do not fully conform to SAM/BAM specifications. Galaxy deals with this by using the **LENIENT**
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
59 flag when it runs Picard, which allows reads to be discarded if they're empty or don't map. This appears
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
60 to be the only way to deal with SAM/BAM that cannot be parsed.
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
61
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
62
876ea92b5c28 Uploaded
devteam
parents:
diff changeset
63 </help>
143
b2ca2d181fb4 fixed downsample sam in picard1106 (accept bam)
Rayan Chikhi <chikhi@psu.edu>
parents: 62
diff changeset
64 </tool>