comparison picard_MarkDuplicates.xml @ 14:486d7500da69 draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/picard commit 7036343b9ac0a0ffc2ce4f6db465b9298ef05e73
author iuc
date Mon, 16 Apr 2018 21:27:09 -0400
parents c7dd8d7946b8
children a5a13ea16d17
comparison
equal deleted inserted replaced
13:c7dd8d7946b8 14:486d7500da69
1 <tool name="MarkDuplicates" id="picard_MarkDuplicates" version="@TOOL_VERSION@.2"> 1 <tool name="MarkDuplicates" id="picard_MarkDuplicates" version="@TOOL_VERSION@.@WRAPPER_VERSION@">
2 <description>examine aligned records in BAM datasets to locate duplicate molecules</description> 2 <description>examine aligned records in BAM datasets to locate duplicate molecules</description>
3 <macros> 3 <macros>
4 <import>picard_macros.xml</import> 4 <import>picard_macros.xml</import>
5 <token name="@WRAPPER_VERSION@">0</token>
5 </macros> 6 </macros>
6 <expand macro="requirements" /> 7 <expand macro="requirements" />
7 <command detect_errors="exit_code"><![CDATA[ 8 <command detect_errors="exit_code"><![CDATA[
8 @java_options@ 9 @java_options@
9 @symlink_element_identifier@ 10 @symlink_element_identifier@
51 <option value="SUM_OF_BASE_QUALITIES">SUM_OF_BASE_QUALITIES</option> 52 <option value="SUM_OF_BASE_QUALITIES">SUM_OF_BASE_QUALITIES</option>
52 <option value="TOTAL_MAPPED_REFERENCE_LENGTH">TOTAL_MAPPED_REFERENCE_LENGTH</option> 53 <option value="TOTAL_MAPPED_REFERENCE_LENGTH">TOTAL_MAPPED_REFERENCE_LENGTH</option>
53 </param> 54 </param>
54 55
55 <param name="read_name_regex" type="text" value="" label="Regular expression that can be used in unusual situations to parse non-standard read names in the incoming SAM/BAM dataset" help="READ_NAME_REGEX; Read names are parsed to extract three variables: tile/region, x coordinate and y coordinate. These values are used to estimate the rate of optical duplication in order to give a more accurate estimated library size. See help below for more info; default='' (uses : separation)"> 56 <param name="read_name_regex" type="text" value="" label="Regular expression that can be used in unusual situations to parse non-standard read names in the incoming SAM/BAM dataset" help="READ_NAME_REGEX; Read names are parsed to extract three variables: tile/region, x coordinate and y coordinate. These values are used to estimate the rate of optical duplication in order to give a more accurate estimated library size. See help below for more info; default='' (uses : separation)">
56 57
57 <sanitizer> 58 <sanitizer>
58 <valid initial="string.printable"> 59 <valid initial="string.printable">
59 </valid> 60 </valid>
60 </sanitizer> 61 </sanitizer>
61 </param> 62 </param>
125 variables, in order. It must match the entire read name. Note that if the default regex 126 variables, in order. It must match the entire read name. Note that if the default regex
126 is specified, a regex match is not actually done, but instead the read name is split on 127 is specified, a regex match is not actually done, but instead the read name is split on
127 colon character. For 5 element names, the 3rd, 4th and 5th elements are assumed to be 128 colon character. For 5 element names, the 3rd, 4th and 5th elements are assumed to be
128 tile, x and y values. For 7 element names (CASAVA 1.8), the 5th, 6th, and 7th elements 129 tile, x and y values. For 7 element names (CASAVA 1.8), the 5th, 6th, and 7th elements
129 are assumed to be tile, x and y values. Default value: '' 130 are assumed to be tile, x and y values. Default value: ''
130 131
131 132
132 DUPLICATE_SCORING_STRATEGY=ScoringStrategy 133 DUPLICATE_SCORING_STRATEGY=ScoringStrategy
133 DS=ScoringStrategy The scoring strategy for choosing the non-duplicate among candidates. Default value: 134 DS=ScoringStrategy The scoring strategy for choosing the non-duplicate among candidates. Default value:
134 SUM_OF_BASE_QUALITIES. Possible values: {SUM_OF_BASE_QUALITIES, TOTAL_MAPPED_REFERENCE_LENGTH} 135 SUM_OF_BASE_QUALITIES. Possible values: {SUM_OF_BASE_QUALITIES, TOTAL_MAPPED_REFERENCE_LENGTH}
135 136
136 OPTICAL_DUPLICATE_PIXEL_DISTANCE=Integer 137 OPTICAL_DUPLICATE_PIXEL_DISTANCE=Integer
137 The maximum offset between two duplicate clusters in order to consider them optical 138 The maximum offset between two duplicate clusters in order to consider them optical
138 duplicates. This should be set to 100 for (circa 2011+) read names and typical flowcells. 139 duplicates. This should be set to 100 for (circa 2011+) read names and typical flowcells.
139 Structured flow cells (NovaSeq, HiSeq 4000, X) should use ~2500. 140 Structured flow cells (NovaSeq, HiSeq 4000, X) should use ~2500.
140 For older conventions, distances could be to some fairly small number (e.g. 5-10 pixels) 141 For older conventions, distances could be to some fairly small number (e.g. 5-10 pixels)
141 Default value: 100. 142 Default value: 100.
142 143
143 BARCODE_TAG=String Barcode SAM tag (ex. BC for 10X Genomics) Default value: null. 144 BARCODE_TAG=String Barcode SAM tag (ex. BC for 10X Genomics) Default value: null.
144 145
145 @more_info@ 146 @more_info@
146 147
147 </help> 148 </help>
149 <expand macro="citations" />
148 </tool> 150 </tool>
149
150