Mercurial > repos > devteam > picard
comparison picard_MarkDuplicates.xml @ 14:486d7500da69 draft
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/picard commit 7036343b9ac0a0ffc2ce4f6db465b9298ef05e73
author | iuc |
---|---|
date | Mon, 16 Apr 2018 21:27:09 -0400 |
parents | c7dd8d7946b8 |
children | a5a13ea16d17 |
comparison
equal
deleted
inserted
replaced
13:c7dd8d7946b8 | 14:486d7500da69 |
---|---|
1 <tool name="MarkDuplicates" id="picard_MarkDuplicates" version="@TOOL_VERSION@.2"> | 1 <tool name="MarkDuplicates" id="picard_MarkDuplicates" version="@TOOL_VERSION@.@WRAPPER_VERSION@"> |
2 <description>examine aligned records in BAM datasets to locate duplicate molecules</description> | 2 <description>examine aligned records in BAM datasets to locate duplicate molecules</description> |
3 <macros> | 3 <macros> |
4 <import>picard_macros.xml</import> | 4 <import>picard_macros.xml</import> |
5 <token name="@WRAPPER_VERSION@">0</token> | |
5 </macros> | 6 </macros> |
6 <expand macro="requirements" /> | 7 <expand macro="requirements" /> |
7 <command detect_errors="exit_code"><![CDATA[ | 8 <command detect_errors="exit_code"><![CDATA[ |
8 @java_options@ | 9 @java_options@ |
9 @symlink_element_identifier@ | 10 @symlink_element_identifier@ |
51 <option value="SUM_OF_BASE_QUALITIES">SUM_OF_BASE_QUALITIES</option> | 52 <option value="SUM_OF_BASE_QUALITIES">SUM_OF_BASE_QUALITIES</option> |
52 <option value="TOTAL_MAPPED_REFERENCE_LENGTH">TOTAL_MAPPED_REFERENCE_LENGTH</option> | 53 <option value="TOTAL_MAPPED_REFERENCE_LENGTH">TOTAL_MAPPED_REFERENCE_LENGTH</option> |
53 </param> | 54 </param> |
54 | 55 |
55 <param name="read_name_regex" type="text" value="" label="Regular expression that can be used in unusual situations to parse non-standard read names in the incoming SAM/BAM dataset" help="READ_NAME_REGEX; Read names are parsed to extract three variables: tile/region, x coordinate and y coordinate. These values are used to estimate the rate of optical duplication in order to give a more accurate estimated library size. See help below for more info; default='' (uses : separation)"> | 56 <param name="read_name_regex" type="text" value="" label="Regular expression that can be used in unusual situations to parse non-standard read names in the incoming SAM/BAM dataset" help="READ_NAME_REGEX; Read names are parsed to extract three variables: tile/region, x coordinate and y coordinate. These values are used to estimate the rate of optical duplication in order to give a more accurate estimated library size. See help below for more info; default='' (uses : separation)"> |
56 | 57 |
57 <sanitizer> | 58 <sanitizer> |
58 <valid initial="string.printable"> | 59 <valid initial="string.printable"> |
59 </valid> | 60 </valid> |
60 </sanitizer> | 61 </sanitizer> |
61 </param> | 62 </param> |
125 variables, in order. It must match the entire read name. Note that if the default regex | 126 variables, in order. It must match the entire read name. Note that if the default regex |
126 is specified, a regex match is not actually done, but instead the read name is split on | 127 is specified, a regex match is not actually done, but instead the read name is split on |
127 colon character. For 5 element names, the 3rd, 4th and 5th elements are assumed to be | 128 colon character. For 5 element names, the 3rd, 4th and 5th elements are assumed to be |
128 tile, x and y values. For 7 element names (CASAVA 1.8), the 5th, 6th, and 7th elements | 129 tile, x and y values. For 7 element names (CASAVA 1.8), the 5th, 6th, and 7th elements |
129 are assumed to be tile, x and y values. Default value: '' | 130 are assumed to be tile, x and y values. Default value: '' |
130 | 131 |
131 | 132 |
132 DUPLICATE_SCORING_STRATEGY=ScoringStrategy | 133 DUPLICATE_SCORING_STRATEGY=ScoringStrategy |
133 DS=ScoringStrategy The scoring strategy for choosing the non-duplicate among candidates. Default value: | 134 DS=ScoringStrategy The scoring strategy for choosing the non-duplicate among candidates. Default value: |
134 SUM_OF_BASE_QUALITIES. Possible values: {SUM_OF_BASE_QUALITIES, TOTAL_MAPPED_REFERENCE_LENGTH} | 135 SUM_OF_BASE_QUALITIES. Possible values: {SUM_OF_BASE_QUALITIES, TOTAL_MAPPED_REFERENCE_LENGTH} |
135 | 136 |
136 OPTICAL_DUPLICATE_PIXEL_DISTANCE=Integer | 137 OPTICAL_DUPLICATE_PIXEL_DISTANCE=Integer |
137 The maximum offset between two duplicate clusters in order to consider them optical | 138 The maximum offset between two duplicate clusters in order to consider them optical |
138 duplicates. This should be set to 100 for (circa 2011+) read names and typical flowcells. | 139 duplicates. This should be set to 100 for (circa 2011+) read names and typical flowcells. |
139 Structured flow cells (NovaSeq, HiSeq 4000, X) should use ~2500. | 140 Structured flow cells (NovaSeq, HiSeq 4000, X) should use ~2500. |
140 For older conventions, distances could be to some fairly small number (e.g. 5-10 pixels) | 141 For older conventions, distances could be to some fairly small number (e.g. 5-10 pixels) |
141 Default value: 100. | 142 Default value: 100. |
142 | 143 |
143 BARCODE_TAG=String Barcode SAM tag (ex. BC for 10X Genomics) Default value: null. | 144 BARCODE_TAG=String Barcode SAM tag (ex. BC for 10X Genomics) Default value: null. |
144 | 145 |
145 @more_info@ | 146 @more_info@ |
146 | 147 |
147 </help> | 148 </help> |
149 <expand macro="citations" /> | |
148 </tool> | 150 </tool> |
149 | |
150 |