Mercurial > repos > jjohnson > snpeff_with_dep
annotate snpEff.xml @ 24:f22f28cd881e
Update for snpEff v3.2
author | Jim Johnson <jj@umn.edu> |
---|---|
date | Tue, 07 May 2013 14:58:15 -0500 |
parents | 50c1a8ff6c82 |
children |
rev | line source |
---|---|
24 | 1 <tool id="snpEff" name="SnpEff" version="3.2"> |
0 | 2 <description>Variant effect and annotation</description> |
3 <!-- | |
4 You will need to change the path to wherever your installation is. | |
5 You can change the amount of memory used by snpEff, just change the -Xmx parameter (e.g. use -Xmx2G for 2Gb of memory) | |
6 <command>java -Xmx6G -jar /path/to/your/snpEff/snpEff.jar eff -c /path/to/your/snpEff/snpEff/snpEff.config $inputFormat $offset -upDownStreamLen $udLength $filterIn $filterHomHet -no $filterOut -stats $statsFile $genomeVersion $input > $output </command> | |
24 | 7 Options: |
8 -a , -around : Show N codons and amino acids around change (only in coding regions). Default is 0 codons. | |
9 -i <format> : Input format [ vcf, txt, pileup, bed ]. Default: VCF. | |
10 -o <format> : Ouput format [ txt, vcf, gatk, bed, bedAnn ]. Default: VCF. | |
11 -interval : Use a custom interval file (you may use this option many times) | |
12 -chr <string> : Prepend 'string' to chromosome name (e.g. 'chr1' instead of '1'). Only on TXT output. | |
13 -s, -stats : Name of stats file (summary). Default is 'snpEff_summary.html' | |
14 -t : Use multiple threads (implies '-noStats'). Default 'off' | |
15 | |
16 Sequence change filter options: | |
17 -del : Analyze deletions only | |
18 -ins : Analyze insertions only | |
19 -hom : Analyze homozygous variants only | |
20 -het : Analyze heterozygous variants only | |
21 -minQ X, -minQuality X : Filter out variants with quality lower than X | |
22 -maxQ X, -maxQuality X : Filter out variants with quality higher than X | |
23 -minC X, -minCoverage X : Filter out variants with coverage lower than X | |
24 -maxC X, -maxCoverage X : Filter out variants with coverage higher than X | |
25 -nmp : Only MNPs (multiple nucleotide polymorphisms) | |
26 -snp : Only SNPs (single nucleotide polymorphisms) | |
27 | |
28 Results filter options: | |
29 -fi <bedFile> : Only analyze changes that intersect with the intervals specified in this file (you may use this option many times) | |
30 -no-downstream : Do not show DOWNSTREAM changes | |
31 -no-intergenic : Do not show INTERGENIC changes | |
32 -no-intron : Do not show INTRON changes | |
33 -no-upstream : Do not show UPSTREAM changes | |
34 -no-utr : Do not show 5_PRIME_UTR or 3_PRIME_UTR changes | |
35 | |
36 Annotations options: | |
37 -cancer : Perform 'cancer' comparissons (Somatic vs Germline). Default: false | |
38 -canon : Only use canonical transcripts. | |
39 -geneId : Use gene ID instead of gene name (VCF output). Default: false | |
40 -hgvs : Use HGVS annotations for amino acid sub-field. Default: false | |
41 -lof : Add loss of function (LOF) and Nonsense mediated decay (NMD) tags. | |
42 -reg <name> : Regulation track to use (this option can be used add several times). | |
43 -oicr : Add OICR tag in VCF file. Default: false | |
44 -onlyReg : Only use regulation tracks. | |
45 -onlyTr <file.txt> : Only use the transcripts in this file. Format: One transcript ID per line. | |
46 -sequenceOntolgy : Use Sequence Ontolgy terms. Default: false | |
47 -ss, -spliceSiteSize <int> : Set size for splice sites (donor and acceptor) in bases. Default: 2 | |
48 -ud, -upDownStreamLen <int> : Set upstream downstream interval length (in bases) | |
49 | |
50 Generic options: | |
51 -0 : File positions are zero-based (same as '-inOffset 0 -outOffset 0') | |
52 -1 : File positions are one-based (same as '-inOffset 1 -outOffset 1') | |
53 -c , -config : Specify config file | |
54 -h , -help : Show this help and exit | |
55 -if, -inOffset : Offset input by a number of bases. E.g. '-inOffset 1' for one-based input files | |
56 -of, -outOffset : Offset output by a number of bases. E.g. '-outOffset 1' for one-based output files | |
57 -noLog : Do not report usage statistics to server | |
58 -noStats : Do not create stats (summary) file | |
59 -q , -quiet : Quiet mode (do not show any messages or errors) | |
60 -v , -verbose : Verbose mode | |
61 | |
0 | 62 --> |
63 <requirements> | |
24 | 64 <requirement type="package" version="3.2">snpEff</requirement> |
0 | 65 </requirements> |
22
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
66 <command> |
24 | 67 SNPEFF_DATA_DIR=`grep '^data_dir' \$JAVA_JAR_PATH/snpEff.config | sed 's/.*data_dir.*[=:]//'`; |
68 eval "if [ ! -e \$SNPEFF_DATA_DIR/$genomeVersion ] ; | |
22
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
69 then java -Xmx6G -jar \$JAVA_JAR_PATH/snpEff.jar download -c \$JAVA_JAR_PATH/snpEff.config $genomeVersion ; |
24 | 70 fi"; |
71 java -Xmx6G -jar \$JAVA_JAR_PATH/snpEff.jar eff -c \$JAVA_JAR_PATH/snpEff.config -i $inputFormat -o $outputFormat -upDownStreamLen $udLength | |
72 #if $spliceSiteSize and $spliceSiteSize.__str__ != '': | |
73 -spliceSiteSize $spliceSiteSize | |
74 #end if | |
75 #if $filterIn and $filterIn.__str__ != 'no_filter': | |
76 -$filterIn | |
77 #end if | |
78 #if $filterHomHet and $filterHomHet.__str__ != 'no_filter': | |
79 -$filterHomHet | |
80 #end if | |
81 #if $annotations and $annotations.__str__ != '': | |
82 -#slurp | |
83 #echo ' -'.join($annotations.__str__.split(',')) | |
84 #end if | |
85 #if $filterOut and $filterOut.__str__ != '': | |
86 -#slurp | |
87 #echo ' -'.join($filterOut.__str__.split(',')) | |
88 #end if | |
89 #if str( $transcripts ) != 'None': | |
90 -onlyTr $transcripts | |
91 #end if | |
92 #if str( $intervals ) != 'None': ### fix this for multiple dataset input | |
93 -interval $intervals | |
94 #end if | |
95 #if $statsFile: | |
96 -stats $statsFile | |
97 #end if | |
98 #if $offset.__str__ != '': | |
99 -${offset} | |
100 #end if | |
101 #if $chr.__str__.strip() != '': | |
102 -chr "$chr" | |
103 #end if | |
104 $noLog $genomeVersion $input > $snpeff_output | |
105 </command> | |
0 | 106 <inputs> |
107 <param format="vcf,tabular,pileup,bed" name="input" type="data" label="Sequence changes (SNPs, MNPs, InDels)"/> | |
108 | |
109 <param name="inputFormat" type="select" label="Input format"> | |
24 | 110 <option value="vcf" selected="true">VCF</option> |
111 <option value="txt">Tabular (Deprecated)</option> | |
112 <option value="pileup">Pileup (Deprecated)</option> | |
113 <option value="bed">BED (Deprecated)</option> | |
0 | 114 </param> |
115 | |
116 <param name="outputFormat" type="select" label="Output format"> | |
24 | 117 <option value="vcf" selected="true">VCF (only if input is VCF)</option> |
0 | 118 <option value="txt">Tabular</option> |
119 <option value="bed">BED</option> | |
120 <option value="bedAnn">BED Annotations</option> | |
121 </param> | |
122 | |
123 <param name="genomeVersion" type="select" label="Genome"> | |
24 | 124 <!--GENOME DESCRIPTION--> |
22
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
125 <options from_file="snpeffect_genomedb.loc"> |
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
126 <column name="name" index="1"/> |
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
127 <column name="value" index="0"/> |
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
128 </options> |
0 | 129 </param> |
130 | |
131 <param name="udLength" type="select" label="Upstream / Downstream length"> | |
132 <option value="0">No upstream / downstream intervals (0 bases)</option> | |
133 <option value="200">200 bases</option> | |
134 <option value="500">500 bases</option> | |
135 <option value="1000">1000 bases</option> | |
136 <option value="2000">2000 bases</option> | |
137 <option value="5000" selected="true">5000 bases</option> | |
138 <option value="10000">10000 bases</option> | |
139 <option value="20000">20000 bases</option> | |
140 </param> | |
141 | |
24 | 142 <param name="spliceSiteSize" type="select" optional="true" label="Set size for splice sites (donor and acceptor) in bases. Default: 2"> |
143 <option value="1">1 base</option> | |
144 <option value="2">2 bases</option> | |
145 <option value="3">3 bases</option> | |
146 <option value="4">4 bases</option> | |
147 <option value="5">5 bases</option> | |
148 <option value="6">6 bases</option> | |
149 <option value="7">7 bases</option> | |
150 <option value="8">8 bases</option> | |
151 <option value="9">9 bases</option> | |
152 </param> | |
0 | 153 |
24 | 154 <param name="filterHomHet" type="select" display="radio" label="Filter homozygous / heterozygous changes"> |
155 <option value="no_filter" selected="true">No filter (analyze everything)</option> | |
156 <option value="hom">Analyze homozygous sequence changes only </option> | |
157 <option value="het">Analyze heterozygous sequence changes only </option> | |
158 </param> | |
159 | |
160 <!-- The tool testing code can not handle select,radio,checkbox values that start with '-', so the '-' is added in the command generation --> | |
161 <param name="filterIn" type="select" display="radio" label="Filter sequence changes"> | |
162 <option value="no_filter" selected="true">No filter (analyze everything)</option> | |
163 <option value="del">Analyze deletions only </option> | |
164 <option value="ins">Analyze insertions only </option> | |
165 <option value="mnp">Only MNPs (multiple nucleotide polymorphisms) </option> | |
166 <option value="snp">Only SNPs (single nucleotide polymorphisms) </option> | |
167 </param> | |
0 | 168 |
24 | 169 <param name="annotations" type="select" display="checkboxes" multiple="true" optional="true" label="Annotation options"> |
170 <option value="cancer">Perform 'cancer' comparissons (Somatic vs Germline). Default: false</option> | |
171 <option value="canon">Only use canonical transcripts.</option> | |
172 <option value="geneId">Use gene ID instead of gene name (VCF output). Default: false</option> | |
173 <option value="hgvs">Use HGVS annotations for amino acid sub-field. Default: false</option> | |
174 <option value="lof">Add loss of function (LOF) and Nonsense mediated decay (NMD) tags.</option> | |
175 <option value="oicr">Add OICR tag in VCF file. Default: false</option> | |
176 <option value="onlyReg">Only use regulation tracks.</option> | |
177 <option value="sequenceOntolgy">Use Sequence Ontolgy terms. Default: false</option> | |
178 </param> | |
179 | |
180 <param name="regulation" type="select" display="checkboxes" multiple="true" optional="true" label="Non-coding and regulatory Annotation"> | |
181 <help>These are available for only a few genomes</help> | |
182 <!--GENOME REG_NAME --> | |
183 <options from_file="snpeffect_regulationdb.loc"> | |
184 <column name="name" index="1"/> | |
185 <column name="value" index="0"/> | |
186 <filter type="param_value" ref="genomeVersion" key="name" column="1" /> | |
187 </options> | |
188 </param> | |
0 | 189 |
24 | 190 <param name="intervals" format="bed" type="data" optional="true" label="Use custom interval file for annotation"/> |
191 <param name="transcripts" format="tabular" type="data" optional="true" label="Only use the transcripts in this file. Format: One transcript ID per line."/> | |
192 | |
193 <param name="filterOut" type="select" display="checkboxes" multiple="true" optional="true" label="Filter output"> | |
194 <option value="no-downstream">Do not show DOWNSTREAM changes </option> | |
195 <option value="no-intergenic">Do not show INTERGENIC changes </option> | |
196 <option value="no-intron">Do not show INTRON changes </option> | |
197 <option value="no-upstream">Do not show UPSTREAM changes </option> | |
198 <option value="no-utr">Do not show 5_PRIME_UTR or 3_PRIME_UTR changes </option> | |
199 </param> | |
200 | |
201 <param name="offset" type="select" display="radio" optional="true" label="Chromosomal position"> | |
202 <option value="" selected="true">Use default (based on input type)</option> | |
203 <option value="0">Force zero-based positions (both input and output)</option> | |
204 <option value="1">Force one-based positions (both input and output)</option> | |
0 | 205 </param> |
24 | 206 <param name="chr" type="text" optionl="true" label="Text to prepend to chromosome name" help="By default SnpEff simplifies all chromosome names. For instance 'chr1' is just '1'. You can prepend any string you want to the chromosome name."> |
207 <validator type="regex" message="No whitespace allows">^\S*$</validator> | |
208 | |
209 </param> | |
210 <param name="generate_stats" type="boolean" truevalue="" falsevalue="-noStats" checked="true" label="Produce Summary Stats"/> | |
211 <param name="noLog" type="boolean" truevalue="-noLog" falsevalue="" checked="true" label="Do not report usage statistics to server"/> | |
0 | 212 </inputs> |
213 <outputs> | |
24 | 214 <data format="vcf" name="snpeff_output" > |
0 | 215 <change_format> |
24 | 216 <when input="outputFormat" value="vcf" format="vcf" /> |
0 | 217 <when input="outputFormat" value="txt" format="tabular" /> |
218 <when input="outputFormat" value="bed" format="bed" /> | |
219 <when input="outputFormat" value="bedAnn" format="bed" /> | |
220 </change_format> | |
221 </data> | |
24 | 222 |
223 <data format="html" name="statsFile"> | |
224 <filter>generate_stats == True</filter> | |
225 </data> | |
0 | 226 </outputs> |
5 | 227 <stdio> |
228 <exit_code range="1:" level="fatal" description="Error" /> | |
24 | 229 <exit_code range="-1" level="fatal" description="Error: Cannot open file" /> |
5 | 230 </stdio> |
24 | 231 <tests> |
232 <test> | |
233 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
234 <param name="inputFormat" value="vcf"/> | |
235 <param name="outputFormat" value="vcf"/> | |
236 <param name="genomeVersion" value="testCase"/> | |
237 <param name="udLength" value="0"/> | |
238 <param name="filterHomHet" value="no_filter"/> | |
239 <param name="filterIn" value="no_filter"/> | |
240 <param name="generate_stats" value="False"/> | |
241 <!-- | |
242 <param name="filterOut" value="no-upstream"/> | |
243 --> | |
244 <output name="snpeff_output"> | |
245 <assert_contents> | |
246 <!-- Check that an effect was added --> | |
247 <has_text text="EFF=" /> | |
248 </assert_contents> | |
249 </output> | |
250 <!-- Check for a HTML header indicating that this was successful --> | |
251 <!-- | |
252 <output name="statsFile"> | |
253 <assert_contents> | |
254 <has_text text="SnpEff: Variant analysis" /> | |
255 </assert_contents> | |
256 </output> | |
257 --> | |
258 </test> | |
259 | |
260 <test> | |
261 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
262 <param name="inputFormat" value="vcf"/> | |
263 <param name="outputFormat" value="vcf"/> | |
264 <param name="genomeVersion" value="testCase"/> | |
265 <param name="udLength" value="0"/> | |
266 <param name="filterHomHet" value="het"/> | |
267 <param name="filterIn" value="no_filter"/> | |
268 <!-- | |
269 <param name="filterOut" value=""/> | |
270 --> | |
271 <param name="generate_stats" value="False"/> | |
272 <output name="snpeff_output"> | |
273 <assert_contents> | |
274 <!-- Check that NO effects were added since -het is set --> | |
275 <not_has_text text="EFF=NON_SYNONYMOUS_CODING" /> | |
276 </assert_contents> | |
277 </output> | |
278 </test> | |
279 <test> | |
280 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
281 <param name="inputFormat" value="vcf"/> | |
282 <param name="outputFormat" value="vcf"/> | |
283 <param name="genomeVersion" value="testCase"/> | |
284 <param name="udLength" value="0"/> | |
285 <param name="filterHomHet" value="no_filter"/> | |
286 <param name="filterIn" value="del"/> | |
287 <!-- | |
288 <param name="filterOut" value=""/> | |
289 --> | |
290 <param name="generate_stats" value="False"/> | |
291 <output name="snpeff_output"> | |
292 <assert_contents> | |
293 <!-- Check that deleletions were evaluated --> | |
294 <has_text_matching expression="Y\t59030478\t.*EFF=INTERGENIC" /> | |
295 <!-- Check that insertion on last line was NOT evaluated --> | |
296 <has_text_matching expression="Y\t59032947\t.*SF=5\tGT" /> | |
297 </assert_contents> | |
298 </output> | |
299 </test> | |
300 <test> | |
301 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
302 <param name="inputFormat" value="vcf"/> | |
303 <param name="outputFormat" value="vcf"/> | |
304 <param name="genomeVersion" value="testCase"/> | |
305 <param name="udLength" value="0"/> | |
306 <param name="filterHomHet" value="no_filter"/> | |
307 <param name="filterIn" value="no_filter"/> | |
308 <param name="filterOut" value="no-upstream"/> | |
309 <param name="generate_stats" value="False"/> | |
310 <output name="snpeff_output"> | |
311 <assert_contents> | |
312 <!-- Check that NO UPSTREAM effect was added --> | |
313 <not_has_text text="UPSTREAM" /> | |
314 </assert_contents> | |
315 </output> | |
316 </test> | |
317 | |
318 </tests> | |
0 | 319 <help> |
320 | |
321 This tool calculate the effect of variants (SNPs/MNPs/Insertions) and deletions. | |
322 | |
323 For details about this tool, please go to http://snpEff.sourceforge.net | |
324 | |
325 </help> | |
326 </tool> | |
327 |