annotate antismash.xml @ 25:99435bb90725 draft

Uploaded
author bgruening
date Fri, 18 Oct 2013 03:35:17 -0400
parents b942dfd80819
children a5eb6fad2a76
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
20
ca150c701c39 Uploaded
bgruening
parents: 19
diff changeset
1 <tool id="antismash" name="Secondary Metabolites" version="2.0.2.1">
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
2 <description>and Antibiotics Analysis (antiSMASH)</description>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
3 <requirements>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
4 <requirement type="package" version="3.0">hmmer</requirement>
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
5 <requirement type="package">hmmer</requirement>
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
6 <requirement type="package" version="2.2.28">blast+</requirement>
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
7 <requirement type="package">blast+</requirement>
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
8 <requirement type="package" version="3.8.31">muscle</requirement>
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
9 <requirement type="package">muscle</requirement>
17
213d82f33fc2 Uploaded
bgruening
parents: 16
diff changeset
10 <requirement type="package" version="1.4.0-post-1">straight.plugin</requirement>
19
3d9f00b883a3 Uploaded
bgruening
parents: 17
diff changeset
11 <requirement type="package" version="1.62">biopython</requirement>
3d9f00b883a3 Uploaded
bgruening
parents: 17
diff changeset
12 <requirement type="package" version="1.2.6">pyquery</requirement>
17
213d82f33fc2 Uploaded
bgruening
parents: 16
diff changeset
13 <requirement type="package" version="0.1.2">helperlibs</requirement>
21
93b2baa4b9d6 Uploaded
bgruening
parents: 20
diff changeset
14 <requirement type="package" version="0.9">cssselect</requirement>
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
15 <requirement type="package" version="2.0.2">antismash</requirement>
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
16 <requirement type="package">antismash</requirement>
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
17 </requirements>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
18 <command>
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
19 #import os, glob
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
20 #set $outputfolder = $html.files_path
23
d26e8aa37ce9 Uploaded
bgruening
parents: 22
diff changeset
21 #if str($infile.ext) == 'genbank':
d26e8aa37ce9 Uploaded
bgruening
parents: 22
diff changeset
22 #set $file_extension = 'gb'
d26e8aa37ce9 Uploaded
bgruening
parents: 22
diff changeset
23 #else:
d26e8aa37ce9 Uploaded
bgruening
parents: 22
diff changeset
24 ## TODO add embl as input file
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
25 #set $file_extension = 'gb'
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
26 #end if
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
27
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
28 ln -s $infile #echo 'input_tempfile.' + $file_extension#;
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
29 mkdir -p $outputfolder;
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
30 run_antismash.py
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
31 --cpus 12
13
9cfa2fb488b0 Uploaded
bgruening
parents: 11
diff changeset
32 --enable $types
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
33 --input-type 'nucl'
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
34 $smcogs
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
35 $clusterblast
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
36 $subclusterblast
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
37 $inclusive
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
38 $full_hmmer
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
39 $full_blast
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
40 $eukaryotic
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
41
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
42
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
43 #if str($pfam_database) != "None":
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
44 --pfamdir $pfam_database.fields.path
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
45 #end if
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
46
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
47 --debug
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
48
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
49 --disable-embl
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
50 --outputfolder $outputfolder
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
51
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
52 #echo 'input_tempfile.' + $file_extension#
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
53
9
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
54 ## leave out the start and end features, it can be easily replaced with Galaxy tools
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
55 ##--from START Start analysis at nucleotide specified
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
56 ##--to END
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
57
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
58 2>&#38;1
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
59
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
60 ##
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
61 ## shuffling files to create the correct outputs for Galaxy
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
62 ##
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
63
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
64 ## html output
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
65 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
66 cp #echo os.path.join($outputfolder, 'index.html')# $html 2> /dev/null
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
67
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
68 ## gene clusters
16
e2df6fe8b447 Uploaded
bgruening
parents: 15
diff changeset
69 #if 'geneclusterprots_tabular' in str($outputs).split(','):
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
70 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
71 cp #echo os.path.join($outputfolder, 'geneclusters.txt')# $geneclusterprots_tabular 2> /dev/null
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
72 #end if
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
73
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
74 #if 'geneclusterprots_fasta' in str($outputs).split(','):
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
75 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
76 cp #echo os.path.join($outputfolder, '*_genecluster_proteins.fa')# $geneclusterprots_fasta 2> /dev/null
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
77 #end if
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
78
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
79
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
80 ##SVG images
16
e2df6fe8b447 Uploaded
bgruening
parents: 15
diff changeset
81 #if 'archive_svgs' in str($outputs).split(','):
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
82 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
83 cd #echo os.path.join($outputfolder, 'svg')#
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
84 #if $clusterblast:
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
85 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
86 tar cfz $archive_svgs *_all.svg genecluster* 2> /dev/null
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
87 #else:
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
88 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
89 tar cfz $archive_svgs genecluster*
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
90 #end if
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
91 #end if
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
92
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
93 ##all files in a archive
16
e2df6fe8b447 Uploaded
bgruening
parents: 15
diff changeset
94 #if 'archive' in str($outputs).split(','):
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
95 ;
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
96 cd $outputfolder;
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
97 tar cf $archive *.zip 2> /dev/null
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
98 #end if
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
99
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
100 ## genbank
16
e2df6fe8b447 Uploaded
bgruening
parents: 15
diff changeset
101 #if 'gb' in str($outputs).split(','):
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
102 ;
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
103 cat #echo os.path.join($outputfolder, '*.gbk')# > $genbank 2> /dev/null
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
104 #end if
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
105
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
106 </command>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
107 <inputs>
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
108 <param name="infile" type="data" format="genbank" label="Nucleotide sequence file in GenBank format"/>
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
109
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
110 <param name="eukaryotic" type="select" label="Origin of DNA">
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
111 <option value="" selected="True">Prokaryotic</option>
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
112 <option value="--eukaryotic">Eukaryotic</option>
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
113 </param>
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
114
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
115 <param name="clusterblast" type="boolean" label="BLAST identified clusters against known clusters"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
116 help="(--clusterblast)"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
117 truevalue="--clusterblast" falsevalue="" checked="True" />
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
118 <param name="subclusterblast" type="boolean" label="Subcluster BLAST analysis"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
119 help="(--subclusterblast)"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
120 truevalue="--subclusterblast" falsevalue="" checked="false" />
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
121 <param name="smcogs" type="boolean" label="Analysis of secondary metabolism gene families (smCOGs)"
11
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
122 falsevalue="" truevalue="--smcogs" checked="True" />
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
123
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
124 <param name="full_blast" type="boolean" label="Run a whole-genome BLAST analysis"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
125 help="(--full-blast)"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
126 truevalue="--full-blast" falsevalue="" checked="False" />
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
127 <param name="full_hmmer" type="boolean" label="Run a whole-genome Pfam analysis"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
128 help="(--full-hmmer)"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
129 truevalue="--full-hmmer" falsevalue="" checked="false" />
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
130
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
131 <param name="inclusive" type="boolean" label="Use inclusive algorithm for cluster detection"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
132 help="(--inclusive)"
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
133 truevalue="--inclusive" falsevalue="" checked="false" />
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
134
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
135 <param name="pfam_database" type="select" optional="true" label="Pfam database" help="Pfam Covariance models">
4
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
136 <options from_file="antismash.loc">
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
137 <column name="value" index="0"/>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
138 <column name="name" index="1"/>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
139 <column name="path" index="2"/>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
140 </options>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
141 </param>
9b91d26ee080 Uploaded
bgruening
parents: 0
diff changeset
142
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
143 <param name="types" type="select" display="checkboxes" multiple="true" label="Gene cluster types to search">
9
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
144 <option value="t1pks" selected="True">type I polyketide synthases</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
145 <option value="t2pks" selected="True">type II polyketide synthases</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
146 <option value="t3pks" selected="True">type III polyketide synthases</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
147 <option value="t4pks" selected="True">type IV polyketide synthases</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
148 <option value="transatpks" selected="True">trans-AT PKS</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
149 <option value="nrps" selected="True">nonribosomal peptide synthetases</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
150 <option value="terpene" selected="True">terpene synthases</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
151 <option value="lantipeptide" selected="True">lantipeptides</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
152 <option value="bacteriocin" selected="True">bacteriocins</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
153 <option value="blactam" selected="True">beta-lactams</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
154 <option value="amglyccycl" selected="True">aminoglycosides / aminocyclitols</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
155 <option value="aminocoumarin" selected="True">aminocoumarins</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
156 <option value="siderophore" selected="True">siderophores</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
157 <option value="ectoine" selected="True">ectoines</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
158 <option value="butyrolactone" selected="True">butyrolactones</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
159 <option value="indole" selected="True">indoles</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
160 <option value="nucleoside" selected="True">nucleosides</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
161 <option value="phosphoglycolipid" selected="True">phosphoglycolipids</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
162 <option value="oligosaccharide" selected="True">oligosaccharides</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
163 <option value="furan" selected="True">furans</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
164 <option value="hserlactone" selected="True">hserlactones</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
165 <option value="thiopeptide" selected="True">thiopeptides</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
166 <option value="phenazine" selected="True">phenazines</option>
b11e1dfbc7c9 Uploaded
bgruening
parents: 5
diff changeset
167 <option value="phosphonate" selected="True">phosphonates</option>
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
168 <option value="other" selected="True">others</option>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
169 </param>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
170
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
171 <param name="outputs" type="select" multiple="true" label="Additional outputs">
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
172 <option value="geneclusterprots_fasta" selected="True">Gene cluster proteins (FASTA)</option>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
173 <option value="geneclusterprots_tabular">Gene cluster proteins (Tabular)</option>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
174 <option value="archive_svgs">All clusters as image (compressed)</option>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
175 <option value="archive">All files compressed</option>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
176 <option value="gb">Annotated genome (GenBank)</option>
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
177 </param>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
178
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
179 </inputs>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
180 <outputs>
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
181 <data format="fasta" name="geneclusterprots_fasta" label="${tool.name} on ${on_string} (Gen Cluster Proteins)">
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
182 <filter>'geneclusterprots_fasta' in outputs</filter>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
183 </data>
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
184 <data format="tabular" name="geneclusterprots_tabular" label="${tool.name} on ${on_string} (Gen Cluster Proteins)">
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
185 <filter>'geneclusterprots_tabular' in outputs</filter>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
186 </data>
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
187 <data format="tar" name="archive" label="${tool.name} on ${on_string} (all files compressed)">
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
188 <filter>'archive' in outputs</filter>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
189 </data>
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
190 <data format="tar.gz" name="archive_svgs" label="${tool.name} on ${on_string} (SVG images)">
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
191 <filter>'archive_svgs' in outputs</filter>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
192 </data>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
193 <data format="html" name="html" label="${tool.name} on ${on_string} (html report)">
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
194 <!-- html is default output at any time.
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
195 <filter>'html' in outputs</filter>
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
196 -->
15
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
197 </data>
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
198 <data name="genbank" format="genbank" label="${tool.name} on ${on_string} (genbank)">
9a099b9b6857 Uploaded
bgruening
parents: 13
diff changeset
199 <filter>'gb' in outputs</filter>
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
200 </data>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
201 </outputs>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
202 <help>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
203
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
204 .. class:: infomark
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
205
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
206 **What it does**
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
207
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
208 antiSMASH allows the rapid genome-wide identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genomes.
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
209 It integrates and cross-links with a large number of in silico secondary metabolite analysis tools that have been published earlier.
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
210
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
211
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
212 **Input**
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
213
22
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
214 The ideal input for antiSMASH is an annotated nucleotide file in Genbank format. If no annotation is available,
225d40beff1a Uploaded
bgruening
parents: 21
diff changeset
215 we recommend running your sequence through an annotation pipeline like RAST are the one included in Galaxy.
11
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
216
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
217
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
218 There are several optional analyses that may or may not be run on your sequence.
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
219 Highly recommended is the Gene Cluster Blast Comparative Analysis, which runs BlastP using each amino acid sequence from a detected gene cluster as a
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
220 query on a large database of predicted protein sequences from secondary metabolite biosynthetic gene clusters, and pools the results to identify
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
221 the gene clusters that are most homologous to the gene cluster that was detected in your query nucleotide sequence.
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
222
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
223
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
224 Also available is the analysis of secondary metabolism gene families (smCOGs).
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
225 This analysis attempts to allocate each gene in the detected gene clusters to a secondary metabolism-specific gene
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
226 family using profile hidden Markov models specific for the conserved sequence region characteristic of this family.
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
227 Additionally, a phylogenetic tree is constructed of each gene together with the (max. 100) sequences of the smCOG seed alignment.
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
228
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
229
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
230 For the most thorough genome analysis, we provide genome-wide PFAM HMM analysis of all genes in the genome through modules of the CLUSEAN pipeline.
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
231 Of course, some regions important to secondary metabolism may have been missed in the gene cluster identification stage
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
232 (e.g. because they represent the biosynthetic pathway of a yet unknown secondary metabolite).
d2c2eb518142 Uploaded
bgruening
parents: 9
diff changeset
233 Therefore, when genome-wide PFAM HMM analysis is selected, the PFAM frequencies are also used to find all genome regions in which PFAM domains typical for secondary metabolism are overrepresented.
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
234
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
235
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
236 **References**
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
237
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
238 Marnix H. Medema, Kai Blin, Peter Cimermancic, Victor de Jager, Piotr Zakrzewski, Michael A. Fischbach, Tilmann Weber,
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
239 Rainer Breitling and Eriko Takano (2011). antiSMASH: Rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters. Nucleic Acids Research, doi: 10.1093/nar/gkr466.
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
240
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
241 http://antismash.secondarymetabolites.org/help.html
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
242
25
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
243
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
244 Bjoern A. Gruening: https://github.com/bgruening/galaxytools/tree/master/antismash
99435bb90725 Uploaded
bgruening
parents: 24
diff changeset
245
0
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
246 </help>
7ad005dfbe78 Uploaded
bgruening
parents:
diff changeset
247 </tool>