Mercurial > repos > peterjc > tmhmm_and_signalp
annotate tools/protein_analysis/promoter2.xml @ 25:41a42022f815 draft
Uploaded v0.2.6, embedded citations
author | peterjc |
---|---|
date | Fri, 21 Nov 2014 08:17:36 -0500 |
parents | 90e3d02f8013 |
children | 20139cb4c844 |
rev | line source |
---|---|
25 | 1 <tool id="promoter2" name="Promoter 2.0" version="0.0.8"> |
7 | 2 <description>Find eukaryotic PolII promoters in DNA sequences</description> |
3 <!-- If job splitting is enabled, break up the query file into parts --> | |
4 <!-- Using 2000 per chunk so 4 threads each doing 500 is ideal --> | |
5 <parallelism method="basic" split_inputs="fasta_file" split_mode="to_size" split_size="2000" merge_outputs="tabular_file"></parallelism> | |
6 <command interpreter="python"> | |
25 | 7 promoter2.py "\$GALAXY_SLOTS" "$fasta_file" "$tabular_file" |
19
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
8 ##If the environment variable isn't set, get "", and the python wrapper |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
9 ##defaults to four threads. |
7 | 10 </command> |
11 <stdio> | |
12 <!-- Anything other than zero is an error --> | |
13 <exit_code range="1:" /> | |
14 <exit_code range=":-1" /> | |
15 </stdio> | |
16 <inputs> | |
17 <param name="fasta_file" type="data" format="fasta" label="FASTA file of DNA sequences"/> | |
18 </inputs> | |
19 <outputs> | |
20 <data name="tabular_file" format="tabular" label="Promoter2 on ${fasta_file.name}" /> | |
21 </outputs> | |
22 <requirements> | |
23 <requirement type="binary">promoter</requirement> | |
24 </requirements> | |
16 | 25 <tests> |
26 <test> | |
27 <param name="fasta_file" value="Adenovirus.fasta" ftype="fasta"/> | |
28 <output name="tabular_file" file="Adenovirus.promoter2.tabular" ftype="tabular"/> | |
29 </test> | |
30 <test> | |
31 <param name="fasta_file" value="empty.fasta" ftype="fasta"/> | |
32 <output name="tabular_file" file="empty_promoter2.tabular" ftype="tabular"/> | |
33 </test> | |
34 </tests> | |
7 | 35 <help> |
36 | |
37 **What it does** | |
38 | |
39 This calls the Promoter 2.0 tool for prediction of eukaryotic PolII promoter sequences using a Neural Network (NN) model. | |
40 | |
41 The input is a FASTA file of nucleotide sequences (e.g. upstream regions of your genes), and the output is tabular with five columns (one row per promoter): | |
42 | |
19
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
43 ====== ================================================== |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
44 Column Description |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
45 ------ -------------------------------------------------- |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
46 1 Sequence identifier (first word of FASTA header) |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
47 2 Promoter position, e.g. 600 |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
48 3 Promoter score, e.g. 1.063 |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
49 4 Promoter likelihood, e.g. Highly likely prediction |
4cd848c5590b
Uploaded v0.2.5 preview 3, use $NSLOTS in the PSORT wrappers.
peterjc
parents:
17
diff
changeset
|
50 ====== ================================================== |
7 | 51 |
52 The scores are classified very simply as follows: | |
53 | |
54 ========= ======================== | |
55 Score Description | |
56 --------- ------------------------ | |
57 below 0.5 ignored | |
58 0.5 - 0.8 Marginal prediction | |
59 0.8 - 1.0 Medium likely prediction | |
60 above 1.0 Highly likely prediction | |
61 ========= ======================== | |
62 | |
63 Internally the input FASTA file is divided into parts (to allow multiple processors to be used), and the raw output is reformatted into this tabular layout suitable for downstream analysis within Galaxy. | |
64 | |
65 **References** | |
66 | |
20
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
67 If you use this Galaxy tool in work leading to a scientific publication please |
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
68 cite the following papers: |
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
69 |
22
90e3d02f8013
Uploaded v0.2.5 preview 6, can use accents in <help> RST
peterjc
parents:
20
diff
changeset
|
70 Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013). |
20
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
71 Galaxy tools and workflows for sequence analysis with applications |
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
72 in molecular plant pathology. PeerJ 1:e167 |
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
73 http://dx.doi.org/10.7717/peerj.167 |
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
74 |
22
90e3d02f8013
Uploaded v0.2.5 preview 6, can use accents in <help> RST
peterjc
parents:
20
diff
changeset
|
75 Steen Knudsen (1999). |
7 | 76 Promoter2.0: for the recognition of PolII promoter sequences. |
20
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
77 Bioinformatics, 15:356-61. |
7 | 78 http://dx.doi.org/10.1093/bioinformatics/15.5.356 |
79 | |
20
a538e182fab3
Uploaded v0.2.5 preview 4, adding Cock et al. 2003 citation information.
peterjc
parents:
19
diff
changeset
|
80 See also http://www.cbs.dtu.dk/services/Promoter/output.php |
7 | 81 |
17
af3174637834
Uploaded v0.2.5 preview 1, adding links to Tool Shed entry
peterjc
parents:
16
diff
changeset
|
82 This wrapper is available to install into other Galaxy Instances via the Galaxy |
af3174637834
Uploaded v0.2.5 preview 1, adding links to Tool Shed entry
peterjc
parents:
16
diff
changeset
|
83 Tool Shed at http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp |
7 | 84 </help> |
25 | 85 <citations> |
86 <citation type="doi">10.7717/peerj.167</citation> | |
87 <citation type="doi">10.1093/bioinformatics/15.5.356</citation> | |
88 </citations> | |
7 | 89 </tool> |