annotate tools/protein_analysis/promoter2.xml @ 18:2b35b5c4b7f4 draft

Uploaded v0.2.5, preview 2, fixed bug in RXLR tools
author peterjc
date Fri, 10 May 2013 07:48:26 -0400
parents af3174637834
children 4cd848c5590b
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
17
af3174637834 Uploaded v0.2.5 preview 1, adding links to Tool Shed entry
peterjc
parents: 16
diff changeset
1 <tool id="promoter2" name="Promoter 2.0" version="0.0.5">
7
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
2 <description>Find eukaryotic PolII promoters in DNA sequences</description>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
3 <!-- If job splitting is enabled, break up the query file into parts -->
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
4 <!-- Using 2000 per chunk so 4 threads each doing 500 is ideal -->
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
5 <parallelism method="basic" split_inputs="fasta_file" split_mode="to_size" split_size="2000" merge_outputs="tabular_file"></parallelism>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
6 <command interpreter="python">
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
7 promoter2.py "\$NSLOTS" $fasta_file $tabular_file
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
8 ##Set the number of threads in the runner entry in universe_wsgi.ini
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
9 ##which (on SGE at least) will set the $NSLOTS environment variable.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
10 ##If the environment variable isn't set, get "", and defaults to one.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
11 </command>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
12 <stdio>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
13 <!-- Anything other than zero is an error -->
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
14 <exit_code range="1:" />
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
15 <exit_code range=":-1" />
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
16 </stdio>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
17 <inputs>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
18 <param name="fasta_file" type="data" format="fasta" label="FASTA file of DNA sequences"/>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
19 </inputs>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
20 <outputs>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
21 <data name="tabular_file" format="tabular" label="Promoter2 on ${fasta_file.name}" />
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
22 </outputs>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
23 <requirements>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
24 <requirement type="binary">promoter</requirement>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
25 </requirements>
16
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
26 <tests>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
27 <test>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
28 <param name="fasta_file" value="Adenovirus.fasta" ftype="fasta"/>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
29 <output name="tabular_file" file="Adenovirus.promoter2.tabular" ftype="tabular"/>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
30 </test>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
31 <test>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
32 <param name="fasta_file" value="empty.fasta" ftype="fasta"/>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
33 <output name="tabular_file" file="empty_promoter2.tabular" ftype="tabular"/>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
34 </test>
1d5486a02bc4 Uploaded v0.2.4, added unit tests for Promoter 2
peterjc
parents: 7
diff changeset
35 </tests>
7
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
36 <help>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
37
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
38 **What it does**
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
39
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
40 This calls the Promoter 2.0 tool for prediction of eukaryotic PolII promoter sequences using a Neural Network (NN) model.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
41
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
42 The input is a FASTA file of nucleotide sequences (e.g. upstream regions of your genes), and the output is tabular with five columns (one row per promoter):
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
43
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
44 1. Sequence identifier (first word of FASTA header)
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
45 2. Promoter position, e.g. 600
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
46 3. Promoter score, e.g. 1.063
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
47 4. Promoter likelihood, e.g. Highly likely prediction
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
48
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
49 The scores are classified very simply as follows:
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
50
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
51 ========= ========================
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
52 Score Description
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
53 --------- ------------------------
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
54 below 0.5 ignored
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
55 0.5 - 0.8 Marginal prediction
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
56 0.8 - 1.0 Medium likely prediction
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
57 above 1.0 Highly likely prediction
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
58 ========= ========================
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
59
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
60 Internally the input FASTA file is divided into parts (to allow multiple processors to be used), and the raw output is reformatted into this tabular layout suitable for downstream analysis within Galaxy.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
61
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
62 **References**
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
63
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
64 Knudsen.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
65 Promoter2.0: for the recognition of PolII promoter sequences.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
66 Bioinformatics, 15:356-61, 1999.
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
67 http://dx.doi.org/10.1093/bioinformatics/15.5.356
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
68
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
69 http://www.cbs.dtu.dk/services/Promoter/output.php
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
70
17
af3174637834 Uploaded v0.2.5 preview 1, adding links to Tool Shed entry
peterjc
parents: 16
diff changeset
71 This wrapper is available to install into other Galaxy Instances via the Galaxy
af3174637834 Uploaded v0.2.5 preview 1, adding links to Tool Shed entry
peterjc
parents: 16
diff changeset
72 Tool Shed at http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp
7
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
73 </help>
5e62aefb2918 Uploaded v0.1.2 to Test Tool Shed
peterjc
parents:
diff changeset
74 </tool>