changeset 24:ee10017fcd80 draft

Uploaded v0.2.5 preview 8, omitted README.rst in last upload
author peterjc
date Tue, 17 Sep 2013 12:06:15 -0400
parents 1f7678c1f4ad
children 41a42022f815
files tools/protein_analysis/README.rst
diffstat 1 files changed, 192 insertions(+), 0 deletions(-) [+]
line wrap: on
line diff
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/tools/protein_analysis/README.rst	Tue Sep 17 12:06:15 2013 -0400
@@ -0,0 +1,192 @@
+This package contains Galaxy wrappers for a selection of standalone command
+line protein analysis tools:
+
+* SignalP 3.0, THMHMM 2.0, Promoter 2.0 from the Center for Biological
+  Sequence Analysis at the Technical University of Denmark,
+  http://www.cbs.dtu.dk/cbs/
+
+* WoLF PSORT v0.2 from http://wolfpsort.org/
+
+* PSORTb v3 from http://www.psort.org/downloads/index.html
+
+Also, the RXLR motif tool uses SignalP 3.0 and HMMER 2.3.2 internally.
+
+To use these Galaxy wrappers you must first install the command line tools.
+At the time of writing they are all free for academic use, or open source.
+
+These wrappers are copyright 2010-2013 by Peter Cock, James Hutton Institute
+(formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
+Contributions/revisions copyright 2011 Konrad Paszkiewicz. All rights reserved.
+See the included LICENCE file for details (MIT open source licence).
+
+The wrappers are available from the Galaxy Tool Shed
+http://toolshed.g2.bx.psu.edu/view/peterjc/tmhmm_and_signalp 
+
+Citation
+========
+
+If you use any of these Galaxy tools in work leading to a scientific
+publication, in addition to citing the invididual underlying tools, please cite:
+
+Peter Cock, Bjoern Gruening, Konrad Paszkiewicz and Leighton Pritchard (2013).
+Galaxy tools and workflows for sequence analysis with applications
+in molecular plant pathology. PeerJ 1:e167
+http://dx.doi.org/10.7717/peerj.167
+
+Full reference information is included in the help text for each tool.
+
+
+Requirements
+============
+
+First install those command line tools you wish to use the wrappers for:
+
+1. Install the command line version of SignalP 3.0 and ensure "signalp" is
+   on the PATH, see: http://www.cbs.dtu.dk/services/SignalP/
+
+2. Install the command line version of TMHMM 2.0 and ensure "tmhmm" is on
+   the PATH, see: http://www.cbs.dtu.dk/services/TMHMM/
+
+3. Install the command line version of Promoter 2.0 and ensure "promoter" is
+   on the PATH, see: http://www.cbs.dtu.dk/services/Promoter
+
+4. Install the WoLF PSORT v0.2 package, and ensure "runWolfPsortSummary"
+   is on the PATH (we use an extra wrapper script to change to the WoLF PSORT
+   directory, run runWolfPsortSummary, and then change back to the original
+   directory), see: http://wolfpsort.org/WoLFPSORT_package/version0.2/
+
+5. Install hmmsearch from HMMER 2.3.2 (the last stable release of HMMER 2)
+   but put it on the path under the name hmmsearch2 (allowing it to co-exist
+   with HMMER 3), or edit rlxr_motif.py accordingly.
+
+Verify each of the tools is installed and working from the command line
+(when logged in as the Galaxy user if appropriate).
+
+
+Manual Installation
+===================
+
+1. Create a folder tools/protein_analysis under your Galaxy installation.
+   This folder name is not critical, and can be changed if desired - you
+   must update the paths used in tool_conf.xml to match.
+
+2. Copy/move the following files (from this archive) there:
+
+   * tmhmm2.xml (Galaxy tool definition)
+   * tmhmm2.py (Python wrapper script)
+
+   * signalp3.xml (Galaxy tool definition)
+   * signalp3.py (Python wrapper script)
+
+   * promoter2.xml (Galaxy tool definition)
+   * promoter2.py (Python wrapper script)
+
+   * psortb.xml (Galaxy tool definition)
+   * psortb.py (Python wrapper script)
+
+   * wolf_psort.xml (Galaxy tool definition)
+   * wolf_psort.py (Python wrapper script)
+
+   * rxlr_motifs.xml (Galaxy tool definition)
+   * rxlr_motifs.py (Python script)
+
+   * seq_analysis_utils.py (shared Python code)
+   * LICENCE
+   * README.rst (this file)
+
+3. Edit your Galaxy conjuration file tool_conf.xml (to use the tools) AND
+   also tool_conf.xml.sample (to run the tests) to include the new tools
+   by adding::
+
+    <section name="Protein sequence analysis" id="protein_analysis">
+      <tool file="protein_analysis/tmhmm2.xml" />
+      <tool file="protein_analysis/signalp3.xml" />
+      <tool file="protein_analysis/psortb.xml" />
+      <tool file="protein_analysis/wolf_psort.xml" />
+      <tool file="protein_analysis/rxlr_motifs.xml" />
+    </section>
+    <section name="Nucleotide sequence analysis" id="nucleotide_analysis">
+      <tool file="protein_analysis/promoter2.xml" />
+    </section>
+
+   Leave out the lines for any tools you do not wish to use in Galaxy.
+
+4. Copy/move the test-data files (from this archive) to Galaxy's
+   subfolder test-data.
+
+5. Run the Galaxy functional tests for these new wrappers with::
+
+    ./run_functional_tests.sh -id tmhmm2
+    ./run_functional_tests.sh -id signalp3
+    ./run_functional_tests.sh -id Psortb
+    ./run_functional_tests.sh -id rxlr_motifs
+
+   Alternatively, this should work (assuming you left the name and id as shown in
+   the XML file tool_conf.xml.sample)::
+
+    ./run_functional_tests.sh -sid Protein_sequence_analysis-protein_analysis
+
+   To check the section ID expected, use ./run_functional_tests.sh -list
+
+6. Restart Galaxy and check the new tools are shown and work.
+
+
+History
+=======
+
+======= ======================================================================
+Version Changes
+------- ----------------------------------------------------------------------
+v0.0.1  - Initial release
+v0.0.2  - Corrected some typos in the help text
+        - Renamed test output file to use Galaxy convention of *.tabular
+v0.0.3  - Check for tmhmm2 silent failures (no output)
+        - Additional unit tests
+v0.0.4  - Ignore comment lines in tmhmm2 output.
+v0.0.5  - Explicitly request tmhmm short output (may not be the default)
+v0.0.6  - Improvement to how sub-jobs are run (should be faster)
+v0.0.7  - Change SignalP default truncation from 60 to 70 to match the
+          SignalP webservice.
+v0.0.8  - Added WoLF PSORT wrapper to the suite.
+v0.0.9  - Added our RXLR motifs tool to the suite.
+v0.1.0  - Added Promoter 2.0 wrapper (similar to SignalP & TMHMM wrappers)
+        - Support Galaxy's <parallelism> tag for SignalP, TMHMM & Promoter
+v0.1.1  - Fixed an error in the header of the tabular output from Promoter
+v0.1.2  - Use the new <stdio> settings in the XML wrappers to catch errors
+        - Use SGE style $NSLOTS for thread count (otherwise default to 4)
+v0.1.3  - Added missing file whisson_et_al_rxlr_eer_cropped.hmm to Tool Shed
+v0.2.0  - Added PSORTb wrapper to the suite, based on earlier work
+          contributed by Konrad Paszkiewicz.
+v0.2.1  - Use a script to create the Tool Shed tar-ball (removed some stray
+          files accidentally included previously via a wildcard).
+v0.2.2  - Include missing test files.
+v0.2.3  - Added unit tests for WoLF PSORT.
+v0.2.4  - Added unit tests for Promoter 2
+v0.2.5  - Link to Tool Shed added to help text and this documentation.
+        - More unit tests.
+        - Fixed bug with RXLR tool and empty FASTA files.
+        - Fixed typo in the RXLR tool help text.
+        - Updated citation information (Cock et al. 2013).
+        - Adopted standard MIT licence.
+        - Use reStructuredText for this README file.
+        - Development moved to GitHub, https://github.com/peterjc/pico_galaxy
+======= ======================================================================
+
+
+Developers
+==========
+
+This script and other tools are being developed on the following hg branches:
+http://bitbucket.org/peterjc/galaxy-central/src/seq_analysis
+http://bitbucket.org/peterjc/galaxy-central/src/tools
+
+Development has now moved to a dedicated GitHub repository:
+https://github.com/peterjc/pico_galaxy/tree/master/tools
+
+For making the "Galaxy Tool Shed" http://community.g2.bx.psu.edu/ tarball use
+the following command from the Galaxy root folder::
+
+    $ ./tools/protein_analysis/make_tmhmm_and_signalp.sh
+
+This simplifies ensuring a consistent set of files is bundled each time,
+including all the relevant test files.