annotate tools/blast_rbh/README.rst @ 13:4454596ed127 draft

Uploaded
author peterjc
date Thu, 30 Oct 2014 14:20:48 -0400
parents 661276ad882e
children 40c85a67e645
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
13
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
1 Find BLAST Reciprocal Best Hits (RBH), with Galaxy wrapper
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
2 ==========================================================
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
3
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
4 This tool is copyright 2011-2014 by Peter Cock, The James Hutton Institute
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved.
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
6 See the licence text below.
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
7
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
8 This tool is a short Python script to run reciprocal BLAST searches on a
13
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
9 pair of sequence files, and extract the reciprocal best hits. The script
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
10 ``blast_rbh.py`` can be used directly (without Galaxy) as long as NCBI
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
11 BLAST+ is installed.
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
12
13
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
13 It comes with an optional Galaxy tool definition file ``blast_rbh.xml``
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
14 allowing the Python script to be run from within Galaxy. It is available
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
15 from the Galaxy Tool Shed at:
12
661276ad882e Uploaded v0.1.3, uses Biopython for making NR
peterjc
parents: 11
diff changeset
16 http://toolshed.g2.bx.psu.edu/view/peterjc/blast_rbh
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
17
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
18
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
19 Automated Installation
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
20 ======================
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
21
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
22 Installation via the Galaxy Tool Shed should take care of the Galaxy side of
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
23 things, including the dependency the NCBI BLAST+ binaries.
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
24
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
25
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
26 Manual Installation
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
27 ===================
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
28
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
29 There are just two files to install:
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
30
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
31 - ``blast_rbh.py`` (the Python script)
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
32 - ``blast_rbh.xml`` (the Galaxy tool definition)
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
33
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
34 The suggested location is in a ``tools/blast_rbh/`` folder. You will then
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
35 need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the tool
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
36 by adding the line::
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
37
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
38 <tool file="blast_rbh/blast_rbh.xml" />
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
39
13
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
40 If you want to run the functional tests, copy the sample test files under
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
41 sample test files under Galaxy's ``test-data/`` directory. Then::
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
42
13
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
43 ./run_tests.sh -id blast_reciprocal_best_hits
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
44
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
45 You will need to have the NCBI BLAST+ binaries installed and on the ``$PATH``.
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
46
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
47
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
48 History
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
49 =======
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
50
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
51 ======= ======================================================================
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
52 Version Changes
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
53 ------- ----------------------------------------------------------------------
10
982458013bdb Uploaded v0.1.2b, fixed dependency definition
peterjc
parents: 9
diff changeset
54 v0.1.0 - Initial Test Tool Shed release, targetting NCBI BLAST+ 2.2.29
8
d3eb5cda7270 Uploaded v0.1.2, self comparison and easier command line API
peterjc
parents: 6
diff changeset
55 v0.1.1 - Supports self-comparison, sometimes useful for spotting duplicates.
d3eb5cda7270 Uploaded v0.1.2, self comparison and easier command line API
peterjc
parents: 6
diff changeset
56 v0.1.2 - Using optparse for command line API.
11
4733e4ea4dab Uploaded v0.1.2c, same defaults in blast_rbh.py as blast_rbh.xml
peterjc
parents: 10
diff changeset
57 - Tool definition now embeds citation information.
10
982458013bdb Uploaded v0.1.2b, fixed dependency definition
peterjc
parents: 9
diff changeset
58 - Fixed Tool Shed dependency definition.
12
661276ad882e Uploaded v0.1.3, uses Biopython for making NR
peterjc
parents: 11
diff changeset
59 v0.1.3 - Option to make FASTA files non-redundant (via Biopython dependency).
661276ad882e Uploaded v0.1.3, uses Biopython for making NR
peterjc
parents: 11
diff changeset
60 - Avoid extra database and BLAST search in self-comparison mode.
13
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
61 v0.1.4 - Check for duplicate FASTA identifiers (workaround for makeblastdb
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
62 not treating this as an error, leading to confusing RBH output).
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
63 v0.1.5 - Clarify documentation for using the Python script outside Galaxy.
4454596ed127 Uploaded
peterjc
parents: 12
diff changeset
64 - Updated to depend on NCBI BLAST+ 2.2.30 via ToolShed install.
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
65 ======= ======================================================================
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
66
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
67
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
68 Developers
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
69 ==========
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
70
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
71 This tool is developed on the following GitHub repository:
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
72 https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
73
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
74 For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball I use
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
75 the following command from the Galaxy root folder::
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
76
10
982458013bdb Uploaded v0.1.2b, fixed dependency definition
peterjc
parents: 9
diff changeset
77 $ tar -czf blast_rbh.tar.gz tools/blast_rbh/README.rst tools/blast_rbh/blast_rbh.xml tools/blast_rbh/blast_rbh.py tools/blast_rbh/tool_dependencies.xml test-data/rhodopsin_nucs.fasta test-data/rhodopsin_proteins.fasta test-data/three_human_mRNA.fasta test-data/four_human_proteins.fasta test-data/k12_edited_proteins.fasta test-data/k12_ten_proteins.fasta test-data/rbh_megablast_rhodopsin_nucs_vs_three_human_mRNA.tabular test-data/rbh_blastn_three_human_mRNA_vs_rhodopsin_nucs.tabular test-data/rbh_blastp_four_human_vs_rhodopsin_proteins.tabular test-data/rbh_none.tabular test-data/rbh_tblastx_rhodopsin_nucs_vs_three_human_mRNA.tabular test-data/rbh_blastp_k12.tabular test-data/rbh_blastp_k12_self.tabular
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
78
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
79 Check this worked::
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
80
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
81 $ tar -tzf blast_rbh.tar.gz
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
82 tools/blast_rbh/README.rst
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
83 tools/blast_rbh/blast_rbh.xml
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
84 tools/blast_rbh/blast_rbh.py
10
982458013bdb Uploaded v0.1.2b, fixed dependency definition
peterjc
parents: 9
diff changeset
85 tools/blast_rbh/tool_dependencies.xml
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
86 test-data/rhodopsin_nucs.fasta
1
a68f4e5789d7 Uploaded v0.1.0a, including a blastp test.
peterjc
parents: 0
diff changeset
87 test-data/rhodopsin_proteins.fasta
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
88 test-data/three_human_mRNA.fasta
1
a68f4e5789d7 Uploaded v0.1.0a, including a blastp test.
peterjc
parents: 0
diff changeset
89 test-data/four_human_proteins.fasta
6
e47960bcdccb Uploaded v0.1.0f, exclude tied best hits
peterjc
parents: 5
diff changeset
90 test-data/k12_edited_proteins.fasta
e47960bcdccb Uploaded v0.1.0f, exclude tied best hits
peterjc
parents: 5
diff changeset
91 test-data/k12_ten_proteins.fasta
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
92 test-data/rbh_megablast_rhodopsin_nucs_vs_three_human_mRNA.tabular
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
93 test-data/rbh_blastn_three_human_mRNA_vs_rhodopsin_nucs.tabular
1
a68f4e5789d7 Uploaded v0.1.0a, including a blastp test.
peterjc
parents: 0
diff changeset
94 test-data/rbh_blastp_four_human_vs_rhodopsin_proteins.tabular
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
95 test-data/rbh_none.tabular
5
c84b6c21e3d4 Uploaded v0.1.0e, test TBLASTX mode; more columns in output
peterjc
parents: 1
diff changeset
96 test-data/rbh_tblastx_rhodopsin_nucs_vs_three_human_mRNA.tabular
6
e47960bcdccb Uploaded v0.1.0f, exclude tied best hits
peterjc
parents: 5
diff changeset
97 test-data/rbh_blastp_k12.tabular
9
95190ebd4657 Uploaded v0.1.2a, include missing test file
peterjc
parents: 8
diff changeset
98 test-data/rbh_blastp_k12_self.tabular
0
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
99
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
100
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
101 Licence (MIT)
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
102 =============
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
103
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
104 Permission is hereby granted, free of charge, to any person obtaining a copy
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
105 of this software and associated documentation files (the "Software"), to deal
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
106 in the Software without restriction, including without limitation the rights
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
107 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
108 copies of the Software, and to permit persons to whom the Software is
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
109 furnished to do so, subject to the following conditions:
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
110
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
111 The above copyright notice and this permission notice shall be included in
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
112 all copies or substantial portions of the Software.
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
113
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
114 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
115 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
116 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
117 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
118 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
119 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
a96608a125fb Uploaded v0.1.0, first release
peterjc
parents:
diff changeset
120 THE SOFTWARE.