Mercurial > repos > peterjc > blast_rbh
annotate tools/blast_rbh/README.rst @ 42:2bca27f8abb2 draft default tip
v0.3.0 - Updated to BLAST+ 2.9.0 via conda; removed legacy tool packages.
author | peterjc |
---|---|
date | Wed, 07 Dec 2022 09:58:43 +0000 |
parents | 38fe3aaa15aa |
children |
rev | line source |
---|---|
13 | 1 Find BLAST Reciprocal Best Hits (RBH), with Galaxy wrapper |
2 ========================================================== | |
0 | 3 |
31
a5106b6c07a8
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit de1ad30e756a7073c422d5677fe5c6750521fa1b-dirty
peterjc
parents:
30
diff
changeset
|
4 This tool is copyright 2011-2017 by Peter Cock, The James Hutton Institute |
0 | 5 (formerly SCRI, Scottish Crop Research Institute), UK. All rights reserved. |
6 See the licence text below. | |
7 | |
8 This tool is a short Python script to run reciprocal BLAST searches on a | |
13 | 9 pair of sequence files, and extract the reciprocal best hits. The script |
10 ``blast_rbh.py`` can be used directly (without Galaxy) as long as NCBI | |
11 BLAST+ is installed. | |
0 | 12 |
13 | 13 It comes with an optional Galaxy tool definition file ``blast_rbh.xml`` |
14 allowing the Python script to be run from within Galaxy. It is available | |
15 from the Galaxy Tool Shed at: | |
12 | 16 http://toolshed.g2.bx.psu.edu/view/peterjc/blast_rbh |
0 | 17 |
18 | |
18
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
19 Citation |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
20 ======== |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
21 |
21 | 22 Please cite the following paper: |
18
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
23 |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
24 NCBI BLAST+ integrated into Galaxy. |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
25 P.J.A. Cock, J.M. Chilton, B. Gruening, J.E. Johnson, N. Soranzo |
21 | 26 *GigaScience* 2015, 4:39 |
35
5c78f5e6cdf2
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 1b9ec80b23b7fc9d3ed95afba3bc3b57dc3f47c2-dirty
peterjc
parents:
34
diff
changeset
|
27 https://doi.org/10.1186/s13742-015-0080-7 |
18
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
28 |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
29 You should also cite the NCBI BLAST+ tools: |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
30 |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
31 BLAST+: architecture and applications. |
21 | 32 C. Camacho et al. *BMC Bioinformatics* 2009, 10:421. |
35
5c78f5e6cdf2
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 1b9ec80b23b7fc9d3ed95afba3bc3b57dc3f47c2-dirty
peterjc
parents:
34
diff
changeset
|
33 https://doi.org/10.1186/1471-2105-10-421 |
18
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
34 |
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
35 |
0 | 36 Automated Installation |
37 ====================== | |
38 | |
39 Installation via the Galaxy Tool Shed should take care of the Galaxy side of | |
40 things, including the dependency the NCBI BLAST+ binaries. | |
41 | |
42 | |
43 Manual Installation | |
44 =================== | |
45 | |
39
64369ee3aaf0
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 0a172d2516a9d7e8d8fa26dff24c0c906b9d0c6a-dirty
peterjc
parents:
35
diff
changeset
|
46 There are just three files to install: |
0 | 47 |
48 - ``blast_rbh.py`` (the Python script) | |
39
64369ee3aaf0
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 0a172d2516a9d7e8d8fa26dff24c0c906b9d0c6a-dirty
peterjc
parents:
35
diff
changeset
|
49 - ``best_hits.py`` (helper script, put in same directory) |
0 | 50 - ``blast_rbh.xml`` (the Galaxy tool definition) |
51 | |
52 The suggested location is in a ``tools/blast_rbh/`` folder. You will then | |
53 need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the tool | |
54 by adding the line:: | |
55 | |
56 <tool file="blast_rbh/blast_rbh.xml" /> | |
57 | |
13 | 58 If you want to run the functional tests, copy the sample test files under |
59 sample test files under Galaxy's ``test-data/`` directory. Then:: | |
0 | 60 |
13 | 61 ./run_tests.sh -id blast_reciprocal_best_hits |
0 | 62 |
63 You will need to have the NCBI BLAST+ binaries installed and on the ``$PATH``. | |
64 | |
65 | |
66 History | |
67 ======= | |
68 | |
69 ======= ====================================================================== | |
70 Version Changes | |
71 ------- ---------------------------------------------------------------------- | |
10 | 72 v0.1.0 - Initial Test Tool Shed release, targetting NCBI BLAST+ 2.2.29 |
8
d3eb5cda7270
Uploaded v0.1.2, self comparison and easier command line API
peterjc
parents:
6
diff
changeset
|
73 v0.1.1 - Supports self-comparison, sometimes useful for spotting duplicates. |
d3eb5cda7270
Uploaded v0.1.2, self comparison and easier command line API
peterjc
parents:
6
diff
changeset
|
74 v0.1.2 - Using optparse for command line API. |
11
4733e4ea4dab
Uploaded v0.1.2c, same defaults in blast_rbh.py as blast_rbh.xml
peterjc
parents:
10
diff
changeset
|
75 - Tool definition now embeds citation information. |
10 | 76 - Fixed Tool Shed dependency definition. |
12 | 77 v0.1.3 - Option to make FASTA files non-redundant (via Biopython dependency). |
78 - Avoid extra database and BLAST search in self-comparison mode. | |
13 | 79 v0.1.4 - Check for duplicate FASTA identifiers (workaround for makeblastdb |
80 not treating this as an error, leading to confusing RBH output). | |
81 v0.1.5 - Clarify documentation for using the Python script outside Galaxy. | |
82 - Updated to depend on NCBI BLAST+ 2.2.30 via ToolShed install. | |
15 | 83 v0.1.6 - Offer the new blastp-fast task added in BLAST+ 2.2.30. |
18
f3ec931988fd
Uploaded 0.1.6c with the preprint citation (forgot to update source folder before making tar ball).
peterjc
parents:
17
diff
changeset
|
84 - Added "NCBI BLAST+ integrated into Galaxy" preprint citation. |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
85 v0.1.7 - Reorder XML elements (internal change only). |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
86 - Planemo for Tool Shed upload (``.shed.yml``, internal change only). |
21 | 87 - Updated citation information with GigaScience paper. |
23 | 88 v0.1.8 - Updated to depend on NCBI BLAST+ 2.2.31 via ToolShed install. |
89 v0.1.9 - Updates to the command line API for the Python script. | |
90 - PEP8 style updates to the Python script (internal change only). | |
91 - Fix parameter help text which was not being displayed. | |
28 | 92 v0.1.11 - Updated to depend on NCBI BLAST+ 2.5.0 via ToolShed or BioConda. |
93 - Update Biopython dependency. | |
94 - Tweak Python script to work under Python 2 or Python 3. | |
33
692366540172
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 21f6d9932d322034e4cea5fee23b24bf0b1e1e85-dirty
peterjc
parents:
31
diff
changeset
|
95 v0.1.12 - Use ``<command detect_errors="aggressive">`` (internal change only). |
692366540172
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 21f6d9932d322034e4cea5fee23b24bf0b1e1e85-dirty
peterjc
parents:
31
diff
changeset
|
96 - Single quote command line arguments (internal change only). |
39
64369ee3aaf0
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit 0a172d2516a9d7e8d8fa26dff24c0c906b9d0c6a-dirty
peterjc
parents:
35
diff
changeset
|
97 v0.2.0 - Refactored to use more than one Python file (internal change only). |
41
38fe3aaa15aa
"planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit eb20367d88fc595a7d274321494ef98e919f4747-dirty"
peterjc
parents:
39
diff
changeset
|
98 v0.3.0 - Updated to BLAST+ 2.9.0 via conda; removed legacy tool packages. |
0 | 99 ======= ====================================================================== |
100 | |
101 | |
102 Developers | |
103 ========== | |
104 | |
105 This tool is developed on the following GitHub repository: | |
106 https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh | |
107 | |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
108 For pushing a release to the test or main "Galaxy Tool Shed", use the following |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
109 Planemo commands (which requires you have set your Tool Shed access details in |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
110 ``~/.planemo.yml`` and that you have access rights on the Tool Shed):: |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
111 |
28 | 112 $ planemo shed_update -t testtoolshed --check_diff tools/blast_rbh/ |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
113 ... |
0 | 114 |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
115 or:: |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
116 |
28 | 117 $ planemo shed_update -t toolshed --check_diff tools/blast_rbh/ |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
118 ... |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
119 |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
120 To just build and check the tar ball, use:: |
0 | 121 |
28 | 122 $ planemo shed_upload --tar_only tools/blast_rbh/ |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
123 ... |
34
362c62ef2feb
planemo upload for repository https://github.com/peterjc/galaxy_blast/tree/master/tools/blast_rbh commit dc88568c900086550ce206dd83bad65e991df743-dirty
peterjc
parents:
33
diff
changeset
|
124 $ tar -tzf shed_upload.tar.gz |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
125 test-data/four_human_proteins.fasta |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
126 test-data/k12_edited_proteins.fasta |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
127 test-data/k12_ten_proteins.fasta |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
128 test-data/rbh_blastn_three_human_mRNA_vs_rhodopsin_nucs.tabular |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
129 test-data/rbh_blastp_four_human_vs_rhodopsin_proteins.tabular |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
130 test-data/rbh_blastp_k12.tabular |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
131 test-data/rbh_blastp_k12_self.tabular |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
132 test-data/rbh_megablast_rhodopsin_nucs_vs_three_human_mRNA.tabular |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
133 test-data/rbh_none.tabular |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
134 test-data/rbh_tblastx_rhodopsin_nucs_vs_three_human_mRNA.tabular |
0 | 135 test-data/rhodopsin_nucs.fasta |
1 | 136 test-data/rhodopsin_proteins.fasta |
0 | 137 test-data/three_human_mRNA.fasta |
20
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
138 tools/blast_rbh/README.rst |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
139 tools/blast_rbh/blast_rbh.py |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
140 tools/blast_rbh/blast_rbh.xml |
e8f8c580bcca
planemo upload for repository https://github.com/peterjc/galaxy_blast/tools/blast_rbh commit 05c2a834609f1fd02372c4f9b0a0733680fe9675-dirty
peterjc
parents:
18
diff
changeset
|
141 tools/blast_rbh/tool_dependencies.xml |
0 | 142 |
143 | |
144 Licence (MIT) | |
145 ============= | |
146 | |
147 Permission is hereby granted, free of charge, to any person obtaining a copy | |
148 of this software and associated documentation files (the "Software"), to deal | |
149 in the Software without restriction, including without limitation the rights | |
150 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell | |
151 copies of the Software, and to permit persons to whom the Software is | |
152 furnished to do so, subject to the following conditions: | |
153 | |
154 The above copyright notice and this permission notice shall be included in | |
155 all copies or substantial portions of the Software. | |
156 | |
157 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR | |
158 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, | |
159 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE | |
160 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER | |
161 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, | |
162 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN | |
163 THE SOFTWARE. |