0
|
1 Revision history for Perl extension Bio::Tools::Blast.pm and related modules.
|
|
2
|
|
3 0.091 Thu May 18 15:15:35 2000
|
|
4 (These fixes were made on both branch-06 and the main trunk).
|
|
5 - Fixed Webblast.pm (jitterbug PR #220). It now uses SeqIO and IO::Scalar
|
|
6 for sending formatted query sequence data to the BLAST servers.
|
|
7 - Fixed the examples/blast/ scripts related to remote BLASTing.
|
|
8 - Added single quotes to an '_expect' data member in Sbjct
|
|
9 (reported by Janet Young in jitterbug PR #218).
|
|
10 - Fixed regexp bug in Blast::HTML.pm that caused accessions with
|
|
11 . and _ characters to not be recognized.
|
|
12
|
|
13
|
|
14 0.09 Wed Mar 22 00:44:01 2000
|
|
15 - Handles Blast reports that don't contain description sections.
|
|
16 - Handles parsing of NCBI 2.0.9 Blast2 reports.
|
|
17 - Identifiers of hits are assumed to be the first
|
|
18 chunk of non-whitespace characters in the description line
|
|
19 Does not assume any semantics in the structure of the identifier.
|
|
20 (Blast::Sbjct::_set_id() formerly attempted to extract database
|
|
21 name from the seq identifiers, but this was prone to breaking).
|
|
22 - Fixed bug in the parsing of reports that lack an hits
|
|
23 (incorporated code fix submitted by Michael B. Thornton
|
|
24 jitterbug PR#114, PR#173).
|
|
25 - New helper script retrieve_blast.pl added to examples/blast/
|
|
26 for retrieving Blast reports from NCBI after running
|
|
27 run_blast_remote.pl See docs in retrieve_blast.pl for details.
|
|
28 - Updated Webblast.pm with new URLs for the NCBI servers.
|
|
29 - Fixed bug (jitterbug PR#84) in the parsing of alignments that would
|
|
30 cause occaisionally cause some significant hits to be missed
|
|
31 (incorporates code suggested by Peter van Heusden).
|
|
32 - Added support to Blast.pm for not parsing alignment data
|
|
33 using the new NO_ALIGNS parsing option. Also added -noaligns
|
|
34 command-line option to blast_config.pl (addresses jitterbug PR#93).
|
|
35 - Blast::hit() method doesn't blow up if there are no hits
|
|
36 (fixes jitterbug PR#129).
|
|
37 - Addressed jitterbug PR#96 by correcting docs. If the significance
|
|
38 is defined but of bad format, no default is set.
|
|
39 - HSP.pm properly handles bit scores expressed using floating point
|
|
40 and scientific notation (fixes jitterbug report PR#110).
|
|
41 - Fixed logic error in HSP.pm that would incorrectly throw an
|
|
42 exception when the number of identical residues was zero
|
|
43 (fixes jitterbug PR#164).
|
|
44 - HSP::strand() returns -1, 0, 1 instead of 'Minus', 'Plus', ''
|
|
45 to be in-line with with behavior of SeqFeature.
|
|
46
|
|
47 Fixes within Bio::Root
|
|
48 ----------------------
|
|
49 - Fixed bug reported in PR#85. Bio::Root::Utilities::get_newline()
|
|
50 returns "\n" by default.
|
|
51 - IOManager::read doesn't call alarm() if it isn't available
|
|
52 (fixes bug #PR98).
|
|
53 - Root::Object::_set_io() does not pass @_ to the IOManager constructor
|
|
54 call (incorporates suggestion from Scot Harker, jitterbut PR#192).
|
|
55
|
|
56
|
|
57 0.085 Tue Jun 29 05:15:40 1999 (bioperl 0.05.1)
|
|
58 (Note: the $VERSION variable was not changed to 0.085 in this release!)
|
|
59 - Fixed bugs reported in PR#s: 30, 31, 41, 44, 45, 47, 48
|
|
60 - Blast.pm no longer throws exceptions for hitless reports,
|
|
61 even if a -signif criterion is specified.
|
|
62 - Better support for PSI Blasts (running and parsing) based on
|
|
63 comments/suggestions from Ross Crowhurst and Lu Sun.
|
|
64 - Bug fixes in Webblast.pm regarding blastServerURL and filenames
|
|
65 with special characters (bug reports from James Diggans and
|
|
66 Bradford Powell, respectively).
|
|
67 - Removed =head3 tags that were causing pod2man warnings.
|
|
68 - Fixed -best parsing switch.
|
|
69 - Added better POD to Webblast.pm
|
|
70
|
|
71 0.080 Sat Apr 24 19:39:37 1999 (bioperl 0.05)
|
|
72 - Complete re-implementation of the Blast report parsing strategy.
|
|
73 The parser now always streamifies all Blast report(s) and
|
|
74 segments them hit-wise instead of report-wise. This improves
|
|
75 performance and memory usage (but doesn't solve the memory leak).
|
|
76 - Documented a workaround for the memory leak (see the docs for
|
|
77 Blast.pm under the section "Memory Usage Issues").
|
|
78 - Webblast.pm now uses LWP instead of postclient.pl to post network
|
|
79 Blast requests (thanks to Ross Crowhurst for code submission).
|
|
80 This is a much more robust mechanism, although it now requires you
|
|
81 to have LWP installed ("a good thing"). For example, it makes it
|
|
82 easy to send requests and receive results through a firewall.
|
|
83 It can run Blast1, Blast2, and PSI-Blast at NCBI and also supports
|
|
84 WashU-Blast servers.
|
|
85 - Fixed some miscellaneous parsing bugs and errors reported by
|
|
86 Greg Colello, Xie Tao, and Richard Copley from the pre-release
|
|
87 - Updated example/blast scripts. Renamed parse.pl and parse2.pl to
|
|
88 parse_blast.pl and parse_blast2.pl. parse_stream.pl is gone
|
|
89 (since parse_blast.pl now handles both files and streams).
|
|
90 Renamed run.pl to run_blast_remote.pl and added additional
|
|
91 options to keep it in synch with the new Webblast.pm.
|
|
92
|
|
93
|
|
94 0.075 Wed Feb 17 01:29:57 1999 (bioperl 0.04.4)
|
|
95 - Bug fix in Blast::Run::postclient.pl which was always printing
|
|
96 usage information (reported by Ross Crowhurst).
|
|
97 - Improved error detection & reporting in Blast::Run::Webblast
|
|
98 and Blast::Run::postclient.pl.
|
|
99 - Bug fixes in HSP.pm: _set_match_stats() now properly handles
|
|
100 BLASTN output format and matches() properly handles BLASTX
|
|
101 reports.
|
|
102 - Fixed detection of substitution matrix type in Blast.pm.
|
|
103 - Updated Blast example scripts in examples/blast/
|
|
104 (in particular, run.pl).
|
|
105
|
|
106 0.074 Wed Feb 3 04:10:06 1999 (bioperl 0.04.3)
|
|
107 - Modified blast_config.pl to make the -prog and -db command-line
|
|
108 arguments mandatory. There are no default values for $opt_blast
|
|
109 and $opt_db. Also removed the -dna command-line argument.
|
|
110 - Fixed alignment section parsing bug in Blast.pm (bug report #24);
|
|
111 affects _parse_hsp_string().
|
|
112 - Modified Blast::_display_hits() to use table_tiled(), thus
|
|
113 avoiding the buggy Sbjct::_display_hits().
|
|
114 - Bug fix in Blast::HTML.pm regarding detection of HTML formatting.
|
|
115 - Changes in HSP::_set_data() to support BLAST-GP version 2.0.8.
|
|
116 - Changed the #! line of Bio/Tools/Blast/Run/postclient.pl
|
|
117 to the more standard /usr/bin/perl and predeclared some vars
|
|
118 to avoid compiler warnings.
|
|
119 - blast_config.pl and seqtools.pl were modified to include support
|
|
120 for the -wait option (sets timeout during file/stream reading).
|
|
121 - Assorted touch-ups to example/blast scripts.
|
|
122
|
|
123 0.073 Mon Dec 28 15:18:29 1998 (bioperl 0.04.2)
|
|
124 - Made changes in Sbjct.pm and HSP.pm to allow parsing of older
|
|
125 versions of Blast. This affected HSP::_set_data() ("Query:?")
|
|
126 and Sbjct::_set_hsps() ("\s*Score").
|
|
127 - Improved error handling in Sbjct::_set_hsps() and error
|
|
128 detection in HSP_set_data().
|
|
129 - Added methods for obtaining the number of identical and conserved
|
|
130 residues (sans gaps) sequences in HSP.pm (num_identical,
|
|
131 num_conserved).
|
|
132 - Modified the default behavior of frac_identical() and
|
|
133 frac_conserved() to use the procedure used in the BLAST report:
|
|
134 BLAST-GP: uses total length of alignment in denominator
|
|
135 WU-BLAST: uses length of query sequence (sans gaps) in denominator.
|
|
136 When these methods are called with an argument ('sbjct' or 'query'),
|
|
137 the length of the sbjct or query sequence (sans gaps) is used in
|
|
138 the denominator (recommended).
|
|
139 - Fixed error in test #29 in the t/Blast.t test script.
|
|
140
|
|
141 0.072 Wed Dec 16 05:05:21 1998 (bioperl 0.04.1)
|
|
142 - Fixed out of range exception in HSP::matches() as suggested by
|
|
143 Michael Lonetto in bio.perl.org bug report #11.
|
|
144 - Made changes in Sbjct.pm and HSP.pm to deal with the reporting of
|
|
145 TBLAST[NX] data in amino acid coordinate space. This affects
|
|
146 frac_identical() and frac_conserved() in HSP.pm, and _tile_hsps(),
|
|
147 length_aln(), frac_aligned_query(), frac_aligned_hit(),
|
|
148 frac_unaligned_query(), frac_unaligned_hit() in Sbjct.pm.
|
|
149 These also in response to bug report #11.
|
|
150 - Fixed behavior of frac_identical() and frac_aligned() in
|
|
151 both Sbjct.pm and HSP.pm to correspond to the data reported
|
|
152 by Blast. Default behavior now includes gaps instead of ignoring
|
|
153 them as before. This was in response to a report by Eli Venter.
|
|
154 - Cleaned up a few "uninitialized value" warnings.
|
|
155
|
|
156 0.071 Thu Dec 10 18:41:51 1998 (bioperl 0.04)
|
|
157 - Bio::Tools::Blast::Sbjct::_set_id() to no longer uppercases
|
|
158 sequence ids for sbjct sequences. (Note however that the Blast
|
|
159 dataset may have been built with a tool that uppercases all
|
|
160 sequence ids. Check your raw Blast report.).
|
|
161 - Bio::Tools::Blast::HTML.pm incorporates the new URL for
|
|
162 information about the Bioperl Blast module.
|
|
163
|
|
164 0.07 Thu Dec 3 13:43:06 1998
|
|
165 - Reduced memory leak that becomes apparent when parsing many Blast
|
|
166 reports. This resulted in changes to Bio::Tools::Blast::_get_parse_func()
|
|
167 and the addition of a new function to Bio::Root::Object.pm that
|
|
168 explicitly breaks cyclic object reference structures.
|
|
169 (This problem was highlighted by recently Lincoln Stein, who is has
|
|
170 created a BoulderIO-based Blast parser. See BoulderIO-1.10 or later).
|
|
171 - Fixed support in blast_config.pl for supplying a custom filtering
|
|
172 function as a string using the -filt_func command-line option.
|
|
173 Added some examples of this to the parse.pl example script.
|
|
174 - Added start() and end() methods to Bio::Tools::Blast::HSP
|
|
175 and Bio::Tools::Blast::Sbjct.pm. Sbjct also received a range()
|
|
176 method (based on discussion with Ewan Birney).
|
|
177 - Fixed bugs in Blast.pm related to accessing the name of the
|
|
178 database, expecially in conjunction with stream parsing.
|
|
179 - A few fixes in Blast.pm and Sbjct regarding handling of
|
|
180 hit sequence and database names (reported by John Calley and
|
|
181 and Karl-Heinz Ott).
|
|
182 - Fixed regexp in Blast::_parse_hsp_data (reported by Brian Karlak).
|
|
183 - Added columns for start/end of the query/sbjct sequences to the
|
|
184 table_tiled() output. Added columns for HSP query/sbjct strand
|
|
185 and frame information to the table() output.
|
|
186 - Changes in Bio::Root::Object.pm regarding strict() and verbose()
|
|
187 which are no longer object-specific.
|
|
188 - Increased documentation in Bio::Root::Object.pm.
|
|
189 - Removed autoloading and several deprecated methods from the
|
|
190 Bio::Root::Object and Utilities.
|
|
191
|
|
192
|
|
193 0.062 Fri Sep 4 00:45:05 1998
|
|
194 - Fixed bugs in Blast::_set_hits() which caused no hits to be
|
|
195 saved when _parse_hsp_data() threw an exception.
|
|
196 - Assorted documentation changes. This includes a fix in
|
|
197 the docs of Bio::Root::Object.pm that caused a compilation error
|
|
198 when you try to use it without installing it via Makefile.PL.
|
|
199 Most modules now have docs for data members in their PODs.
|
|
200 - Added method strip_html() to Bio::Tools::WWW.pm (a more general
|
|
201 version than that found in Bio::Tools::Blast::HTML.pm).
|
|
202 - Includes Bio::Seq.pm version 0.050 (formerly Bio::PreSeq.pm).
|
|
203 This version has Ewan Birney's modifications (start() and end()
|
|
204 methods added; numbering() now deprecated).
|
|
205 - By popular demand, Bio::Root::Object::src_obj() has been changed
|
|
206 to the more intuitive parent(). All modules have been updated to
|
|
207 incorporate this change.
|
|
208
|
|
209 0.061 Wed Aug 26 12:30:34 1998
|
|
210 - Fixed memory leak problem associated with Blast::_set_hits() that
|
|
211 lead to excessive memory usage when parsing large numbers reports,
|
|
212 expecially when using the -check_all=>1 parsing option.
|
|
213 (This may have solved the parse_stream.pl memory leak.)
|
|
214 - Fixed regexp in Sbjct::_set_hsps() to recognize sequence lengths
|
|
215 containing commas (reported by Lincoln Stein).
|
|
216 - Blast::signif() now always returns number in scientific notation.
|
|
217 - Fixed bug in Bio::Root::IOManager::display() (wasn't returning true).
|
|
218 - Changed Makefile.PL to always save backups for in-place edits
|
|
219 and prevents the 'inplace edit without backup' warning (reported by
|
|
220 Bobby Otillar).
|
|
221 - Assorted documentation changes/additions.
|
|
222
|
|
223 0.06 Sun Aug 16 18:31:27 1998
|
|
224 - Fixed bug in HSP::_set_match_stats() that failed to recognize Frames
|
|
225 on the + strand.
|
|
226 - Added HSP::strand() method for retrieval of strand information for
|
|
227 query and sbjct HSP sequences from nucleotide Blasts.
|
|
228 - Changed calls to Perl's length() function to CORE::length()
|
|
229 to avoid confusing the Perl 5.005_001 interpreter with
|
|
230 Bio::Tools::HSP::length() and Bio::Tools::Sbjct::length().
|
|
231 (Reported by Mike Cariaso).
|
|
232 - Added column to the output of the Blast::table_tiled() method
|
|
233 to include fraction-aligned of the sbjct sequence.
|
|
234 - Uses new version of Bio::Root::Object.pm that has new exception
|
|
235 handling policy: exceptions and warnings are no longer by default
|
|
236 attached to the objects that threw them. See notes about this in
|
|
237 Bio::Root::Object.pm. (Main reason for 0.003-point jump in version).
|
|
238 - Added some safety net methods in Sbjct.pm: frac_aligned_sbjct()
|
|
239 and num_unaligned_sbjct(), which map to frac_aligned_hit() and
|
|
240 num_unaligned_hit(), respectively (API consistency issue).
|
|
241 - Minor changes/fixes in some of the demo scripts (seqtools.pl et al.).
|
|
242 - Removed "Modifications" notes from Blast-related modules. Added
|
|
243 pointer to distribution site instead, which consolidates this info.
|
|
244 - Modules can be used by copying directly from the lib/ directory into
|
|
245 your perl/lib directory without loss of functionality but at the expense of
|
|
246 increased startup time. It is still recommended that you use the standard
|
|
247 Makefile.PL procedure, however, since this will properly autosplit
|
|
248 the modules as well as run other checks.
|
|
249
|
|
250 0.057 Thu Jul 23 02:28:26 1998
|
|
251 - HSP.pm handles "match" sequence lines that are not properly padded.
|
|
252 Affects HSP::_set_data() and other HSP.pm methods that use residue
|
|
253 strings or indices (seq_str(), seq_inds()). (Reported by Libby Shoop).
|
|
254 - Added signif_fmt() method to Blast.pm to allow retrieval of
|
|
255 P/Expect value data in different formats (e.g., exponent only).
|
|
256 - Changes in the table() methods of Blast.pm to accomodate signif_fmt().
|
|
257 - Minor changes in destructors to improve memory management.
|
|
258 - Added warnings in Blast.pm and parse_stream.pl regarding
|
|
259 memory usage issue when parsing Blast streams.
|
|
260 - Updated HTML::get_html_func() to work with the new version of
|
|
261 Bio::Root::IOManager::read().
|
|
262 - Added new rexexps to HTML::_markup_report() to deal with hits
|
|
263 which lack a database identifier (e.g., gi|210095).
|
|
264 Added a few other SGD-specific markups to this method.
|
|
265 - New demo script "parse_multi.pl" provided as an alternative to the
|
|
266 memory-hungry parse_stream.pl when crunching many reports.
|
|
267 - Minor changes to parsing demo scripts and blast_config.pl.
|
|
268 - Added more tests to the t/blast.t test script.
|
|
269 - Small change in error handling in Webblast::_blast(): detects
|
|
270 when the Blast report file is empty and unlinks it.
|
|
271 - When the -email option is used, the list of files returned by
|
|
272 blast_remote() contains the string 'email' as the first element.
|
|
273 This provides a signal to Bio::Tools::Blast.pm.
|
|
274
|
|
275 0.056 Wed Jul 15 03:42:20 1998
|
|
276 - Fixed bug in HSP::seq_str() in which white space was not removed
|
|
277 from sequences generated by merging multiple lines, leading
|
|
278 to excessive warnings when building Bio::PreSeq objects.
|
|
279 This also involved related fixes in Bio::PreSeq.pm and
|
|
280 Bio::Root::Utilities.pm (q.v.). (Reported by Tim Dudgeon).
|
|
281 - Fixed bug in PreSeq.pm that caused bounds checking to fail
|
|
282 incorrectly when sequence numbering does not begin at 1.
|
|
283 - Uncompressing Blast reports no longer requires write access
|
|
284 in the directory containing the compressed files.
|
|
285 - Added another example script eg/blast/parse2.pl.
|
|
286 - Bio::Root::IOManager::read() now requires the supplied parsing
|
|
287 function reference to return true for each record parsed
|
|
288 (otherwise parsing stops).
|
|
289
|
|
290 0.055 Tue Jun 23 13:56:34 1998
|
|
291 - Fixed parsing problems for hit sequences without descriptions
|
|
292 or without database identifiers. This affected
|
|
293 Blast::_parse_hsp_data() and in Sbjct.pm, the methods
|
|
294 _set_id(), _set_desc(), and _set_hsps(). Most seriously,
|
|
295 _set_hsps() failed to locate the 'Length = ' line for
|
|
296 sequences without descriptions.
|
|
297 - Parses parameter data from NCBI Blast 2.0.5 (format change from 2.0.4)
|
|
298 This affected Blast::_set_parameters() and Blast::_parse_hsp_data().
|
|
299 (Reported by Elliot Lefkowitz.)
|
|
300 - Minor change in HSP::_set_seq() regexp for identifying sequence data.
|
|
301 - Table output methods remove tabs from ends of lines.
|
|
302 - Updated Bio::Tools::Blast::HTML.pm to work with the
|
|
303 new version of read() updated in 0.054.
|
|
304 - Improved file_date() method of Bio::Root::Utilities.pm
|
|
305
|
|
306 0.054 Tue Jun 16 20:24:36 1998
|
|
307 - Minor change in the way Bio::Root::IOManager::read() works
|
|
308 (now checks return value of called func ref). Only affects
|
|
309 module internals.
|
|
310 - Minor changes in eg/ scripts and some documentation additions.
|
|
311 - Fixed a bug in Makefile.PL regarding detection of UnivAln.pm.
|
|
312
|
|
313 0.053 Fri Jun 12 20:55:27 1998
|
|
314 - Added the "collapse" functionality to the seq_inds() method of
|
|
315 HSP.pm and added a seq_inds() method to Sbjct.pm.
|
|
316 - Added a new demo script eg/blast/parse_positions.pl.
|
|
317 - Remove the "-residues" and "-tile_hsps" parsing options since
|
|
318 they are not necessary.
|
|
319
|
|
320 0.052 Thu Jun 11 20:18:45 1998
|
|
321 - Minor bug fixes in I/O methods.
|
|
322 - Improved documentation for Bio::Root modules.
|
|
323 - Miscellaneous documentation additions.
|
|
324
|
|
325 0.051 Wed Jun 10 12:26:06 1998
|
|
326 - Bio::Tools::Blast::HSP.pm does some additional deferring of
|
|
327 sequence data processing for a slight performance enhancement.
|
|
328 - Fixed formatting bug in Bio::Tools::Blast::HTML.pm that caused
|
|
329 uneven formatting of some description lines.
|
|
330 - Minor bug fixes in parsing code of Bio::Tools::Blast.pm.
|
|
331 - Some documentation additions.
|
|
332 - Makefile prints warning if Perl version is < 5.003.
|
|
333
|
|
334 0.05 Fri Jun 5 15:58:22 1998
|
|
335 - Improved gapped Blast support.
|
|
336 - Enhanced IO (reads from existing file or STDIN).
|
|
337 - Enhanced parsing: can parse an input stream containing
|
|
338 many reports.
|
|
339 - Created modularized version of Alex Dong Li's webblast.
|
|
340 - Added support for running Blasts remotely at NCBI using
|
|
341 Bio::Tools::Blast::Run::Webblast.pm.
|
|
342 - Added support for running local Blasts using the skeletal
|
|
343 Bio::Tools::Blast::Run::LocalBlast.pm.
|
|
344 _ Bio::Tools::Blast::HSP.pm can create Bio::PreSeq.pm sequence
|
|
345 objects and Bio::UnivAln.pm sequence alignment objects on demand.
|
|
346 - Encapsulated HTML-formatting code into the new module
|
|
347 Bio::Tools::Blast::HTML.pm. Updated markup regexps.
|
|
348 - Blast.pm module can parse HTML-formatted reports
|
|
349 (but beware of unforseen changes in the HTML!).
|
|
350 - Enhanced tab-delimited table output methods.
|
|
351 - Improved documentation: compatible with 5.004 pod2html.
|
|
352 - Created a variety of demo scripts (see eg/blast/)
|
|
353 - Miscellaneous bug fixes and API adjustments..
|
|
354 - Bundled for distribution with MakeMaker and created test sript.
|
|
355
|
|
356 0.03 Fri Mar 27 23:35:32 1998
|
|
357 - Support for gapped Blast added.
|
|
358 - Improved parsing strategy.
|
|
359 - Inherits from Bio::Tools::SeqAnal.pm instead of Bio::Root::Object.pm
|
|
360 - Added more documentation.
|
|
361 - Miscellaneous bug fixes.
|
|
362
|
|
363 0.01 March 1996
|
|
364 - original version.
|
|
365 - Parses Blast 1.x report files.
|
|
366
|
|
367
|
|
368
|