comparison variant_effect_predictor/Bio/Tools/Blast/CHANGES @ 0:1f6dce3d34e0

Uploaded
author mahtabm
date Thu, 11 Apr 2013 02:01:53 -0400
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:1f6dce3d34e0
1 Revision history for Perl extension Bio::Tools::Blast.pm and related modules.
2
3 0.091 Thu May 18 15:15:35 2000
4 (These fixes were made on both branch-06 and the main trunk).
5 - Fixed Webblast.pm (jitterbug PR #220). It now uses SeqIO and IO::Scalar
6 for sending formatted query sequence data to the BLAST servers.
7 - Fixed the examples/blast/ scripts related to remote BLASTing.
8 - Added single quotes to an '_expect' data member in Sbjct
9 (reported by Janet Young in jitterbug PR #218).
10 - Fixed regexp bug in Blast::HTML.pm that caused accessions with
11 . and _ characters to not be recognized.
12
13
14 0.09 Wed Mar 22 00:44:01 2000
15 - Handles Blast reports that don't contain description sections.
16 - Handles parsing of NCBI 2.0.9 Blast2 reports.
17 - Identifiers of hits are assumed to be the first
18 chunk of non-whitespace characters in the description line
19 Does not assume any semantics in the structure of the identifier.
20 (Blast::Sbjct::_set_id() formerly attempted to extract database
21 name from the seq identifiers, but this was prone to breaking).
22 - Fixed bug in the parsing of reports that lack an hits
23 (incorporated code fix submitted by Michael B. Thornton
24 jitterbug PR#114, PR#173).
25 - New helper script retrieve_blast.pl added to examples/blast/
26 for retrieving Blast reports from NCBI after running
27 run_blast_remote.pl See docs in retrieve_blast.pl for details.
28 - Updated Webblast.pm with new URLs for the NCBI servers.
29 - Fixed bug (jitterbug PR#84) in the parsing of alignments that would
30 cause occaisionally cause some significant hits to be missed
31 (incorporates code suggested by Peter van Heusden).
32 - Added support to Blast.pm for not parsing alignment data
33 using the new NO_ALIGNS parsing option. Also added -noaligns
34 command-line option to blast_config.pl (addresses jitterbug PR#93).
35 - Blast::hit() method doesn't blow up if there are no hits
36 (fixes jitterbug PR#129).
37 - Addressed jitterbug PR#96 by correcting docs. If the significance
38 is defined but of bad format, no default is set.
39 - HSP.pm properly handles bit scores expressed using floating point
40 and scientific notation (fixes jitterbug report PR#110).
41 - Fixed logic error in HSP.pm that would incorrectly throw an
42 exception when the number of identical residues was zero
43 (fixes jitterbug PR#164).
44 - HSP::strand() returns -1, 0, 1 instead of 'Minus', 'Plus', ''
45 to be in-line with with behavior of SeqFeature.
46
47 Fixes within Bio::Root
48 ----------------------
49 - Fixed bug reported in PR#85. Bio::Root::Utilities::get_newline()
50 returns "\n" by default.
51 - IOManager::read doesn't call alarm() if it isn't available
52 (fixes bug #PR98).
53 - Root::Object::_set_io() does not pass @_ to the IOManager constructor
54 call (incorporates suggestion from Scot Harker, jitterbut PR#192).
55
56
57 0.085 Tue Jun 29 05:15:40 1999 (bioperl 0.05.1)
58 (Note: the $VERSION variable was not changed to 0.085 in this release!)
59 - Fixed bugs reported in PR#s: 30, 31, 41, 44, 45, 47, 48
60 - Blast.pm no longer throws exceptions for hitless reports,
61 even if a -signif criterion is specified.
62 - Better support for PSI Blasts (running and parsing) based on
63 comments/suggestions from Ross Crowhurst and Lu Sun.
64 - Bug fixes in Webblast.pm regarding blastServerURL and filenames
65 with special characters (bug reports from James Diggans and
66 Bradford Powell, respectively).
67 - Removed =head3 tags that were causing pod2man warnings.
68 - Fixed -best parsing switch.
69 - Added better POD to Webblast.pm
70
71 0.080 Sat Apr 24 19:39:37 1999 (bioperl 0.05)
72 - Complete re-implementation of the Blast report parsing strategy.
73 The parser now always streamifies all Blast report(s) and
74 segments them hit-wise instead of report-wise. This improves
75 performance and memory usage (but doesn't solve the memory leak).
76 - Documented a workaround for the memory leak (see the docs for
77 Blast.pm under the section "Memory Usage Issues").
78 - Webblast.pm now uses LWP instead of postclient.pl to post network
79 Blast requests (thanks to Ross Crowhurst for code submission).
80 This is a much more robust mechanism, although it now requires you
81 to have LWP installed ("a good thing"). For example, it makes it
82 easy to send requests and receive results through a firewall.
83 It can run Blast1, Blast2, and PSI-Blast at NCBI and also supports
84 WashU-Blast servers.
85 - Fixed some miscellaneous parsing bugs and errors reported by
86 Greg Colello, Xie Tao, and Richard Copley from the pre-release
87 - Updated example/blast scripts. Renamed parse.pl and parse2.pl to
88 parse_blast.pl and parse_blast2.pl. parse_stream.pl is gone
89 (since parse_blast.pl now handles both files and streams).
90 Renamed run.pl to run_blast_remote.pl and added additional
91 options to keep it in synch with the new Webblast.pm.
92
93
94 0.075 Wed Feb 17 01:29:57 1999 (bioperl 0.04.4)
95 - Bug fix in Blast::Run::postclient.pl which was always printing
96 usage information (reported by Ross Crowhurst).
97 - Improved error detection & reporting in Blast::Run::Webblast
98 and Blast::Run::postclient.pl.
99 - Bug fixes in HSP.pm: _set_match_stats() now properly handles
100 BLASTN output format and matches() properly handles BLASTX
101 reports.
102 - Fixed detection of substitution matrix type in Blast.pm.
103 - Updated Blast example scripts in examples/blast/
104 (in particular, run.pl).
105
106 0.074 Wed Feb 3 04:10:06 1999 (bioperl 0.04.3)
107 - Modified blast_config.pl to make the -prog and -db command-line
108 arguments mandatory. There are no default values for $opt_blast
109 and $opt_db. Also removed the -dna command-line argument.
110 - Fixed alignment section parsing bug in Blast.pm (bug report #24);
111 affects _parse_hsp_string().
112 - Modified Blast::_display_hits() to use table_tiled(), thus
113 avoiding the buggy Sbjct::_display_hits().
114 - Bug fix in Blast::HTML.pm regarding detection of HTML formatting.
115 - Changes in HSP::_set_data() to support BLAST-GP version 2.0.8.
116 - Changed the #! line of Bio/Tools/Blast/Run/postclient.pl
117 to the more standard /usr/bin/perl and predeclared some vars
118 to avoid compiler warnings.
119 - blast_config.pl and seqtools.pl were modified to include support
120 for the -wait option (sets timeout during file/stream reading).
121 - Assorted touch-ups to example/blast scripts.
122
123 0.073 Mon Dec 28 15:18:29 1998 (bioperl 0.04.2)
124 - Made changes in Sbjct.pm and HSP.pm to allow parsing of older
125 versions of Blast. This affected HSP::_set_data() ("Query:?")
126 and Sbjct::_set_hsps() ("\s*Score").
127 - Improved error handling in Sbjct::_set_hsps() and error
128 detection in HSP_set_data().
129 - Added methods for obtaining the number of identical and conserved
130 residues (sans gaps) sequences in HSP.pm (num_identical,
131 num_conserved).
132 - Modified the default behavior of frac_identical() and
133 frac_conserved() to use the procedure used in the BLAST report:
134 BLAST-GP: uses total length of alignment in denominator
135 WU-BLAST: uses length of query sequence (sans gaps) in denominator.
136 When these methods are called with an argument ('sbjct' or 'query'),
137 the length of the sbjct or query sequence (sans gaps) is used in
138 the denominator (recommended).
139 - Fixed error in test #29 in the t/Blast.t test script.
140
141 0.072 Wed Dec 16 05:05:21 1998 (bioperl 0.04.1)
142 - Fixed out of range exception in HSP::matches() as suggested by
143 Michael Lonetto in bio.perl.org bug report #11.
144 - Made changes in Sbjct.pm and HSP.pm to deal with the reporting of
145 TBLAST[NX] data in amino acid coordinate space. This affects
146 frac_identical() and frac_conserved() in HSP.pm, and _tile_hsps(),
147 length_aln(), frac_aligned_query(), frac_aligned_hit(),
148 frac_unaligned_query(), frac_unaligned_hit() in Sbjct.pm.
149 These also in response to bug report #11.
150 - Fixed behavior of frac_identical() and frac_aligned() in
151 both Sbjct.pm and HSP.pm to correspond to the data reported
152 by Blast. Default behavior now includes gaps instead of ignoring
153 them as before. This was in response to a report by Eli Venter.
154 - Cleaned up a few "uninitialized value" warnings.
155
156 0.071 Thu Dec 10 18:41:51 1998 (bioperl 0.04)
157 - Bio::Tools::Blast::Sbjct::_set_id() to no longer uppercases
158 sequence ids for sbjct sequences. (Note however that the Blast
159 dataset may have been built with a tool that uppercases all
160 sequence ids. Check your raw Blast report.).
161 - Bio::Tools::Blast::HTML.pm incorporates the new URL for
162 information about the Bioperl Blast module.
163
164 0.07 Thu Dec 3 13:43:06 1998
165 - Reduced memory leak that becomes apparent when parsing many Blast
166 reports. This resulted in changes to Bio::Tools::Blast::_get_parse_func()
167 and the addition of a new function to Bio::Root::Object.pm that
168 explicitly breaks cyclic object reference structures.
169 (This problem was highlighted by recently Lincoln Stein, who is has
170 created a BoulderIO-based Blast parser. See BoulderIO-1.10 or later).
171 - Fixed support in blast_config.pl for supplying a custom filtering
172 function as a string using the -filt_func command-line option.
173 Added some examples of this to the parse.pl example script.
174 - Added start() and end() methods to Bio::Tools::Blast::HSP
175 and Bio::Tools::Blast::Sbjct.pm. Sbjct also received a range()
176 method (based on discussion with Ewan Birney).
177 - Fixed bugs in Blast.pm related to accessing the name of the
178 database, expecially in conjunction with stream parsing.
179 - A few fixes in Blast.pm and Sbjct regarding handling of
180 hit sequence and database names (reported by John Calley and
181 and Karl-Heinz Ott).
182 - Fixed regexp in Blast::_parse_hsp_data (reported by Brian Karlak).
183 - Added columns for start/end of the query/sbjct sequences to the
184 table_tiled() output. Added columns for HSP query/sbjct strand
185 and frame information to the table() output.
186 - Changes in Bio::Root::Object.pm regarding strict() and verbose()
187 which are no longer object-specific.
188 - Increased documentation in Bio::Root::Object.pm.
189 - Removed autoloading and several deprecated methods from the
190 Bio::Root::Object and Utilities.
191
192
193 0.062 Fri Sep 4 00:45:05 1998
194 - Fixed bugs in Blast::_set_hits() which caused no hits to be
195 saved when _parse_hsp_data() threw an exception.
196 - Assorted documentation changes. This includes a fix in
197 the docs of Bio::Root::Object.pm that caused a compilation error
198 when you try to use it without installing it via Makefile.PL.
199 Most modules now have docs for data members in their PODs.
200 - Added method strip_html() to Bio::Tools::WWW.pm (a more general
201 version than that found in Bio::Tools::Blast::HTML.pm).
202 - Includes Bio::Seq.pm version 0.050 (formerly Bio::PreSeq.pm).
203 This version has Ewan Birney's modifications (start() and end()
204 methods added; numbering() now deprecated).
205 - By popular demand, Bio::Root::Object::src_obj() has been changed
206 to the more intuitive parent(). All modules have been updated to
207 incorporate this change.
208
209 0.061 Wed Aug 26 12:30:34 1998
210 - Fixed memory leak problem associated with Blast::_set_hits() that
211 lead to excessive memory usage when parsing large numbers reports,
212 expecially when using the -check_all=>1 parsing option.
213 (This may have solved the parse_stream.pl memory leak.)
214 - Fixed regexp in Sbjct::_set_hsps() to recognize sequence lengths
215 containing commas (reported by Lincoln Stein).
216 - Blast::signif() now always returns number in scientific notation.
217 - Fixed bug in Bio::Root::IOManager::display() (wasn't returning true).
218 - Changed Makefile.PL to always save backups for in-place edits
219 and prevents the 'inplace edit without backup' warning (reported by
220 Bobby Otillar).
221 - Assorted documentation changes/additions.
222
223 0.06 Sun Aug 16 18:31:27 1998
224 - Fixed bug in HSP::_set_match_stats() that failed to recognize Frames
225 on the + strand.
226 - Added HSP::strand() method for retrieval of strand information for
227 query and sbjct HSP sequences from nucleotide Blasts.
228 - Changed calls to Perl's length() function to CORE::length()
229 to avoid confusing the Perl 5.005_001 interpreter with
230 Bio::Tools::HSP::length() and Bio::Tools::Sbjct::length().
231 (Reported by Mike Cariaso).
232 - Added column to the output of the Blast::table_tiled() method
233 to include fraction-aligned of the sbjct sequence.
234 - Uses new version of Bio::Root::Object.pm that has new exception
235 handling policy: exceptions and warnings are no longer by default
236 attached to the objects that threw them. See notes about this in
237 Bio::Root::Object.pm. (Main reason for 0.003-point jump in version).
238 - Added some safety net methods in Sbjct.pm: frac_aligned_sbjct()
239 and num_unaligned_sbjct(), which map to frac_aligned_hit() and
240 num_unaligned_hit(), respectively (API consistency issue).
241 - Minor changes/fixes in some of the demo scripts (seqtools.pl et al.).
242 - Removed "Modifications" notes from Blast-related modules. Added
243 pointer to distribution site instead, which consolidates this info.
244 - Modules can be used by copying directly from the lib/ directory into
245 your perl/lib directory without loss of functionality but at the expense of
246 increased startup time. It is still recommended that you use the standard
247 Makefile.PL procedure, however, since this will properly autosplit
248 the modules as well as run other checks.
249
250 0.057 Thu Jul 23 02:28:26 1998
251 - HSP.pm handles "match" sequence lines that are not properly padded.
252 Affects HSP::_set_data() and other HSP.pm methods that use residue
253 strings or indices (seq_str(), seq_inds()). (Reported by Libby Shoop).
254 - Added signif_fmt() method to Blast.pm to allow retrieval of
255 P/Expect value data in different formats (e.g., exponent only).
256 - Changes in the table() methods of Blast.pm to accomodate signif_fmt().
257 - Minor changes in destructors to improve memory management.
258 - Added warnings in Blast.pm and parse_stream.pl regarding
259 memory usage issue when parsing Blast streams.
260 - Updated HTML::get_html_func() to work with the new version of
261 Bio::Root::IOManager::read().
262 - Added new rexexps to HTML::_markup_report() to deal with hits
263 which lack a database identifier (e.g., gi|210095).
264 Added a few other SGD-specific markups to this method.
265 - New demo script "parse_multi.pl" provided as an alternative to the
266 memory-hungry parse_stream.pl when crunching many reports.
267 - Minor changes to parsing demo scripts and blast_config.pl.
268 - Added more tests to the t/blast.t test script.
269 - Small change in error handling in Webblast::_blast(): detects
270 when the Blast report file is empty and unlinks it.
271 - When the -email option is used, the list of files returned by
272 blast_remote() contains the string 'email' as the first element.
273 This provides a signal to Bio::Tools::Blast.pm.
274
275 0.056 Wed Jul 15 03:42:20 1998
276 - Fixed bug in HSP::seq_str() in which white space was not removed
277 from sequences generated by merging multiple lines, leading
278 to excessive warnings when building Bio::PreSeq objects.
279 This also involved related fixes in Bio::PreSeq.pm and
280 Bio::Root::Utilities.pm (q.v.). (Reported by Tim Dudgeon).
281 - Fixed bug in PreSeq.pm that caused bounds checking to fail
282 incorrectly when sequence numbering does not begin at 1.
283 - Uncompressing Blast reports no longer requires write access
284 in the directory containing the compressed files.
285 - Added another example script eg/blast/parse2.pl.
286 - Bio::Root::IOManager::read() now requires the supplied parsing
287 function reference to return true for each record parsed
288 (otherwise parsing stops).
289
290 0.055 Tue Jun 23 13:56:34 1998
291 - Fixed parsing problems for hit sequences without descriptions
292 or without database identifiers. This affected
293 Blast::_parse_hsp_data() and in Sbjct.pm, the methods
294 _set_id(), _set_desc(), and _set_hsps(). Most seriously,
295 _set_hsps() failed to locate the 'Length = ' line for
296 sequences without descriptions.
297 - Parses parameter data from NCBI Blast 2.0.5 (format change from 2.0.4)
298 This affected Blast::_set_parameters() and Blast::_parse_hsp_data().
299 (Reported by Elliot Lefkowitz.)
300 - Minor change in HSP::_set_seq() regexp for identifying sequence data.
301 - Table output methods remove tabs from ends of lines.
302 - Updated Bio::Tools::Blast::HTML.pm to work with the
303 new version of read() updated in 0.054.
304 - Improved file_date() method of Bio::Root::Utilities.pm
305
306 0.054 Tue Jun 16 20:24:36 1998
307 - Minor change in the way Bio::Root::IOManager::read() works
308 (now checks return value of called func ref). Only affects
309 module internals.
310 - Minor changes in eg/ scripts and some documentation additions.
311 - Fixed a bug in Makefile.PL regarding detection of UnivAln.pm.
312
313 0.053 Fri Jun 12 20:55:27 1998
314 - Added the "collapse" functionality to the seq_inds() method of
315 HSP.pm and added a seq_inds() method to Sbjct.pm.
316 - Added a new demo script eg/blast/parse_positions.pl.
317 - Remove the "-residues" and "-tile_hsps" parsing options since
318 they are not necessary.
319
320 0.052 Thu Jun 11 20:18:45 1998
321 - Minor bug fixes in I/O methods.
322 - Improved documentation for Bio::Root modules.
323 - Miscellaneous documentation additions.
324
325 0.051 Wed Jun 10 12:26:06 1998
326 - Bio::Tools::Blast::HSP.pm does some additional deferring of
327 sequence data processing for a slight performance enhancement.
328 - Fixed formatting bug in Bio::Tools::Blast::HTML.pm that caused
329 uneven formatting of some description lines.
330 - Minor bug fixes in parsing code of Bio::Tools::Blast.pm.
331 - Some documentation additions.
332 - Makefile prints warning if Perl version is < 5.003.
333
334 0.05 Fri Jun 5 15:58:22 1998
335 - Improved gapped Blast support.
336 - Enhanced IO (reads from existing file or STDIN).
337 - Enhanced parsing: can parse an input stream containing
338 many reports.
339 - Created modularized version of Alex Dong Li's webblast.
340 - Added support for running Blasts remotely at NCBI using
341 Bio::Tools::Blast::Run::Webblast.pm.
342 - Added support for running local Blasts using the skeletal
343 Bio::Tools::Blast::Run::LocalBlast.pm.
344 _ Bio::Tools::Blast::HSP.pm can create Bio::PreSeq.pm sequence
345 objects and Bio::UnivAln.pm sequence alignment objects on demand.
346 - Encapsulated HTML-formatting code into the new module
347 Bio::Tools::Blast::HTML.pm. Updated markup regexps.
348 - Blast.pm module can parse HTML-formatted reports
349 (but beware of unforseen changes in the HTML!).
350 - Enhanced tab-delimited table output methods.
351 - Improved documentation: compatible with 5.004 pod2html.
352 - Created a variety of demo scripts (see eg/blast/)
353 - Miscellaneous bug fixes and API adjustments..
354 - Bundled for distribution with MakeMaker and created test sript.
355
356 0.03 Fri Mar 27 23:35:32 1998
357 - Support for gapped Blast added.
358 - Improved parsing strategy.
359 - Inherits from Bio::Tools::SeqAnal.pm instead of Bio::Root::Object.pm
360 - Added more documentation.
361 - Miscellaneous bug fixes.
362
363 0.01 March 1996
364 - original version.
365 - Parses Blast 1.x report files.
366
367
368