annotate variant_effect_predictor/Bio/Index/Abstract.pm @ 0:2bc9b66ada89 draft default tip

Uploaded
author mahtabm
date Thu, 11 Apr 2013 06:29:17 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
2 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
3 # $Id: Abstract.pm,v 1.41 2002/12/17 02:08:36 jason Exp $
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
4 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
5 # BioPerl module for Bio::Index::Abstract
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
6 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
7 # Cared for by Ewan Birney <birney@sanger.ac.uk>
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
8 # and James Gilbert <jgrg@sanger.ac.uk>
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
9 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
10 # You may distribute this module under the same terms as perl itself
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
11
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
12 # POD documentation - main docs before the code
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
13
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
14 =head1 NAME
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
15
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
16 Bio::Index::Abstract - Abstract interface for indexing a flat file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
17
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
18 =head1 SYNOPSIS
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
19
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
20 You should not be using this module directly
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
21
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
22 =head1 USING DB_FILE
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
23
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
24 To use DB_File and not SDBM for this index, pass the value:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
25
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
26 -dbm_package => 'DB_File'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
27
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
28 to new (see below).
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
29
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
30 =head1 DESCRIPTION
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
31
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
32 This object provides the basic mechanism to associate positions
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
33 in files with names. The position and filenames are stored in DBM
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
34 which can then be accessed later on. It is the equivalent of flat
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
35 file indexing (eg, SRS or efetch).
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
36
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
37 This object is the guts to the mechanism, which will be used by the
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
38 specific objects inheriting from it.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
39
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
40 =head1 FEEDBACK
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
41
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
42 =head2 Mailing Lists
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
43
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
44 User feedback is an integral part of the evolution of this and other
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
45 Bioperl modules. Send your comments and suggestions preferably to one
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
46 of the Bioperl mailing lists. Your participation is much appreciated.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
47
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
48 bioperl-l@bioperl.org - General discussion
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
49 http://bioperl.org/MailList.shtml - About the mailing lists
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
50
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
51 =head2 Reporting Bugs
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
52
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
53 Report bugs to the Bioperl bug tracking system to help us keep track
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
54 the bugs and their resolution. Bug reports can be submitted via
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
55 email or the web:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
56
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
57 bioperl-bugs@bio.perl.org
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
58 http://bugzilla.bioperl.org/
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
59
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
60 =head1 AUTHOR - Ewan Birney, James Gilbert
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
61
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
62 Email - birney@sanger.ac.uk, jgrg@sanger.ac.uk
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
63
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
64 =head1 APPENDIX
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
65
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
66 The rest of the documentation details each of the object methods. Internal
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
67 methods are usually preceded with an "_" (underscore).
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
68
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
69 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
70
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
71
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
72 # Let the code begin...
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
73
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
74 package Bio::Index::Abstract;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
75
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
76 use strict;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
77 use Fcntl qw( O_RDWR O_CREAT O_RDONLY );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
78 use vars qw( $TYPE_AND_VERSION_KEY
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
79 @ISA $USE_DBM_TYPE $DB_HASH );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
80
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
81 # Object preamble - inheriets from Bio::Root::Object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
82
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
83 use Bio::Root::Root;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
84 use Bio::Root::IO;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
85 use Symbol();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
86
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
87 @ISA = qw(Bio::Root::Root);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
88
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
89 # Generate accessor methods for simple object fields
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
90 BEGIN {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
91 foreach my $func (qw(filename write_flag)) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
92 no strict 'refs';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
93 my $field = "_$func";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
94
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
95 *$func = sub {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
96 my( $self, $value ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
97
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
98 if (defined $value) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
99 $self->{$field} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
100 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
101 return $self->{$field};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
102 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
103 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
104 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
105
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
106 =head2 new
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
107
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
108 Usage : $index = Bio::Index::Abstract->new(
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
109 -filename => $dbm_file,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
110 -write_flag => 0,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
111 -dbm_package => 'DB_File',
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
112 -verbose => 0);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
113 Function: Returns a new index object. If filename is
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
114 specified, then open_dbm() is immediately called.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
115 Bio::Index::Abstract->new() will usually be called
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
116 directly only when opening an existing index.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
117 Returns : A new index object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
118 Args : -filename The name of the dbm index file.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
119 -write_flag TRUE if write access to the dbm file is
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
120 needed.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
121 -dbm_package The Perl dbm module to use for the
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
122 index.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
123 -verbose Print debugging output to STDERR if
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
124 TRUE.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
125
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
126 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
127
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
128 sub new {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
129 my($class, @args) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
130 my $self = $class->SUPER::new(@args);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
131 my( $filename, $write_flag, $dbm_package, $cachesize, $ffactor ) =
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
132 $self->_rearrange([qw(FILENAME
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
133 WRITE_FLAG
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
134 DBM_PACKAGE
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
135 CACHESIZE
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
136 FFACTOR
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
137 )], @args);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
138
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
139 # Store any parameters passed
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
140 $self->filename($filename) if $filename;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
141 $self->cachesize($cachesize) if $cachesize;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
142 $self->ffactor($ffactor) if $ffactor;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
143 $self->write_flag($write_flag) if $write_flag;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
144 $self->dbm_package($dbm_package) if $dbm_package;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
145
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
146 $self->{'_filehandle'} = []; # Array in which to cache SeqIO objects
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
147 $self->{'_DB'} = {}; # Gets tied to the DBM file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
148
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
149 # Open database
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
150 $self->open_dbm() if $filename;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
151 return $self;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
152 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
153
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
154 =pod
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
155
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
156 =head2 filename
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
157
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
158 Title : filename
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
159 Usage : $value = $self->filename();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
160 $self->filename($value);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
161 Function: Gets or sets the name of the dbm index file.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
162 Returns : The current value of filename
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
163 Args : Value of filename if setting, or none if
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
164 getting the value.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
165
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
166 =head2 write_flag
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
167
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
168 Title : write_flag
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
169 Usage : $value = $self->write_flag();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
170 $self->write_flag($value);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
171 Function: Gets or sets the value of write_flag, which
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
172 is wether the dbm file should be opened with
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
173 write access.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
174 Returns : The current value of write_flag (default 0)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
175 Args : Value of write_flag if setting, or none if
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
176 getting the value.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
177
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
178 =head2 dbm_package
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
179
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
180 Usage : $value = $self->dbm_package();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
181 $self->dbm_package($value);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
182
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
183 Function: Gets or sets the name of the Perl dbm module used.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
184 If the value is unset, then it returns the value of
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
185 the package variable $USE_DBM_TYPE or if that is
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
186 unset, then it chooses the best available dbm type,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
187 choosing 'DB_File' in preference to 'SDBM_File'.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
188 Bio::Abstract::Index may work with other dbm file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
189 types.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
190
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
191 Returns : The current value of dbm_package
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
192 Args : Value of dbm_package if setting, or none if
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
193 getting the value.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
194
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
195 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
196
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
197 sub dbm_package {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
198 my( $self, $value ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
199 my $to_require = 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
200 if( $value || ! $self->{'_dbm_package'} ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
201 my $type = $value || $USE_DBM_TYPE || 'DB_File';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
202 if( $type =~ /DB_File/i ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
203 eval {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
204 require DB_File;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
205 };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
206 $type = ( $@ ) ? 'SDBM_File' : 'DB_File';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
207 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
208 if( $type ne 'DB_File' ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
209 eval { require "$type.pm"; };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
210 $self->throw($@) if( $@ );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
211 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
212 $self->{'_dbm_package'} = $type;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
213 if( ! defined $USE_DBM_TYPE ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
214 $USE_DBM_TYPE = $self->{'_dbm_package'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
215 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
216 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
217 return $self->{'_dbm_package'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
218 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
219
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
220 =head2 db
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
221
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
222 Title : db
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
223 Usage : $index->db
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
224 Function: Returns a ref to the hash which is tied to the dbm
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
225 file. Used internally when adding and retrieving
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
226 data from the database.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
227 Example : $db = $index->db();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
228 $db->{ $some_key } = $data
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
229 $data = $index->db->{ $some_key };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
230 Returns : ref to HASH
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
231 Args : NONE
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
232
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
233 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
234
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
235 sub db {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
236 return $_[0]->{'_DB'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
237 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
238
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
239
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
240 =head2 get_stream
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
241
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
242 Title : get_stream
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
243 Usage : $stream = $index->get_stream( $id );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
244 Function: Returns a file handle with the file pointer
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
245 at the approprite place
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
246
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
247 This provides for a way to get the actual
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
248 file contents and not an object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
249
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
250 WARNING: you must parse the record deliminter
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
251 *yourself*. Abstract wont do this for you
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
252 So this code
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
253
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
254 $fh = $index->get_stream($myid);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
255 while( <$fh> ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
256 # do something
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
257 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
258 will parse the entire file if you don't put in
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
259 a last statement in, like
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
260
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
261 while( <$fh> ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
262 /^\/\// && last; # end of record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
263 # do something
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
264 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
265
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
266 Returns : A filehandle object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
267 Args : string represents the accession number
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
268 Notes : This method should not be used without forethought
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
269
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
270 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
271
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
272 #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
273
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
274 sub get_stream {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
275 my ($self,$id) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
276
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
277 my ($desc,$acc,$out);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
278 my $db = $self->db();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
279
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
280 if (my $rec = $db->{ $id }) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
281 my( @record );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
282
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
283 my ($file, $begin, $end) = $self->unpack_record( $rec );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
284
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
285 # Get the (possibly cached) filehandle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
286 my $fh = $self->_file_handle( $file );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
287
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
288 # move to start
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
289 seek($fh, $begin, 0);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
290
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
291 return $fh;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
292 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
293 else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
294 $self->throw("Unable to find a record for $id in the flat file index");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
295 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
296 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
297
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
298
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
299 =head2 cachesize
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
300
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
301 Usage : $index->cachesize(1000000)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
302 Function: Sets the dbm file cache size for the index.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
303 Needs to be set before the DBM file gets opened.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
304 Example : $index->cachesize(1000000)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
305 Returns : size of the curent cache
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
306
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
307 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
308
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
309 sub cachesize {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
310 my( $self, $size ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
311
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
312 if(defined $size){
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
313 $self->{'_cachesize'} = $size;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
314 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
315 return ( $self->{'_cachesize'} );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
316
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
317 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
318
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
319
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
320 =head2 ffactor
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
321
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
322 Usage : $index->ffactor(1000000)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
323 Function: Sets the dbm file fill factor.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
324 Needs to be set before the DBM file gets opened.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
325
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
326 Example : $index->ffactor(1000000)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
327 Returns : size of the curent cache
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
328
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
329 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
330
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
331 sub ffactor {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
332 my( $self, $size ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
333
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
334 if(defined $size){
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
335 $self->{'_ffactor'} = $size;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
336 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
337 return ( $self->{'_ffactor'} );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
338
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
339 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
340
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
341
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
342 =head2 open_dbm
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
343
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
344 Usage : $index->open_dbm()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
345 Function: Opens the dbm file associated with the index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
346 object. Write access is only given if explicitly
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
347 asked for by calling new(-write => 1) or having set
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
348 the write_flag(1) on the index object. The type of
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
349 dbm file opened is that returned by dbm_package().
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
350 The name of the file to be is opened is obtained by
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
351 calling the filename() method.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
352
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
353 Example : $index->_open_dbm()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
354 Returns : 1 on success
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
355
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
356 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
357
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
358 sub open_dbm {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
359 my( $self ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
360
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
361 my $filename = $self->filename()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
362 or $self->throw("filename() not set");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
363
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
364 my $db = $self->db();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
365
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
366 # Close the dbm file if already open (maybe we're getting
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
367 # or dropping write access
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
368 if (ref($db) ne 'HASH') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
369 untie($db);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
370 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
371
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
372 # What kind of DBM file are we going to open?
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
373 my $dbm_type = $self->dbm_package;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
374
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
375 # Choose mode for opening dbm file (read/write+create or read-only).
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
376 my $mode_flags = $self->write_flag ? O_RDWR|O_CREAT : O_RDONLY;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
377
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
378 # Open the dbm file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
379 if ($dbm_type eq 'DB_File') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
380 my $hash_inf = DB_File::HASHINFO->new();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
381 my $cache = $self->cachesize();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
382 my $ffactor = $self->ffactor();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
383 if ($cache){
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
384 $hash_inf->{'cachesize'} = $cache;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
385 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
386 if ($ffactor){
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
387 $hash_inf->{'ffactor'} = $ffactor;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
388 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
389 tie( %$db, $dbm_type, $filename, $mode_flags, 0644, $hash_inf )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
390 or $self->throw("Can't open '$dbm_type' dbm file '$filename' : $!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
391 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
392 tie( %$db, $dbm_type, $filename, $mode_flags, 0644 )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
393 or $self->throw("Can't open '$dbm_type' dbm file '$filename' : $!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
394 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
395
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
396 # The following methods access data in the dbm file:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
397
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
398 # Now, if we're a Bio::Index::Abstract caterpillar, then we
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
399 # transform ourselves into a Bio::Index::<something> butterfly!
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
400 if( ref($self) eq "Bio::Index::Abstract" ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
401 my $pkg = $self->_code_base();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
402 bless $self, $pkg;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
403 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
404
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
405 # Check or set this is the right kind and version of index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
406 $self->_type_and_version();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
407
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
408 # Check files haven't changed size since they were indexed
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
409 $self->_check_file_sizes();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
410
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
411 return 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
412 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
413
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
414 =head2 _version
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
415
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
416 Title : _version
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
417 Usage : $type = $index->_version()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
418 Function: Returns a string which identifes the version of an
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
419 index module. Used to permanently identify an index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
420 file as having been created by a particular version
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
421 of the index module. Must be provided by the sub class
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
422 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
423 Returns :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
424 Args : none
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
425
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
426 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
427
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
428 sub _version {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
429 my $self = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
430
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
431 $self->throw("In Bio::Index::Abstract, no _version method in sub class");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
432 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
433
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
434 =head2 _code_base
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
435
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
436 Title : _code_base
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
437 Usage : $code = $db->_code_base();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
438 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
439 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
440 Returns : Code package to be used with this
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
441 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
442
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
443
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
444 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
445
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
446 sub _code_base {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
447 my ($self) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
448 my $code_key = '__TYPE_AND_VERSION';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
449 my $record;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
450
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
451 $record = $self->db->{$code_key};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
452
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
453 my($code,$version) = $self->unpack_record($record);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
454 if( wantarray ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
455 return ($code,$version);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
456 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
457 return $code;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
458 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
459 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
460
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
461
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
462 =head2 _type_and_version
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
463
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
464 Title : _type_and_version
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
465 Usage : Called by _initalize
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
466 Function: Checks that the index opened is made by the same index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
467 module and version of that module that made it. If the
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
468 index is empty, then it adds the information to the
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
469 database.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
470 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
471 Returns : 1 or exception
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
472 Args : none
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
473
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
474 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
475
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
476 sub _type_and_version {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
477 my $self = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
478 my $key = '__TYPE_AND_VERSION';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
479 my $version = $self->_version();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
480 my $type = ref $self;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
481
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
482 # Run check or add type and version key if missing
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
483 if (my $rec = $self->db->{ $key }) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
484 my( $db_type, $db_version ) = $self->unpack_record($rec);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
485 $self->throw("This index file is from version [$db_version] - You need to rebuild it to use module version [$version]")
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
486 unless $db_version == $version;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
487 $self->throw("This index file is type [$db_type] - Can't access it with module for [$type]")
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
488 unless $db_type eq $type;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
489 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
490 $self->add_record( $key, $type, $version )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
491 or $self->throw("Can't add Type and Version record");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
492 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
493 return 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
494 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
495
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
496
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
497 =head2 _check_file_sizes
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
498
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
499 Title : _check_file_sizes
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
500 Usage : $index->_check_file_sizes()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
501 Function: Verifies that the files listed in the database
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
502 are the same size as when the database was built,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
503 or throws an exception. Called by the new()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
504 function.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
505 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
506 Returns : 1 or exception
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
507 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
508
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
509 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
510
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
511 sub _check_file_sizes {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
512 my $self = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
513 my $num = $self->_file_count() || 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
514
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
515 for (my $i = 0; $i < $num; $i++) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
516 my( $file, $stored_size ) = $self->unpack_record( $self->db->{"__FILE_$i"} );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
517 my $size = -s $file;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
518 unless ($size == $stored_size) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
519 $self->throw("file $i [ $file ] has changed size $stored_size -> $size. This probably means you need to rebuild the index.");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
520 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
521 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
522 return 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
523 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
524
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
525
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
526 =head2 make_index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
527
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
528 Title : make_index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
529 Usage : $index->make_index( FILE_LIST )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
530 Function: Takes a list of file names, checks that they are
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
531 all fully qualified, and then calls _filename() on
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
532 each. It supplies _filename() with the name of the
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
533 file, and an integer which is stored with each record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
534 created by _filename(). Can be called multiple times,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
535 and can be used to add to an existing index file.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
536 Example : $index->make_index( '/home/seqs1', '/home/seqs2', '/nfs/pub/big_db' );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
537 Returns : Number of files indexed
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
538 Args : LIST OF FILES
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
539
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
540 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
541
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
542 sub make_index {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
543 my($self, @files) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
544 my $count = 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
545 my $recs = 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
546 # blow up if write flag is not set. EB fix
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
547
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
548 if( !defined $self->write_flag ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
549 $self->throw("Attempting to make an index on a read-only database. What about a WRITE flag on opening the index?");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
550 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
551
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
552 # We're really fussy/lazy, expecting all file names to be fully qualified
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
553 $self->throw("No files to index provided") unless @files;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
554 for(my $i=0;$i<scalar @files; $i++) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
555 if( $Bio::Root::IO::FILESPECLOADED && File::Spec->can('rel2abs') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
556 if( ! File::Spec->file_name_is_absolute($files[$i]) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
557 $files[$i] = File::Spec->rel2abs($files[$i]);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
558 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
559 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
560 if( $^O =~ /MSWin/i ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
561 ($files[$i] =~ m|^[A-Za-z]:/|) ||
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
562 $self->throw("Not an absolute file path '$files[$i]'");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
563 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
564 ($files[$i] =~ m|^/|) ||
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
565 $self->throw("Not an absolute file path '$files[$i]'");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
566 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
567 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
568 $self->throw("File does not exist '$files[$i]'") unless -e $files[$i];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
569 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
570
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
571 # Add each file to the index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
572 FILE :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
573 foreach my $file (@files) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
574
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
575 my $i; # index for this file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
576
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
577 # Get new index for this file and increment file count
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
578 if ( defined(my $count = $self->_file_count) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
579 $i = $count;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
580 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
581 $i = 0; $self->_file_count(0);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
582 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
583
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
584 # see whether this file has been already indexed
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
585 my ($record,$number,$size);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
586
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
587 if( ($record = $self->db->{"__FILENAME_$file"}) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
588 ($number,$size) = $self->unpack_record($record);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
589
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
590 # if it is the same size - fine. Otherwise die
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
591 if( -s $file == $size ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
592 warn "File $file already indexed. Skipping...\n";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
593 next FILE;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
594 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
595 $self->throw("In index, $file has changed size ($size). Indicates that the index is out of date");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
596 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
597 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
598
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
599 # index this file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
600 warn "Indexing file $file\n" if( $self->verbose > 0);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
601
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
602 # this is supplied by the subclass and does the serious work
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
603 $recs += $self->_index_file( $file, $i ); # Specific method for each type of index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
604
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
605
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
606 # Save file name and size for this index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
607 $self->add_record("__FILE_$i", $file, -s $file)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
608 or $self->throw("Can't add data to file: $file");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
609 $self->add_record("__FILENAME_$file", $i, -s $file)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
610 or $self->throw("Can't add data to file: $file");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
611
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
612 # increment file lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
613 $i++; $self->_file_count($i);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
614 my $temp;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
615 $temp = $self->_file_count();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
616
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
617
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
618 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
619 return ($count, $recs);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
620 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
621
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
622 =head2 _filename
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
623
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
624 Title : _filename
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
625 Usage : $index->_filename( FILE INT )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
626 Function: Indexes the file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
627 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
628 Returns :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
629 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
630
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
631 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
632
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
633 sub _index_file {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
634 my $self = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
635
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
636 my $pkg = ref($self);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
637 $self->throw("Error: '$pkg' does not provide the _index_file() method");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
638 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
639
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
640
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
641
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
642 =head2 _file_handle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
643
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
644 Title : _file_handle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
645 Usage : $fh = $index->_file_handle( INT )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
646 Function: Returns an open filehandle for the file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
647 index INT. On opening a new filehandle it
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
648 caches it in the @{$index->_filehandle} array.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
649 If the requested filehandle is already open,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
650 it simply returns it from the array.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
651 Example : $fist_file_indexed = $index->_file_handle( 0 );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
652 Returns : ref to a filehandle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
653 Args : INT
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
654
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
655 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
656
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
657 sub _file_handle {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
658 my( $self, $i ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
659
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
660 unless ($self->{'_filehandle'}[$i]) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
661 my $fh = Symbol::gensym();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
662 my @rec = $self->unpack_record($self->db->{"__FILE_$i"})
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
663 or $self->throw("Can't get filename for index : $i");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
664 my $file = $rec[0];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
665 open $fh, $file or $self->throw("Can't read file '$file' : $!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
666 $self->{'_filehandle'}[$i] = $fh; # Cache filehandle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
667 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
668 return $self->{'_filehandle'}[$i];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
669 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
670
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
671
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
672 =head2 _file_count
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
673
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
674 Title : _file_count
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
675 Usage : $index->_file_count( INT )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
676 Function: Used by the index building sub in a sub class to
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
677 track the number of files indexed. Sets or gets
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
678 the number of files indexed when called with or
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
679 without an argument.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
680 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
681 Returns : INT
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
682 Args : INT
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
683
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
684 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
685
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
686 sub _file_count {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
687 my $self = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
688 if (@_) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
689 $self->db->{'__FILE_COUNT'} = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
690 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
691 return $self->db->{'__FILE_COUNT'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
692 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
693
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
694
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
695 =head2 add_record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
696
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
697 Title : add_record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
698 Usage : $index->add_record( $id, @stuff );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
699 Function: Calls pack_record on @stuff, and adds the result
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
700 of pack_record to the index database under key $id.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
701 If $id is a reference to an array, then a new entry
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
702 is added under a key corresponding to each element
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
703 of the array.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
704 Example : $index->add_record( $id, $fileNumber, $begin, $end )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
705 Returns : TRUE on success or FALSE on failure
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
706 Args : ID LIST
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
707
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
708 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
709
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
710 sub add_record {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
711 my( $self, $id, @rec ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
712 $self->debug( "Adding key $id\n") if( $self->verbose > 0 );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
713 $self->db->{$id} = $self->pack_record( @rec );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
714 return 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
715 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
716
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
717
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
718 =head2 pack_record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
719
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
720 Title : pack_record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
721 Usage : $packed_string = $index->pack_record( LIST )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
722 Function: Packs an array of scalars into a single string
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
723 joined by ASCII 034 (which is unlikely to be used
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
724 in any of the strings), and returns it.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
725 Example : $packed_string = $index->pack_record( $fileNumber, $begin, $end )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
726 Returns : STRING or undef
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
727 Args : LIST
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
728
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
729 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
730
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
731 sub pack_record {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
732 my( $self, @args ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
733 return join "\034", @args;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
734 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
735
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
736 =head2 unpack_record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
737
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
738 Title : unpack_record
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
739 Usage : $index->unpack_record( STRING )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
740 Function: Splits the sting provided into an array,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
741 splitting on ASCII 034.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
742 Example : ( $fileNumber, $begin, $end ) = $index->unpack_record( $self->db->{$id} )
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
743 Returns : A 3 element ARRAY
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
744 Args : STRING containing ASCII 034
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
745
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
746 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
747
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
748 sub unpack_record {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
749 my( $self, @args ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
750 return split /\034/, $args[0];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
751 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
752
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
753 =head2 count_records
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
754
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
755 Title : count_records
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
756 Usage : $recs = $seqdb->count_records()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
757 Function: return count of all recs in the index
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
758 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
759 Returns : a scalar
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
760 Args : none
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
761
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
762
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
763 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
764
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
765 sub count_records {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
766 my ($self,@args) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
767 my $db = $self->db;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
768 my $c = 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
769 while (my($id, $rec) = each %$db) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
770 if( $id =~ /^__/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
771 # internal info
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
772 next;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
773 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
774 $c++;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
775 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
776
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
777 return ($c);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
778 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
779
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
780
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
781 =head2 DESTROY
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
782
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
783 Title : DESTROY
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
784 Usage : Called automatically when index goes out of scope
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
785 Function: Closes connection to database and handles to
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
786 sequence files
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
787 Returns : NEVER
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
788 Args : NONE
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
789
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
790
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
791 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
792
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
793 sub DESTROY {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
794 my $self = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
795 untie($self->{'_DB'});
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
796 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
797
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
798 1;