annotate variant_effect_predictor/Bio/Index/Abstract.pm @ 0:21066c0abaf5 draft

Uploaded
author willmclaren
date Fri, 03 Aug 2012 10:04:48 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
2 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
3 # $Id: Abstract.pm,v 1.41 2002/12/17 02:08:36 jason Exp $
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
4 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
5 # BioPerl module for Bio::Index::Abstract
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
6 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
7 # Cared for by Ewan Birney <birney@sanger.ac.uk>
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
8 # and James Gilbert <jgrg@sanger.ac.uk>
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
9 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
10 # You may distribute this module under the same terms as perl itself
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
11
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
12 # POD documentation - main docs before the code
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
13
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
14 =head1 NAME
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
15
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
16 Bio::Index::Abstract - Abstract interface for indexing a flat file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
17
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
18 =head1 SYNOPSIS
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
19
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
20 You should not be using this module directly
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
21
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
22 =head1 USING DB_FILE
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
23
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
24 To use DB_File and not SDBM for this index, pass the value:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
25
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
26 -dbm_package => 'DB_File'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
27
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
28 to new (see below).
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
29
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
30 =head1 DESCRIPTION
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
31
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
32 This object provides the basic mechanism to associate positions
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
33 in files with names. The position and filenames are stored in DBM
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
34 which can then be accessed later on. It is the equivalent of flat
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
35 file indexing (eg, SRS or efetch).
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
36
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
37 This object is the guts to the mechanism, which will be used by the
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
38 specific objects inheriting from it.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
39
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
40 =head1 FEEDBACK
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
41
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
42 =head2 Mailing Lists
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
43
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
44 User feedback is an integral part of the evolution of this and other
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
45 Bioperl modules. Send your comments and suggestions preferably to one
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
46 of the Bioperl mailing lists. Your participation is much appreciated.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
47
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
48 bioperl-l@bioperl.org - General discussion
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
49 http://bioperl.org/MailList.shtml - About the mailing lists
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
50
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
51 =head2 Reporting Bugs
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
52
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
53 Report bugs to the Bioperl bug tracking system to help us keep track
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
54 the bugs and their resolution. Bug reports can be submitted via
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
55 email or the web:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
56
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
57 bioperl-bugs@bio.perl.org
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
58 http://bugzilla.bioperl.org/
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
59
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
60 =head1 AUTHOR - Ewan Birney, James Gilbert
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
61
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
62 Email - birney@sanger.ac.uk, jgrg@sanger.ac.uk
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
63
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
64 =head1 APPENDIX
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
65
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
66 The rest of the documentation details each of the object methods. Internal
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
67 methods are usually preceded with an "_" (underscore).
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
68
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
69 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
70
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
71
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
72 # Let the code begin...
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
73
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
74 package Bio::Index::Abstract;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
75
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
76 use strict;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
77 use Fcntl qw( O_RDWR O_CREAT O_RDONLY );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
78 use vars qw( $TYPE_AND_VERSION_KEY
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
79 @ISA $USE_DBM_TYPE $DB_HASH );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
80
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
81 # Object preamble - inheriets from Bio::Root::Object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
82
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
83 use Bio::Root::Root;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
84 use Bio::Root::IO;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
85 use Symbol();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
86
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
87 @ISA = qw(Bio::Root::Root);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
88
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
89 # Generate accessor methods for simple object fields
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
90 BEGIN {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
91 foreach my $func (qw(filename write_flag)) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
92 no strict 'refs';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
93 my $field = "_$func";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
94
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
95 *$func = sub {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
96 my( $self, $value ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
97
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
98 if (defined $value) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
99 $self->{$field} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
100 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
101 return $self->{$field};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
102 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
103 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
104 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
105
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
106 =head2 new
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
107
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
108 Usage : $index = Bio::Index::Abstract->new(
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
109 -filename => $dbm_file,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
110 -write_flag => 0,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
111 -dbm_package => 'DB_File',
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
112 -verbose => 0);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
113 Function: Returns a new index object. If filename is
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
114 specified, then open_dbm() is immediately called.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
115 Bio::Index::Abstract->new() will usually be called
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
116 directly only when opening an existing index.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
117 Returns : A new index object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
118 Args : -filename The name of the dbm index file.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
119 -write_flag TRUE if write access to the dbm file is
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
120 needed.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
121 -dbm_package The Perl dbm module to use for the
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
122 index.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
123 -verbose Print debugging output to STDERR if
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
124 TRUE.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
125
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
126 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
127
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
128 sub new {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
129 my($class, @args) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
130 my $self = $class->SUPER::new(@args);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
131 my( $filename, $write_flag, $dbm_package, $cachesize, $ffactor ) =
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
132 $self->_rearrange([qw(FILENAME
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
133 WRITE_FLAG
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
134 DBM_PACKAGE
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
135 CACHESIZE
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
136 FFACTOR
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
137 )], @args);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
138
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
139 # Store any parameters passed
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
140 $self->filename($filename) if $filename;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
141 $self->cachesize($cachesize) if $cachesize;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
142 $self->ffactor($ffactor) if $ffactor;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
143 $self->write_flag($write_flag) if $write_flag;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
144 $self->dbm_package($dbm_package) if $dbm_package;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
145
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
146 $self->{'_filehandle'} = []; # Array in which to cache SeqIO objects
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
147 $self->{'_DB'} = {}; # Gets tied to the DBM file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
148
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
149 # Open database
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
150 $self->open_dbm() if $filename;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
151 return $self;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
152 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
153
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
154 =pod
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
155
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
156 =head2 filename
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
157
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
158 Title : filename
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
159 Usage : $value = $self->filename();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
160 $self->filename($value);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
161 Function: Gets or sets the name of the dbm index file.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
162 Returns : The current value of filename
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
163 Args : Value of filename if setting, or none if
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
164 getting the value.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
165
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
166 =head2 write_flag
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
167
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
168 Title : write_flag
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
169 Usage : $value = $self->write_flag();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
170 $self->write_flag($value);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
171 Function: Gets or sets the value of write_flag, which
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
172 is wether the dbm file should be opened with
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
173 write access.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
174 Returns : The current value of write_flag (default 0)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
175 Args : Value of write_flag if setting, or none if
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
176 getting the value.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
177
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
178 =head2 dbm_package
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
179
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
180 Usage : $value = $self->dbm_package();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
181 $self->dbm_package($value);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
182
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
183 Function: Gets or sets the name of the Perl dbm module used.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
184 If the value is unset, then it returns the value of
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
185 the package variable $USE_DBM_TYPE or if that is
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
186 unset, then it chooses the best available dbm type,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
187 choosing 'DB_File' in preference to 'SDBM_File'.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
188 Bio::Abstract::Index may work with other dbm file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
189 types.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
190
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
191 Returns : The current value of dbm_package
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
192 Args : Value of dbm_package if setting, or none if
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
193 getting the value.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
194
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
195 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
196
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
197 sub dbm_package {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
198 my( $self, $value ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
199 my $to_require = 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
200 if( $value || ! $self->{'_dbm_package'} ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
201 my $type = $value || $USE_DBM_TYPE || 'DB_File';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
202 if( $type =~ /DB_File/i ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
203 eval {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
204 require DB_File;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
205 };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
206 $type = ( $@ ) ? 'SDBM_File' : 'DB_File';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
207 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
208 if( $type ne 'DB_File' ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
209 eval { require "$type.pm"; };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
210 $self->throw($@) if( $@ );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
211 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
212 $self->{'_dbm_package'} = $type;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
213 if( ! defined $USE_DBM_TYPE ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
214 $USE_DBM_TYPE = $self->{'_dbm_package'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
215 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
216 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
217 return $self->{'_dbm_package'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
218 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
219
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
220 =head2 db
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
221
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
222 Title : db
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
223 Usage : $index->db
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
224 Function: Returns a ref to the hash which is tied to the dbm
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
225 file. Used internally when adding and retrieving
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
226 data from the database.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
227 Example : $db = $index->db();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
228 $db->{ $some_key } = $data
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
229 $data = $index->db->{ $some_key };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
230 Returns : ref to HASH
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
231 Args : NONE
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
232
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
233 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
234
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
235 sub db {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
236 return $_[0]->{'_DB'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
237 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
238
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
239
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
240 =head2 get_stream
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
241
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
242 Title : get_stream
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
243 Usage : $stream = $index->get_stream( $id );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
244 Function: Returns a file handle with the file pointer
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
245 at the approprite place
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
246
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
247 This provides for a way to get the actual
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
248 file contents and not an object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
249
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
250 WARNING: you must parse the record deliminter
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
251 *yourself*. Abstract wont do this for you
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
252 So this code
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
253
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
254 $fh = $index->get_stream($myid);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
255 while( <$fh> ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
256 # do something
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
257 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
258 will parse the entire file if you don't put in
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
259 a last statement in, like
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
260
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
261 while( <$fh> ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
262 /^\/\// && last; # end of record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
263 # do something
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
264 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
265
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
266 Returns : A filehandle object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
267 Args : string represents the accession number
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
268 Notes : This method should not be used without forethought
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
269
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
270 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
271
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
272 #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
273
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
274 sub get_stream {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
275 my ($self,$id) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
276
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
277 my ($desc,$acc,$out);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
278 my $db = $self->db();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
279
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
280 if (my $rec = $db->{ $id }) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
281 my( @record );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
282
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
283 my ($file, $begin, $end) = $self->unpack_record( $rec );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
284
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
285 # Get the (possibly cached) filehandle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
286 my $fh = $self->_file_handle( $file );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
287
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
288 # move to start
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
289 seek($fh, $begin, 0);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
290
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
291 return $fh;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
292 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
293 else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
294 $self->throw("Unable to find a record for $id in the flat file index");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
295 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
296 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
297
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
298
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
299 =head2 cachesize
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
300
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
301 Usage : $index->cachesize(1000000)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
302 Function: Sets the dbm file cache size for the index.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
303 Needs to be set before the DBM file gets opened.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
304 Example : $index->cachesize(1000000)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
305 Returns : size of the curent cache
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
306
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
307 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
308
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
309 sub cachesize {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
310 my( $self, $size ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
311
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
312 if(defined $size){
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
313 $self->{'_cachesize'} = $size;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
314 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
315 return ( $self->{'_cachesize'} );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
316
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
317 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
318
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
319
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
320 =head2 ffactor
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
321
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
322 Usage : $index->ffactor(1000000)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
323 Function: Sets the dbm file fill factor.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
324 Needs to be set before the DBM file gets opened.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
325
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
326 Example : $index->ffactor(1000000)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
327 Returns : size of the curent cache
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
328
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
329 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
330
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
331 sub ffactor {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
332 my( $self, $size ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
333
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
334 if(defined $size){
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
335 $self->{'_ffactor'} = $size;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
336 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
337 return ( $self->{'_ffactor'} );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
338
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
339 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
340
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
341
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
342 =head2 open_dbm
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
343
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
344 Usage : $index->open_dbm()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
345 Function: Opens the dbm file associated with the index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
346 object. Write access is only given if explicitly
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
347 asked for by calling new(-write => 1) or having set
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
348 the write_flag(1) on the index object. The type of
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
349 dbm file opened is that returned by dbm_package().
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
350 The name of the file to be is opened is obtained by
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
351 calling the filename() method.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
352
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
353 Example : $index->_open_dbm()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
354 Returns : 1 on success
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
355
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
356 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
357
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
358 sub open_dbm {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
359 my( $self ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
360
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
361 my $filename = $self->filename()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
362 or $self->throw("filename() not set");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
363
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
364 my $db = $self->db();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
365
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
366 # Close the dbm file if already open (maybe we're getting
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
367 # or dropping write access
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
368 if (ref($db) ne 'HASH') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
369 untie($db);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
370 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
371
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
372 # What kind of DBM file are we going to open?
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
373 my $dbm_type = $self->dbm_package;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
374
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
375 # Choose mode for opening dbm file (read/write+create or read-only).
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
376 my $mode_flags = $self->write_flag ? O_RDWR|O_CREAT : O_RDONLY;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
377
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
378 # Open the dbm file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
379 if ($dbm_type eq 'DB_File') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
380 my $hash_inf = DB_File::HASHINFO->new();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
381 my $cache = $self->cachesize();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
382 my $ffactor = $self->ffactor();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
383 if ($cache){
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
384 $hash_inf->{'cachesize'} = $cache;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
385 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
386 if ($ffactor){
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
387 $hash_inf->{'ffactor'} = $ffactor;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
388 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
389 tie( %$db, $dbm_type, $filename, $mode_flags, 0644, $hash_inf )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
390 or $self->throw("Can't open '$dbm_type' dbm file '$filename' : $!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
391 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
392 tie( %$db, $dbm_type, $filename, $mode_flags, 0644 )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
393 or $self->throw("Can't open '$dbm_type' dbm file '$filename' : $!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
394 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
395
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
396 # The following methods access data in the dbm file:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
397
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
398 # Now, if we're a Bio::Index::Abstract caterpillar, then we
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
399 # transform ourselves into a Bio::Index::<something> butterfly!
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
400 if( ref($self) eq "Bio::Index::Abstract" ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
401 my $pkg = $self->_code_base();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
402 bless $self, $pkg;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
403 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
404
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
405 # Check or set this is the right kind and version of index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
406 $self->_type_and_version();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
407
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
408 # Check files haven't changed size since they were indexed
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
409 $self->_check_file_sizes();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
410
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
411 return 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
412 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
413
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
414 =head2 _version
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
415
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
416 Title : _version
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
417 Usage : $type = $index->_version()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
418 Function: Returns a string which identifes the version of an
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
419 index module. Used to permanently identify an index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
420 file as having been created by a particular version
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
421 of the index module. Must be provided by the sub class
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
422 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
423 Returns :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
424 Args : none
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
425
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
426 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
427
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
428 sub _version {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
429 my $self = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
430
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
431 $self->throw("In Bio::Index::Abstract, no _version method in sub class");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
432 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
433
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
434 =head2 _code_base
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
435
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
436 Title : _code_base
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
437 Usage : $code = $db->_code_base();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
438 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
439 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
440 Returns : Code package to be used with this
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
441 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
442
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
443
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
444 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
445
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
446 sub _code_base {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
447 my ($self) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
448 my $code_key = '__TYPE_AND_VERSION';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
449 my $record;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
450
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
451 $record = $self->db->{$code_key};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
452
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
453 my($code,$version) = $self->unpack_record($record);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
454 if( wantarray ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
455 return ($code,$version);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
456 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
457 return $code;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
458 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
459 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
460
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
461
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
462 =head2 _type_and_version
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
463
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
464 Title : _type_and_version
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
465 Usage : Called by _initalize
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
466 Function: Checks that the index opened is made by the same index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
467 module and version of that module that made it. If the
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
468 index is empty, then it adds the information to the
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
469 database.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
470 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
471 Returns : 1 or exception
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
472 Args : none
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
473
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
474 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
475
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
476 sub _type_and_version {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
477 my $self = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
478 my $key = '__TYPE_AND_VERSION';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
479 my $version = $self->_version();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
480 my $type = ref $self;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
481
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
482 # Run check or add type and version key if missing
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
483 if (my $rec = $self->db->{ $key }) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
484 my( $db_type, $db_version ) = $self->unpack_record($rec);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
485 $self->throw("This index file is from version [$db_version] - You need to rebuild it to use module version [$version]")
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
486 unless $db_version == $version;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
487 $self->throw("This index file is type [$db_type] - Can't access it with module for [$type]")
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
488 unless $db_type eq $type;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
489 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
490 $self->add_record( $key, $type, $version )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
491 or $self->throw("Can't add Type and Version record");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
492 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
493 return 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
494 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
495
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
496
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
497 =head2 _check_file_sizes
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
498
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
499 Title : _check_file_sizes
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
500 Usage : $index->_check_file_sizes()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
501 Function: Verifies that the files listed in the database
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
502 are the same size as when the database was built,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
503 or throws an exception. Called by the new()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
504 function.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
505 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
506 Returns : 1 or exception
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
507 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
508
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
509 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
510
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
511 sub _check_file_sizes {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
512 my $self = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
513 my $num = $self->_file_count() || 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
514
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
515 for (my $i = 0; $i < $num; $i++) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
516 my( $file, $stored_size ) = $self->unpack_record( $self->db->{"__FILE_$i"} );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
517 my $size = -s $file;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
518 unless ($size == $stored_size) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
519 $self->throw("file $i [ $file ] has changed size $stored_size -> $size. This probably means you need to rebuild the index.");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
520 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
521 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
522 return 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
523 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
524
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
525
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
526 =head2 make_index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
527
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
528 Title : make_index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
529 Usage : $index->make_index( FILE_LIST )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
530 Function: Takes a list of file names, checks that they are
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
531 all fully qualified, and then calls _filename() on
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
532 each. It supplies _filename() with the name of the
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
533 file, and an integer which is stored with each record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
534 created by _filename(). Can be called multiple times,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
535 and can be used to add to an existing index file.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
536 Example : $index->make_index( '/home/seqs1', '/home/seqs2', '/nfs/pub/big_db' );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
537 Returns : Number of files indexed
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
538 Args : LIST OF FILES
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
539
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
540 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
541
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
542 sub make_index {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
543 my($self, @files) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
544 my $count = 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
545 my $recs = 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
546 # blow up if write flag is not set. EB fix
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
547
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
548 if( !defined $self->write_flag ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
549 $self->throw("Attempting to make an index on a read-only database. What about a WRITE flag on opening the index?");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
550 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
551
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
552 # We're really fussy/lazy, expecting all file names to be fully qualified
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
553 $self->throw("No files to index provided") unless @files;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
554 for(my $i=0;$i<scalar @files; $i++) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
555 if( $Bio::Root::IO::FILESPECLOADED && File::Spec->can('rel2abs') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
556 if( ! File::Spec->file_name_is_absolute($files[$i]) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
557 $files[$i] = File::Spec->rel2abs($files[$i]);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
558 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
559 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
560 if( $^O =~ /MSWin/i ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
561 ($files[$i] =~ m|^[A-Za-z]:/|) ||
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
562 $self->throw("Not an absolute file path '$files[$i]'");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
563 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
564 ($files[$i] =~ m|^/|) ||
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
565 $self->throw("Not an absolute file path '$files[$i]'");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
566 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
567 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
568 $self->throw("File does not exist '$files[$i]'") unless -e $files[$i];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
569 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
570
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
571 # Add each file to the index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
572 FILE :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
573 foreach my $file (@files) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
574
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
575 my $i; # index for this file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
576
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
577 # Get new index for this file and increment file count
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
578 if ( defined(my $count = $self->_file_count) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
579 $i = $count;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
580 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
581 $i = 0; $self->_file_count(0);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
582 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
583
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
584 # see whether this file has been already indexed
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
585 my ($record,$number,$size);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
586
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
587 if( ($record = $self->db->{"__FILENAME_$file"}) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
588 ($number,$size) = $self->unpack_record($record);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
589
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
590 # if it is the same size - fine. Otherwise die
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
591 if( -s $file == $size ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
592 warn "File $file already indexed. Skipping...\n";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
593 next FILE;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
594 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
595 $self->throw("In index, $file has changed size ($size). Indicates that the index is out of date");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
596 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
597 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
598
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
599 # index this file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
600 warn "Indexing file $file\n" if( $self->verbose > 0);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
601
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
602 # this is supplied by the subclass and does the serious work
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
603 $recs += $self->_index_file( $file, $i ); # Specific method for each type of index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
604
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
605
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
606 # Save file name and size for this index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
607 $self->add_record("__FILE_$i", $file, -s $file)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
608 or $self->throw("Can't add data to file: $file");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
609 $self->add_record("__FILENAME_$file", $i, -s $file)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
610 or $self->throw("Can't add data to file: $file");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
611
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
612 # increment file lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
613 $i++; $self->_file_count($i);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
614 my $temp;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
615 $temp = $self->_file_count();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
616
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
617
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
618 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
619 return ($count, $recs);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
620 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
621
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
622 =head2 _filename
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
623
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
624 Title : _filename
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
625 Usage : $index->_filename( FILE INT )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
626 Function: Indexes the file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
627 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
628 Returns :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
629 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
630
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
631 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
632
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
633 sub _index_file {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
634 my $self = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
635
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
636 my $pkg = ref($self);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
637 $self->throw("Error: '$pkg' does not provide the _index_file() method");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
638 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
639
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
640
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
641
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
642 =head2 _file_handle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
643
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
644 Title : _file_handle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
645 Usage : $fh = $index->_file_handle( INT )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
646 Function: Returns an open filehandle for the file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
647 index INT. On opening a new filehandle it
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
648 caches it in the @{$index->_filehandle} array.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
649 If the requested filehandle is already open,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
650 it simply returns it from the array.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
651 Example : $fist_file_indexed = $index->_file_handle( 0 );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
652 Returns : ref to a filehandle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
653 Args : INT
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
654
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
655 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
656
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
657 sub _file_handle {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
658 my( $self, $i ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
659
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
660 unless ($self->{'_filehandle'}[$i]) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
661 my $fh = Symbol::gensym();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
662 my @rec = $self->unpack_record($self->db->{"__FILE_$i"})
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
663 or $self->throw("Can't get filename for index : $i");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
664 my $file = $rec[0];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
665 open $fh, $file or $self->throw("Can't read file '$file' : $!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
666 $self->{'_filehandle'}[$i] = $fh; # Cache filehandle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
667 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
668 return $self->{'_filehandle'}[$i];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
669 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
670
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
671
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
672 =head2 _file_count
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
673
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
674 Title : _file_count
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
675 Usage : $index->_file_count( INT )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
676 Function: Used by the index building sub in a sub class to
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
677 track the number of files indexed. Sets or gets
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
678 the number of files indexed when called with or
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
679 without an argument.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
680 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
681 Returns : INT
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
682 Args : INT
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
683
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
684 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
685
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
686 sub _file_count {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
687 my $self = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
688 if (@_) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
689 $self->db->{'__FILE_COUNT'} = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
690 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
691 return $self->db->{'__FILE_COUNT'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
692 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
693
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
694
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
695 =head2 add_record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
696
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
697 Title : add_record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
698 Usage : $index->add_record( $id, @stuff );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
699 Function: Calls pack_record on @stuff, and adds the result
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
700 of pack_record to the index database under key $id.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
701 If $id is a reference to an array, then a new entry
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
702 is added under a key corresponding to each element
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
703 of the array.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
704 Example : $index->add_record( $id, $fileNumber, $begin, $end )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
705 Returns : TRUE on success or FALSE on failure
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
706 Args : ID LIST
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
707
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
708 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
709
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
710 sub add_record {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
711 my( $self, $id, @rec ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
712 $self->debug( "Adding key $id\n") if( $self->verbose > 0 );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
713 $self->db->{$id} = $self->pack_record( @rec );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
714 return 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
715 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
716
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
717
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
718 =head2 pack_record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
719
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
720 Title : pack_record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
721 Usage : $packed_string = $index->pack_record( LIST )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
722 Function: Packs an array of scalars into a single string
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
723 joined by ASCII 034 (which is unlikely to be used
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
724 in any of the strings), and returns it.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
725 Example : $packed_string = $index->pack_record( $fileNumber, $begin, $end )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
726 Returns : STRING or undef
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
727 Args : LIST
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
728
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
729 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
730
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
731 sub pack_record {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
732 my( $self, @args ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
733 return join "\034", @args;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
734 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
735
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
736 =head2 unpack_record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
737
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
738 Title : unpack_record
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
739 Usage : $index->unpack_record( STRING )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
740 Function: Splits the sting provided into an array,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
741 splitting on ASCII 034.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
742 Example : ( $fileNumber, $begin, $end ) = $index->unpack_record( $self->db->{$id} )
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
743 Returns : A 3 element ARRAY
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
744 Args : STRING containing ASCII 034
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
745
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
746 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
747
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
748 sub unpack_record {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
749 my( $self, @args ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
750 return split /\034/, $args[0];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
751 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
752
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
753 =head2 count_records
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
754
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
755 Title : count_records
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
756 Usage : $recs = $seqdb->count_records()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
757 Function: return count of all recs in the index
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
758 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
759 Returns : a scalar
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
760 Args : none
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
761
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
762
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
763 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
764
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
765 sub count_records {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
766 my ($self,@args) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
767 my $db = $self->db;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
768 my $c = 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
769 while (my($id, $rec) = each %$db) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
770 if( $id =~ /^__/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
771 # internal info
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
772 next;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
773 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
774 $c++;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
775 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
776
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
777 return ($c);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
778 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
779
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
780
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
781 =head2 DESTROY
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
782
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
783 Title : DESTROY
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
784 Usage : Called automatically when index goes out of scope
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
785 Function: Closes connection to database and handles to
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
786 sequence files
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
787 Returns : NEVER
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
788 Args : NONE
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
789
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
790
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
791 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
792
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
793 sub DESTROY {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
794 my $self = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
795 untie($self->{'_DB'});
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
796 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
797
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
798 1;