annotate variant_effect_predictor/Bio/Index/Abstract.pm @ 3:d30fa12e4cc5 default tip

Merge heads 2:a5976b2dce6f and 1:09613ce8151e which were created as a result of a recently fixed bug.
author devteam <devteam@galaxyproject.org>
date Mon, 13 Jan 2014 10:38:30 -0500
parents 1f6dce3d34e0
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
1
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
2 #
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
3 # $Id: Abstract.pm,v 1.41 2002/12/17 02:08:36 jason Exp $
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
4 #
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
5 # BioPerl module for Bio::Index::Abstract
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
6 #
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
7 # Cared for by Ewan Birney <birney@sanger.ac.uk>
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
8 # and James Gilbert <jgrg@sanger.ac.uk>
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
9 #
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
10 # You may distribute this module under the same terms as perl itself
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
11
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
12 # POD documentation - main docs before the code
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
13
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
14 =head1 NAME
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
15
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
16 Bio::Index::Abstract - Abstract interface for indexing a flat file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
17
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
18 =head1 SYNOPSIS
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
19
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
20 You should not be using this module directly
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
21
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
22 =head1 USING DB_FILE
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
23
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
24 To use DB_File and not SDBM for this index, pass the value:
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
25
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
26 -dbm_package => 'DB_File'
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
27
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
28 to new (see below).
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
29
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
30 =head1 DESCRIPTION
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
31
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
32 This object provides the basic mechanism to associate positions
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
33 in files with names. The position and filenames are stored in DBM
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
34 which can then be accessed later on. It is the equivalent of flat
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
35 file indexing (eg, SRS or efetch).
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
36
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
37 This object is the guts to the mechanism, which will be used by the
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
38 specific objects inheriting from it.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
39
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
40 =head1 FEEDBACK
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
41
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
42 =head2 Mailing Lists
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
43
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
44 User feedback is an integral part of the evolution of this and other
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
45 Bioperl modules. Send your comments and suggestions preferably to one
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
46 of the Bioperl mailing lists. Your participation is much appreciated.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
47
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
48 bioperl-l@bioperl.org - General discussion
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
49 http://bioperl.org/MailList.shtml - About the mailing lists
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
50
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
51 =head2 Reporting Bugs
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
52
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
53 Report bugs to the Bioperl bug tracking system to help us keep track
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
54 the bugs and their resolution. Bug reports can be submitted via
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
55 email or the web:
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
56
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
57 bioperl-bugs@bio.perl.org
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
58 http://bugzilla.bioperl.org/
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
59
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
60 =head1 AUTHOR - Ewan Birney, James Gilbert
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
61
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
62 Email - birney@sanger.ac.uk, jgrg@sanger.ac.uk
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
63
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
64 =head1 APPENDIX
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
65
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
66 The rest of the documentation details each of the object methods. Internal
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
67 methods are usually preceded with an "_" (underscore).
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
68
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
69 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
70
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
71
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
72 # Let the code begin...
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
73
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
74 package Bio::Index::Abstract;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
75
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
76 use strict;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
77 use Fcntl qw( O_RDWR O_CREAT O_RDONLY );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
78 use vars qw( $TYPE_AND_VERSION_KEY
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
79 @ISA $USE_DBM_TYPE $DB_HASH );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
80
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
81 # Object preamble - inheriets from Bio::Root::Object
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
82
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
83 use Bio::Root::Root;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
84 use Bio::Root::IO;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
85 use Symbol();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
86
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
87 @ISA = qw(Bio::Root::Root);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
88
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
89 # Generate accessor methods for simple object fields
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
90 BEGIN {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
91 foreach my $func (qw(filename write_flag)) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
92 no strict 'refs';
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
93 my $field = "_$func";
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
94
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
95 *$func = sub {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
96 my( $self, $value ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
97
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
98 if (defined $value) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
99 $self->{$field} = $value;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
100 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
101 return $self->{$field};
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
102 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
103 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
104 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
105
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
106 =head2 new
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
107
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
108 Usage : $index = Bio::Index::Abstract->new(
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
109 -filename => $dbm_file,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
110 -write_flag => 0,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
111 -dbm_package => 'DB_File',
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
112 -verbose => 0);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
113 Function: Returns a new index object. If filename is
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
114 specified, then open_dbm() is immediately called.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
115 Bio::Index::Abstract->new() will usually be called
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
116 directly only when opening an existing index.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
117 Returns : A new index object
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
118 Args : -filename The name of the dbm index file.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
119 -write_flag TRUE if write access to the dbm file is
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
120 needed.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
121 -dbm_package The Perl dbm module to use for the
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
122 index.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
123 -verbose Print debugging output to STDERR if
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
124 TRUE.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
125
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
126 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
127
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
128 sub new {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
129 my($class, @args) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
130 my $self = $class->SUPER::new(@args);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
131 my( $filename, $write_flag, $dbm_package, $cachesize, $ffactor ) =
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
132 $self->_rearrange([qw(FILENAME
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
133 WRITE_FLAG
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
134 DBM_PACKAGE
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
135 CACHESIZE
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
136 FFACTOR
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
137 )], @args);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
138
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
139 # Store any parameters passed
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
140 $self->filename($filename) if $filename;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
141 $self->cachesize($cachesize) if $cachesize;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
142 $self->ffactor($ffactor) if $ffactor;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
143 $self->write_flag($write_flag) if $write_flag;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
144 $self->dbm_package($dbm_package) if $dbm_package;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
145
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
146 $self->{'_filehandle'} = []; # Array in which to cache SeqIO objects
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
147 $self->{'_DB'} = {}; # Gets tied to the DBM file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
148
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
149 # Open database
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
150 $self->open_dbm() if $filename;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
151 return $self;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
152 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
153
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
154 =pod
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
155
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
156 =head2 filename
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
157
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
158 Title : filename
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
159 Usage : $value = $self->filename();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
160 $self->filename($value);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
161 Function: Gets or sets the name of the dbm index file.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
162 Returns : The current value of filename
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
163 Args : Value of filename if setting, or none if
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
164 getting the value.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
165
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
166 =head2 write_flag
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
167
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
168 Title : write_flag
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
169 Usage : $value = $self->write_flag();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
170 $self->write_flag($value);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
171 Function: Gets or sets the value of write_flag, which
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
172 is wether the dbm file should be opened with
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
173 write access.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
174 Returns : The current value of write_flag (default 0)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
175 Args : Value of write_flag if setting, or none if
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
176 getting the value.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
177
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
178 =head2 dbm_package
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
179
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
180 Usage : $value = $self->dbm_package();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
181 $self->dbm_package($value);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
182
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
183 Function: Gets or sets the name of the Perl dbm module used.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
184 If the value is unset, then it returns the value of
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
185 the package variable $USE_DBM_TYPE or if that is
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
186 unset, then it chooses the best available dbm type,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
187 choosing 'DB_File' in preference to 'SDBM_File'.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
188 Bio::Abstract::Index may work with other dbm file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
189 types.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
190
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
191 Returns : The current value of dbm_package
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
192 Args : Value of dbm_package if setting, or none if
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
193 getting the value.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
194
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
195 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
196
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
197 sub dbm_package {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
198 my( $self, $value ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
199 my $to_require = 0;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
200 if( $value || ! $self->{'_dbm_package'} ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
201 my $type = $value || $USE_DBM_TYPE || 'DB_File';
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
202 if( $type =~ /DB_File/i ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
203 eval {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
204 require DB_File;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
205 };
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
206 $type = ( $@ ) ? 'SDBM_File' : 'DB_File';
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
207 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
208 if( $type ne 'DB_File' ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
209 eval { require "$type.pm"; };
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
210 $self->throw($@) if( $@ );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
211 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
212 $self->{'_dbm_package'} = $type;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
213 if( ! defined $USE_DBM_TYPE ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
214 $USE_DBM_TYPE = $self->{'_dbm_package'};
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
215 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
216 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
217 return $self->{'_dbm_package'};
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
218 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
219
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
220 =head2 db
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
221
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
222 Title : db
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
223 Usage : $index->db
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
224 Function: Returns a ref to the hash which is tied to the dbm
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
225 file. Used internally when adding and retrieving
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
226 data from the database.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
227 Example : $db = $index->db();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
228 $db->{ $some_key } = $data
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
229 $data = $index->db->{ $some_key };
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
230 Returns : ref to HASH
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
231 Args : NONE
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
232
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
233 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
234
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
235 sub db {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
236 return $_[0]->{'_DB'};
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
237 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
238
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
239
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
240 =head2 get_stream
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
241
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
242 Title : get_stream
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
243 Usage : $stream = $index->get_stream( $id );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
244 Function: Returns a file handle with the file pointer
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
245 at the approprite place
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
246
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
247 This provides for a way to get the actual
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
248 file contents and not an object
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
249
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
250 WARNING: you must parse the record deliminter
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
251 *yourself*. Abstract wont do this for you
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
252 So this code
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
253
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
254 $fh = $index->get_stream($myid);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
255 while( <$fh> ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
256 # do something
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
257 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
258 will parse the entire file if you don't put in
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
259 a last statement in, like
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
260
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
261 while( <$fh> ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
262 /^\/\// && last; # end of record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
263 # do something
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
264 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
265
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
266 Returns : A filehandle object
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
267 Args : string represents the accession number
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
268 Notes : This method should not be used without forethought
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
269
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
270 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
271
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
272 #'
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
273
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
274 sub get_stream {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
275 my ($self,$id) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
276
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
277 my ($desc,$acc,$out);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
278 my $db = $self->db();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
279
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
280 if (my $rec = $db->{ $id }) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
281 my( @record );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
282
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
283 my ($file, $begin, $end) = $self->unpack_record( $rec );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
284
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
285 # Get the (possibly cached) filehandle
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
286 my $fh = $self->_file_handle( $file );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
287
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
288 # move to start
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
289 seek($fh, $begin, 0);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
290
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
291 return $fh;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
292 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
293 else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
294 $self->throw("Unable to find a record for $id in the flat file index");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
295 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
296 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
297
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
298
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
299 =head2 cachesize
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
300
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
301 Usage : $index->cachesize(1000000)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
302 Function: Sets the dbm file cache size for the index.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
303 Needs to be set before the DBM file gets opened.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
304 Example : $index->cachesize(1000000)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
305 Returns : size of the curent cache
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
306
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
307 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
308
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
309 sub cachesize {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
310 my( $self, $size ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
311
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
312 if(defined $size){
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
313 $self->{'_cachesize'} = $size;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
314 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
315 return ( $self->{'_cachesize'} );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
316
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
317 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
318
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
319
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
320 =head2 ffactor
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
321
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
322 Usage : $index->ffactor(1000000)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
323 Function: Sets the dbm file fill factor.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
324 Needs to be set before the DBM file gets opened.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
325
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
326 Example : $index->ffactor(1000000)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
327 Returns : size of the curent cache
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
328
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
329 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
330
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
331 sub ffactor {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
332 my( $self, $size ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
333
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
334 if(defined $size){
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
335 $self->{'_ffactor'} = $size;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
336 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
337 return ( $self->{'_ffactor'} );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
338
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
339 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
340
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
341
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
342 =head2 open_dbm
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
343
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
344 Usage : $index->open_dbm()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
345 Function: Opens the dbm file associated with the index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
346 object. Write access is only given if explicitly
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
347 asked for by calling new(-write => 1) or having set
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
348 the write_flag(1) on the index object. The type of
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
349 dbm file opened is that returned by dbm_package().
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
350 The name of the file to be is opened is obtained by
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
351 calling the filename() method.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
352
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
353 Example : $index->_open_dbm()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
354 Returns : 1 on success
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
355
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
356 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
357
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
358 sub open_dbm {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
359 my( $self ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
360
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
361 my $filename = $self->filename()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
362 or $self->throw("filename() not set");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
363
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
364 my $db = $self->db();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
365
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
366 # Close the dbm file if already open (maybe we're getting
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
367 # or dropping write access
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
368 if (ref($db) ne 'HASH') {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
369 untie($db);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
370 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
371
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
372 # What kind of DBM file are we going to open?
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
373 my $dbm_type = $self->dbm_package;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
374
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
375 # Choose mode for opening dbm file (read/write+create or read-only).
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
376 my $mode_flags = $self->write_flag ? O_RDWR|O_CREAT : O_RDONLY;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
377
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
378 # Open the dbm file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
379 if ($dbm_type eq 'DB_File') {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
380 my $hash_inf = DB_File::HASHINFO->new();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
381 my $cache = $self->cachesize();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
382 my $ffactor = $self->ffactor();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
383 if ($cache){
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
384 $hash_inf->{'cachesize'} = $cache;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
385 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
386 if ($ffactor){
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
387 $hash_inf->{'ffactor'} = $ffactor;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
388 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
389 tie( %$db, $dbm_type, $filename, $mode_flags, 0644, $hash_inf )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
390 or $self->throw("Can't open '$dbm_type' dbm file '$filename' : $!");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
391 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
392 tie( %$db, $dbm_type, $filename, $mode_flags, 0644 )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
393 or $self->throw("Can't open '$dbm_type' dbm file '$filename' : $!");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
394 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
395
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
396 # The following methods access data in the dbm file:
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
397
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
398 # Now, if we're a Bio::Index::Abstract caterpillar, then we
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
399 # transform ourselves into a Bio::Index::<something> butterfly!
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
400 if( ref($self) eq "Bio::Index::Abstract" ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
401 my $pkg = $self->_code_base();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
402 bless $self, $pkg;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
403 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
404
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
405 # Check or set this is the right kind and version of index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
406 $self->_type_and_version();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
407
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
408 # Check files haven't changed size since they were indexed
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
409 $self->_check_file_sizes();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
410
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
411 return 1;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
412 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
413
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
414 =head2 _version
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
415
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
416 Title : _version
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
417 Usage : $type = $index->_version()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
418 Function: Returns a string which identifes the version of an
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
419 index module. Used to permanently identify an index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
420 file as having been created by a particular version
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
421 of the index module. Must be provided by the sub class
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
422 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
423 Returns :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
424 Args : none
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
425
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
426 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
427
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
428 sub _version {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
429 my $self = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
430
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
431 $self->throw("In Bio::Index::Abstract, no _version method in sub class");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
432 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
433
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
434 =head2 _code_base
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
435
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
436 Title : _code_base
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
437 Usage : $code = $db->_code_base();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
438 Function:
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
439 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
440 Returns : Code package to be used with this
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
441 Args :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
442
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
443
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
444 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
445
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
446 sub _code_base {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
447 my ($self) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
448 my $code_key = '__TYPE_AND_VERSION';
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
449 my $record;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
450
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
451 $record = $self->db->{$code_key};
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
452
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
453 my($code,$version) = $self->unpack_record($record);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
454 if( wantarray ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
455 return ($code,$version);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
456 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
457 return $code;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
458 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
459 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
460
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
461
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
462 =head2 _type_and_version
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
463
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
464 Title : _type_and_version
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
465 Usage : Called by _initalize
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
466 Function: Checks that the index opened is made by the same index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
467 module and version of that module that made it. If the
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
468 index is empty, then it adds the information to the
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
469 database.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
470 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
471 Returns : 1 or exception
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
472 Args : none
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
473
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
474 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
475
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
476 sub _type_and_version {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
477 my $self = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
478 my $key = '__TYPE_AND_VERSION';
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
479 my $version = $self->_version();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
480 my $type = ref $self;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
481
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
482 # Run check or add type and version key if missing
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
483 if (my $rec = $self->db->{ $key }) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
484 my( $db_type, $db_version ) = $self->unpack_record($rec);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
485 $self->throw("This index file is from version [$db_version] - You need to rebuild it to use module version [$version]")
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
486 unless $db_version == $version;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
487 $self->throw("This index file is type [$db_type] - Can't access it with module for [$type]")
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
488 unless $db_type eq $type;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
489 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
490 $self->add_record( $key, $type, $version )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
491 or $self->throw("Can't add Type and Version record");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
492 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
493 return 1;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
494 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
495
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
496
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
497 =head2 _check_file_sizes
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
498
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
499 Title : _check_file_sizes
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
500 Usage : $index->_check_file_sizes()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
501 Function: Verifies that the files listed in the database
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
502 are the same size as when the database was built,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
503 or throws an exception. Called by the new()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
504 function.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
505 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
506 Returns : 1 or exception
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
507 Args :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
508
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
509 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
510
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
511 sub _check_file_sizes {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
512 my $self = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
513 my $num = $self->_file_count() || 0;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
514
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
515 for (my $i = 0; $i < $num; $i++) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
516 my( $file, $stored_size ) = $self->unpack_record( $self->db->{"__FILE_$i"} );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
517 my $size = -s $file;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
518 unless ($size == $stored_size) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
519 $self->throw("file $i [ $file ] has changed size $stored_size -> $size. This probably means you need to rebuild the index.");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
520 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
521 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
522 return 1;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
523 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
524
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
525
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
526 =head2 make_index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
527
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
528 Title : make_index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
529 Usage : $index->make_index( FILE_LIST )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
530 Function: Takes a list of file names, checks that they are
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
531 all fully qualified, and then calls _filename() on
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
532 each. It supplies _filename() with the name of the
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
533 file, and an integer which is stored with each record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
534 created by _filename(). Can be called multiple times,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
535 and can be used to add to an existing index file.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
536 Example : $index->make_index( '/home/seqs1', '/home/seqs2', '/nfs/pub/big_db' );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
537 Returns : Number of files indexed
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
538 Args : LIST OF FILES
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
539
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
540 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
541
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
542 sub make_index {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
543 my($self, @files) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
544 my $count = 0;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
545 my $recs = 0;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
546 # blow up if write flag is not set. EB fix
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
547
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
548 if( !defined $self->write_flag ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
549 $self->throw("Attempting to make an index on a read-only database. What about a WRITE flag on opening the index?");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
550 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
551
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
552 # We're really fussy/lazy, expecting all file names to be fully qualified
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
553 $self->throw("No files to index provided") unless @files;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
554 for(my $i=0;$i<scalar @files; $i++) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
555 if( $Bio::Root::IO::FILESPECLOADED && File::Spec->can('rel2abs') ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
556 if( ! File::Spec->file_name_is_absolute($files[$i]) ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
557 $files[$i] = File::Spec->rel2abs($files[$i]);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
558 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
559 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
560 if( $^O =~ /MSWin/i ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
561 ($files[$i] =~ m|^[A-Za-z]:/|) ||
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
562 $self->throw("Not an absolute file path '$files[$i]'");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
563 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
564 ($files[$i] =~ m|^/|) ||
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
565 $self->throw("Not an absolute file path '$files[$i]'");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
566 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
567 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
568 $self->throw("File does not exist '$files[$i]'") unless -e $files[$i];
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
569 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
570
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
571 # Add each file to the index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
572 FILE :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
573 foreach my $file (@files) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
574
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
575 my $i; # index for this file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
576
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
577 # Get new index for this file and increment file count
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
578 if ( defined(my $count = $self->_file_count) ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
579 $i = $count;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
580 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
581 $i = 0; $self->_file_count(0);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
582 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
583
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
584 # see whether this file has been already indexed
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
585 my ($record,$number,$size);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
586
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
587 if( ($record = $self->db->{"__FILENAME_$file"}) ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
588 ($number,$size) = $self->unpack_record($record);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
589
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
590 # if it is the same size - fine. Otherwise die
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
591 if( -s $file == $size ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
592 warn "File $file already indexed. Skipping...\n";
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
593 next FILE;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
594 } else {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
595 $self->throw("In index, $file has changed size ($size). Indicates that the index is out of date");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
596 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
597 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
598
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
599 # index this file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
600 warn "Indexing file $file\n" if( $self->verbose > 0);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
601
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
602 # this is supplied by the subclass and does the serious work
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
603 $recs += $self->_index_file( $file, $i ); # Specific method for each type of index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
604
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
605
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
606 # Save file name and size for this index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
607 $self->add_record("__FILE_$i", $file, -s $file)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
608 or $self->throw("Can't add data to file: $file");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
609 $self->add_record("__FILENAME_$file", $i, -s $file)
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
610 or $self->throw("Can't add data to file: $file");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
611
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
612 # increment file lines
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
613 $i++; $self->_file_count($i);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
614 my $temp;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
615 $temp = $self->_file_count();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
616
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
617
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
618 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
619 return ($count, $recs);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
620 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
621
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
622 =head2 _filename
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
623
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
624 Title : _filename
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
625 Usage : $index->_filename( FILE INT )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
626 Function: Indexes the file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
627 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
628 Returns :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
629 Args :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
630
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
631 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
632
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
633 sub _index_file {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
634 my $self = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
635
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
636 my $pkg = ref($self);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
637 $self->throw("Error: '$pkg' does not provide the _index_file() method");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
638 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
639
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
640
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
641
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
642 =head2 _file_handle
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
643
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
644 Title : _file_handle
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
645 Usage : $fh = $index->_file_handle( INT )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
646 Function: Returns an open filehandle for the file
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
647 index INT. On opening a new filehandle it
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
648 caches it in the @{$index->_filehandle} array.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
649 If the requested filehandle is already open,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
650 it simply returns it from the array.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
651 Example : $fist_file_indexed = $index->_file_handle( 0 );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
652 Returns : ref to a filehandle
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
653 Args : INT
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
654
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
655 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
656
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
657 sub _file_handle {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
658 my( $self, $i ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
659
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
660 unless ($self->{'_filehandle'}[$i]) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
661 my $fh = Symbol::gensym();
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
662 my @rec = $self->unpack_record($self->db->{"__FILE_$i"})
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
663 or $self->throw("Can't get filename for index : $i");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
664 my $file = $rec[0];
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
665 open $fh, $file or $self->throw("Can't read file '$file' : $!");
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
666 $self->{'_filehandle'}[$i] = $fh; # Cache filehandle
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
667 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
668 return $self->{'_filehandle'}[$i];
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
669 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
670
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
671
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
672 =head2 _file_count
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
673
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
674 Title : _file_count
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
675 Usage : $index->_file_count( INT )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
676 Function: Used by the index building sub in a sub class to
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
677 track the number of files indexed. Sets or gets
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
678 the number of files indexed when called with or
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
679 without an argument.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
680 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
681 Returns : INT
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
682 Args : INT
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
683
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
684 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
685
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
686 sub _file_count {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
687 my $self = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
688 if (@_) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
689 $self->db->{'__FILE_COUNT'} = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
690 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
691 return $self->db->{'__FILE_COUNT'};
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
692 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
693
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
694
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
695 =head2 add_record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
696
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
697 Title : add_record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
698 Usage : $index->add_record( $id, @stuff );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
699 Function: Calls pack_record on @stuff, and adds the result
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
700 of pack_record to the index database under key $id.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
701 If $id is a reference to an array, then a new entry
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
702 is added under a key corresponding to each element
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
703 of the array.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
704 Example : $index->add_record( $id, $fileNumber, $begin, $end )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
705 Returns : TRUE on success or FALSE on failure
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
706 Args : ID LIST
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
707
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
708 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
709
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
710 sub add_record {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
711 my( $self, $id, @rec ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
712 $self->debug( "Adding key $id\n") if( $self->verbose > 0 );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
713 $self->db->{$id} = $self->pack_record( @rec );
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
714 return 1;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
715 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
716
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
717
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
718 =head2 pack_record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
719
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
720 Title : pack_record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
721 Usage : $packed_string = $index->pack_record( LIST )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
722 Function: Packs an array of scalars into a single string
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
723 joined by ASCII 034 (which is unlikely to be used
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
724 in any of the strings), and returns it.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
725 Example : $packed_string = $index->pack_record( $fileNumber, $begin, $end )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
726 Returns : STRING or undef
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
727 Args : LIST
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
728
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
729 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
730
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
731 sub pack_record {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
732 my( $self, @args ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
733 return join "\034", @args;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
734 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
735
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
736 =head2 unpack_record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
737
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
738 Title : unpack_record
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
739 Usage : $index->unpack_record( STRING )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
740 Function: Splits the sting provided into an array,
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
741 splitting on ASCII 034.
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
742 Example : ( $fileNumber, $begin, $end ) = $index->unpack_record( $self->db->{$id} )
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
743 Returns : A 3 element ARRAY
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
744 Args : STRING containing ASCII 034
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
745
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
746 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
747
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
748 sub unpack_record {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
749 my( $self, @args ) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
750 return split /\034/, $args[0];
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
751 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
752
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
753 =head2 count_records
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
754
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
755 Title : count_records
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
756 Usage : $recs = $seqdb->count_records()
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
757 Function: return count of all recs in the index
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
758 Example :
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
759 Returns : a scalar
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
760 Args : none
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
761
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
762
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
763 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
764
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
765 sub count_records {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
766 my ($self,@args) = @_;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
767 my $db = $self->db;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
768 my $c = 0;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
769 while (my($id, $rec) = each %$db) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
770 if( $id =~ /^__/ ) {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
771 # internal info
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
772 next;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
773 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
774 $c++;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
775 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
776
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
777 return ($c);
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
778 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
779
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
780
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
781 =head2 DESTROY
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
782
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
783 Title : DESTROY
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
784 Usage : Called automatically when index goes out of scope
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
785 Function: Closes connection to database and handles to
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
786 sequence files
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
787 Returns : NEVER
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
788 Args : NONE
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
789
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
790
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
791 =cut
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
792
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
793 sub DESTROY {
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
794 my $self = shift;
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
795 untie($self->{'_DB'});
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
796 }
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
797
1f6dce3d34e0 Uploaded
mahtabm
parents:
diff changeset
798 1;