annotate variant_effect_predictor/Bio/SeqIO/embl.pm @ 1:d6778b5d8382 draft default tip

Deleted selected files
author willmclaren
date Fri, 03 Aug 2012 10:05:43 -0400
parents 21066c0abaf5
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1 # $Id: embl.pm,v 1.57.2.6 2003/09/14 19:06:51 jason Exp $
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
2 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
3 # BioPerl module for Bio::SeqIO::EMBL
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
4 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
5 # Cared for by Ewan Birney <birney@ebi.ac.uk>
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
6 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
7 # Copyright Ewan Birney
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
8 #
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
9 # You may distribute this module under the same terms as perl itself
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
10
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
11 # POD documentation - main docs before the code
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
12
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
13 =head1 NAME
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
14
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
15 Bio::SeqIO::embl - EMBL sequence input/output stream
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
16
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
17 =head1 SYNOPSIS
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
18
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
19 It is probably best not to use this object directly, but
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
20 rather go through the AnnSeqIO handler system. Go:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
21
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
22 $stream = Bio::SeqIO->new(-file => $filename, -format => 'EMBL');
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
23
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
24 while ( (my $seq = $stream->next_seq()) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
25 # do something with $seq
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
26 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
27
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
28 =head1 DESCRIPTION
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
29
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
30 This object can transform Bio::Seq objects to and from EMBL flat
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
31 file databases.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
32
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
33 There is alot of flexibility here about how to dump things which I need
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
34 to document fully.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
35
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
36 There should be a common object that this and genbank share (probably
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
37 with swissprot). Too much of the magic is identical.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
38
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
39 =head2 Optional functions
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
40
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
41 =over 3
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
42
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
43 =item _show_dna()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
44
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
45 (output only) shows the dna or not
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
46
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
47 =item _post_sort()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
48
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
49 (output only) provides a sorting func which is applied to the FTHelpers
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
50 before printing
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
51
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
52 =item _id_generation_func()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
53
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
54 This is function which is called as
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
55
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
56 print "ID ", $func($annseq), "\n";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
57
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
58 To generate the ID line. If it is not there, it generates a sensible ID
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
59 line using a number of tools.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
60
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
61 If you want to output annotations in embl format they need to be
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
62 stored in a Bio::Annotation::Collection object which is accessible
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
63 through the Bio::SeqI interface method L<annotation()|annotation>.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
64
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
65 The following are the names of the keys which are polled from a
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
66 L<Bio::Annotation::Collection> object.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
67
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
68 reference - Should contain Bio::Annotation::Reference objects
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
69 comment - Should contain Bio::Annotation::Comment objects
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
70 dblink - Should contain Bio::Annotation::DBLink objects
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
71
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
72 =back
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
73
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
74 =head1 FEEDBACK
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
75
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
76 =head2 Mailing Lists
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
77
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
78 User feedback is an integral part of the evolution of this
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
79 and other Bioperl modules. Send your comments and suggestions preferably
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
80 to one of the Bioperl mailing lists.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
81 Your participation is much appreciated.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
82
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
83 bioperl-l@bioperl.org - General discussion
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
84 http://www.bioperl.org/MailList.shtml - About the mailing lists
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
85
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
86 =head2 Reporting Bugs
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
87
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
88 Report bugs to the Bioperl bug tracking system to help us keep track
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
89 the bugs and their resolution.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
90 Bug reports can be submitted via email or the web:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
91
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
92 bioperl-bugs@bio.perl.org
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
93 http://bugzilla.bioperl.org/
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
94
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
95 =head1 AUTHOR - Ewan Birney
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
96
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
97 Email birney@ebi.ac.uk
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
98
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
99 Describe contact details here
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
100
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
101 =head1 APPENDIX
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
102
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
103 The rest of the documentation details each of the object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
104 methods. Internal methods are usually preceded with a _
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
105
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
106 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
107
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
108
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
109 # Let the code begin...
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
110
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
111
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
112 package Bio::SeqIO::embl;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
113 use vars qw(@ISA);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
114 use strict;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
115 use Bio::SeqIO::FTHelper;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
116 use Bio::SeqFeature::Generic;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
117 use Bio::Species;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
118 use Bio::Seq::SeqFactory;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
119 use Bio::Annotation::Collection;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
120 use Bio::Annotation::Comment;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
121 use Bio::Annotation::Reference;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
122 use Bio::Annotation::DBLink;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
123
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
124 @ISA = qw(Bio::SeqIO);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
125
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
126 sub _initialize {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
127 my($self,@args) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
128
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
129 $self->SUPER::_initialize(@args);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
130 # hash for functions for decoding keys.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
131 $self->{'_func_ftunit_hash'} = {};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
132 $self->_show_dna(1); # sets this to one by default. People can change it
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
133 if( ! defined $self->sequence_factory ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
134 $self->sequence_factory(new Bio::Seq::SeqFactory
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
135 (-verbose => $self->verbose(),
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
136 -type => 'Bio::Seq::RichSeq'));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
137 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
138 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
139
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
140 =head2 next_seq
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
141
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
142 Title : next_seq
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
143 Usage : $seq = $stream->next_seq()
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
144 Function: returns the next sequence in the stream
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
145 Returns : Bio::Seq object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
146 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
147
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
148 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
149
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
150 sub next_seq {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
151 my ($self,@args) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
152 my ($pseq,$c,$line,$name,$desc,$acc,$seqc,$mol,$div,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
153 $date, $comment, @date_arr);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
154
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
155 my ($annotation, %params, @features) = ( new Bio::Annotation::Collection);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
156
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
157 $line = $self->_readline; # This needs to be before the first eof() test
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
158
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
159 if( !defined $line ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
160 return undef; # no throws - end of file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
161 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
162
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
163 if( $line =~ /^\s+$/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
164 while( defined ($line = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
165 $line =~/^\S/ && last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
166 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
167 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
168 if( !defined $line ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
169 return undef; # end of file
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
170 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
171 $line =~ /^ID\s+\S+/ || $self->throw("EMBL stream with no ID. Not embl in my book");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
172 $line =~ /^ID\s+(\S+)\s+\S+\;\s+([^;]+)\;\s+(\S+)\;/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
173 $name = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
174 $mol = $2;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
175 $div = $3;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
176 if(! $name) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
177 $name = "unknown id";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
178 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
179 my $alphabet;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
180
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
181 # this is important to have the id for display in e.g. FTHelper, otherwise
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
182 # you won't know which entry caused an error
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
183 if($mol) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
184 if ( $mol =~ /circular/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
185 $params{'-is_circular'} = 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
186 $mol =~ s|circular ||;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
187 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
188 if (defined $mol ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
189 if ($mol =~ /DNA/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
190 $alphabet='dna';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
191 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
192 elsif ($mol =~ /RNA/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
193 $alphabet='rna';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
194 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
195 elsif ($mol =~ /AA/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
196 $alphabet='protein';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
197 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
198 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
199
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
200 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
201
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
202 # $self->warn("not parsing upper annotation in EMBL file yet!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
203 my $buffer = $line;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
204
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
205 local $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
206
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
207 BEFORE_FEATURE_TABLE :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
208 until( !defined $buffer ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
209 $_ = $buffer;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
210
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
211 # Exit at start of Feature table
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
212 last if /^F[HT]/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
213
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
214 # Description line(s)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
215 if (/^DE\s+(\S.*\S)/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
216 $desc .= $desc ? " $1" : $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
217 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
218
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
219 #accession number
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
220 if( /^AC\s+(.*)?/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
221 my @accs = split(/[; ]+/, $1); # allow space in addition
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
222 $params{'-accession_number'} = shift @accs
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
223 unless defined $params{'-accession_number'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
224 push @{$params{'-secondary_accessions'}}, @accs;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
225 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
226
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
227 #version number
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
228 if( /^SV\s+\S+\.(\d+);?/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
229 my $sv = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
230 #$sv =~ s/\;//;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
231 $params{'-seq_version'} = $sv;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
232 $params{'-version'} = $sv;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
233 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
234
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
235 #date (NOTE: takes last date line)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
236 if( /^DT\s+(.+)$/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
237 my $date = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
238 push @{$params{'-dates'}}, $date;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
239 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
240
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
241 #keywords
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
242 if( /^KW\s+(.*)\S*$/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
243 my @kw = split(/\s*\;\s*/,$1);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
244 push @{$params{'-keywords'}}, @kw;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
245 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
246
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
247 # Organism name and phylogenetic information
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
248 elsif (/^O[SC]/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
249 my $species = $self->_read_EMBL_Species(\$buffer);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
250 $params{'-species'}= $species;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
251 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
252
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
253 # References
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
254 elsif (/^R/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
255 my @refs = $self->_read_EMBL_References(\$buffer);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
256 foreach my $ref ( @refs ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
257 $annotation->add_Annotation('reference',$ref);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
258 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
259 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
260
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
261 # DB Xrefs
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
262 elsif (/^DR/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
263 my @links = $self->_read_EMBL_DBLink(\$buffer);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
264 foreach my $dblink ( @links ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
265 $annotation->add_Annotation('dblink',$dblink);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
266 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
267 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
268
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
269 # Comments
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
270 elsif (/^CC\s+(.*)/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
271 $comment .= $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
272 $comment .= " ";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
273 while (defined ($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
274 if (/^CC\s+(.*)/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
275 $comment .= $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
276 $comment .= " ";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
277 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
278 else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
279 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
280 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
281 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
282 my $commobj = Bio::Annotation::Comment->new();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
283 $commobj->text($comment);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
284 $annotation->add_Annotation('comment',$commobj);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
285 $comment = "";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
286 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
287
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
288 # Get next line.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
289 $buffer = $self->_readline;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
290 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
291
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
292 while( defined ($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
293 /^FT \w/ && last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
294 /^SQ / && last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
295 /^CO / && last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
296 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
297 $buffer = $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
298
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
299 if (defined($buffer) && $buffer =~ /^FT /) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
300 until( !defined ($buffer) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
301 my $ftunit = $self->_read_FTHelper_EMBL(\$buffer);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
302 # process ftunit
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
303
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
304 push(@features,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
305 $ftunit->_generic_seqfeature($self->location_factory(), $name));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
306
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
307 if( $buffer !~ /^FT/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
308 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
309 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
310 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
311 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
312
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
313
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
314 # skip comments
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
315 while( defined ($buffer) && $buffer =~ /^XX/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
316 $buffer = $self->_readline();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
317 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
318
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
319 if( $buffer =~ /^CO/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
320 until( !defined ($buffer) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
321 my $ftunit = $self->_read_FTHelper_EMBL(\$buffer);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
322 # process ftunit
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
323 push(@features,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
324 $ftunit->_generic_seqfeature($self->location_factory(),
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
325 $name));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
326
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
327 if( $buffer !~ /^CO/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
328 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
329 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
330 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
331 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
332 if( $buffer !~ /^SQ/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
333 while( defined ($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
334 /^SQ/ && last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
335 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
336 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
337
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
338 $seqc = "";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
339 while( defined ($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
340 /^\/\// && last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
341 $_ = uc($_);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
342 s/[^A-Za-z]//g;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
343 $seqc .= $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
344 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
345 my $seq = $self->sequence_factory->create
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
346 (-verbose => $self->verbose(),
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
347 -division => $div,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
348 -seq => $seqc,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
349 -desc => $desc,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
350 -display_id => $name,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
351 -annotation => $annotation,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
352 -molecule => $mol,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
353 -alphabet => $alphabet,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
354 -features => \@features,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
355 %params);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
356
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
357 return $seq;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
358 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
359
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
360 =head2 write_seq
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
361
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
362 Title : write_seq
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
363 Usage : $stream->write_seq($seq)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
364 Function: writes the $seq object (must be seq) to the stream
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
365 Returns : 1 for success and 0 for error
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
366 Args : array of 1 to n Bio::SeqI objects
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
367
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
368
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
369 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
370
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
371 sub write_seq {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
372 my ($self,@seqs) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
373
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
374 foreach my $seq ( @seqs ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
375 $self->throw("Attempting to write with no seq!") unless defined $seq;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
376 unless ( ref $seq && $seq->isa('Bio::SeqI' ) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
377 $self->warn("$seq is not a SeqI compliant sequence object!")
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
378 if $self->verbose >= 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
379 unless ( ref $seq && $seq->isa('Bio::PrimarySeqI' ) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
380 $self->throw("$seq is not a PrimarySeqI compliant sequence object!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
381 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
382 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
383 my $str = $seq->seq || '';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
384
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
385 my $mol;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
386 my $div;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
387 my $len = $seq->length();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
388
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
389 if ($seq->can('division') && defined $seq->division) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
390 $div = $seq->division();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
391 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
392 $div ||= 'UNK';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
393
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
394 if ($seq->can('molecule')) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
395 $mol = $seq->molecule();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
396 $mol = 'RNA' if defined $mol && $mol =~ /RNA/; # no 'mRNA'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
397 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
398 elsif ($seq->can('primary_seq') && defined $seq->primary_seq->alphabet) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
399 my $alphabet =$seq->primary_seq->alphabet;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
400 if ($alphabet eq 'dna') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
401 $mol ='DNA';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
402 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
403 elsif ($alphabet eq 'rna') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
404 $mol='RNA';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
405 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
406 elsif ($alphabet eq 'protein') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
407 $mol='AA';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
408 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
409 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
410 $mol ||= 'XXX';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
411 $mol = "circular $mol" if $seq->is_circular;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
412
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
413 my $temp_line;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
414 if( $self->_id_generation_func ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
415 $temp_line = &{$self->_id_generation_func}($seq);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
416 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
417 $temp_line = sprintf("%-11sstandard; $mol; $div; %d BP.", $seq->id(), $len);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
418 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
419
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
420 $self->_print( "ID $temp_line\n","XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
421
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
422 # Write the accession line if present
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
423 my( $acc );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
424 {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
425 if( my $func = $self->_ac_generation_func ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
426 $acc = &{$func}($seq);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
427 } elsif( $seq->isa('Bio::Seq::RichSeqI') &&
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
428 defined($seq->accession_number) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
429 $acc = $seq->accession_number;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
430 $acc = join(";", $acc, $seq->get_secondary_accessions);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
431 } elsif ( $seq->can('accession_number') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
432 $acc = $seq->accession_number;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
433 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
434
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
435 if (defined $acc) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
436 $self->_print("AC $acc;\n",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
437 "XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
438 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
439 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
440
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
441 # Write the sv line if present
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
442 {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
443 my( $sv );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
444 if (my $func = $self->_sv_generation_func) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
445 $sv = &{$func}($seq);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
446 } elsif($seq->isa('Bio::Seq::RichSeqI') &&
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
447 defined($seq->seq_version)) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
448 $sv = "$acc.". $seq->seq_version();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
449 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
450 if (defined $sv) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
451 $self->_print( "SV $sv\n",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
452 "XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
453 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
454 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
455
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
456 # Date lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
457 my $switch=0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
458 if( $seq->can('get_dates') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
459 foreach my $dt ( $seq->get_dates() ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
460 $self->_write_line_EMBL_regex("DT ","DT ",$dt,'\s+|$',80);#'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
461 $switch=1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
462 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
463 if ($switch == 1) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
464 $self->_print("XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
465 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
466 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
467
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
468 # Description lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
469 $self->_write_line_EMBL_regex("DE ","DE ",$seq->desc(),'\s+|$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
470 $self->_print( "XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
471
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
472 # if there, write the kw line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
473 {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
474 my( $kw );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
475 if( my $func = $self->_kw_generation_func ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
476 $kw = &{$func}($seq);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
477 } elsif( $seq->can('keywords') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
478 $kw = $seq->keywords;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
479 if( ref($kw) =~ /ARRAY/i ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
480 $kw = join("; ", @$kw);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
481 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
482 $kw .= '.' if( defined $kw && $kw !~ /\.$/ );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
483 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
484 if (defined $kw) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
485 $self->_write_line_EMBL_regex("KW ", "KW ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
486 $kw, '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
487 $self->_print( "XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
488
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
489 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
490 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
491
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
492 # Organism lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
493
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
494 if ($seq->can('species') && (my $spec = $seq->species)) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
495 my($species, @class) = $spec->classification();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
496 my $genus = $class[0];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
497 my $OS = "$genus $species";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
498 if (my $ssp = $spec->sub_species) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
499 $OS .= " $ssp";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
500 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
501 if (my $common = $spec->common_name) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
502 $OS .= " ($common)";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
503 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
504 $self->_print("OS $OS\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
505 my $OC = join('; ', reverse(@class)) .'.';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
506 $self->_write_line_EMBL_regex("OC ","OC ",$OC,'; |$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
507 if ($spec->organelle) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
508 $self->_write_line_EMBL_regex("OG ","OG ",$spec->organelle,'; |$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
509 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
510 $self->_print("XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
511 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
512
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
513 # Reference lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
514 my $t = 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
515 if ( $seq->can('annotation') && defined $seq->annotation ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
516 foreach my $ref ( $seq->annotation->get_Annotations('reference') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
517 $self->_print( "RN [$t]\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
518
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
519 # Having no RP line is legal, but we need both
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
520 # start and end for a valid location.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
521 my $start = $ref->start;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
522 my $end = $ref->end;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
523 if ($start and $end) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
524 $self->_print( "RP $start-$end\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
525 } elsif ($start or $end) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
526 $self->throw("Both start and end are needed for a valid RP line. Got: start='$start' end='$end'");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
527 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
528
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
529 if (my $med = $ref->medline) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
530 $self->_print( "RX MEDLINE; $med.\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
531 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
532 if (my $pm = $ref->pubmed) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
533 $self->_print( "RX PUBMED; $pm.\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
534 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
535 $self->_write_line_EMBL_regex("RA ", "RA ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
536 $ref->authors . ";",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
537 '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
538
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
539 # If there is no title to the reference, it appears
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
540 # as a single semi-colon. All titles must end in
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
541 # a semi-colon.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
542 my $ref_title = $ref->title || '';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
543 $ref_title =~ s/[\s;]*$/;/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
544 $self->_write_line_EMBL_regex("RT ", "RT ", $ref_title, '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
545 $self->_write_line_EMBL_regex("RL ", "RL ", $ref->location, '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
546 if ($ref->comment) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
547 $self->_write_line_EMBL_regex("RC ", "RC ", $ref->comment, '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
548 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
549 $self->_print("XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
550 $t++;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
551 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
552
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
553
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
554 # DB Xref lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
555 if (my @db_xref = $seq->annotation->get_Annotations('dblink') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
556 foreach my $dr (@db_xref) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
557 my $db_name = $dr->database;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
558 my $prim = $dr->primary_id;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
559 my $opt = $dr->optional_id || '';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
560
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
561 my $line = "$db_name; $prim; $opt.";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
562 $self->_write_line_EMBL_regex("DR ", "DR ", $line, '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
563 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
564 $self->_print("XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
565 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
566
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
567 # Comment lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
568 foreach my $comment ( $seq->annotation->get_Annotations('comment') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
569 $self->_write_line_EMBL_regex("CC ", "CC ", $comment->text, '\s+|$', 80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
570 $self->_print("XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
571 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
572 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
573 # "\\s\+\|\$"
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
574
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
575 ## FEATURE TABLE
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
576
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
577 $self->_print("FH Key Location/Qualifiers\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
578 $self->_print("FH\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
579
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
580 my @feats = $seq->can('top_SeqFeatures') ? $seq->top_SeqFeatures : ();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
581 if ($feats[0]) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
582 if( defined $self->_post_sort ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
583 # we need to read things into an array.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
584 # Process. Sort them. Print 'em
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
585
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
586 my $post_sort_func = $self->_post_sort();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
587 my @fth;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
588
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
589 foreach my $sf ( @feats ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
590 push(@fth,Bio::SeqIO::FTHelper::from_SeqFeature($sf,$seq));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
591 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
592
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
593 @fth = sort { &$post_sort_func($a,$b) } @fth;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
594
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
595 foreach my $fth ( @fth ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
596 $self->_print_EMBL_FTHelper($fth);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
597 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
598 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
599 # not post sorted. And so we can print as we get them.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
600 # lower memory load...
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
601
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
602 foreach my $sf ( @feats ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
603 my @fth = Bio::SeqIO::FTHelper::from_SeqFeature($sf,$seq);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
604 foreach my $fth ( @fth ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
605 if( $fth->key eq 'CONTIG') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
606 $self->_show_dna(0);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
607 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
608 $self->_print_EMBL_FTHelper($fth);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
609 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
610 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
611 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
612 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
613
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
614 if( $self->_show_dna() == 0 ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
615 $self->_print( "//\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
616 return;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
617 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
618 $self->_print( "XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
619
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
620 # finished printing features.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
621
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
622 $str =~ tr/A-Z/a-z/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
623
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
624 # Count each nucleotide
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
625 my $alen = $str =~ tr/a/a/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
626 my $clen = $str =~ tr/c/c/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
627 my $glen = $str =~ tr/g/g/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
628 my $tlen = $str =~ tr/t/t/;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
629
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
630 my $olen = $len - ($alen + $tlen + $clen + $glen);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
631 if( $olen < 0 ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
632 $self->warn("Weird. More atgc than bases. Problem!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
633 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
634
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
635 $self->_print("SQ Sequence $len BP; $alen A; $clen C; $glen G; $tlen T; $olen other;\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
636
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
637 my $nuc = 60; # Number of nucleotides per line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
638 my $whole_pat = 'a10' x 6; # Pattern for unpacking a whole line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
639 my $out_pat = 'A11' x 6; # Pattern for packing a line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
640 my $length = length($str);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
641
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
642 # Calculate the number of nucleotides which fit on whole lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
643 my $whole = int($length / $nuc) * $nuc;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
644
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
645 # Print the whole lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
646 my( $i );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
647 for ($i = 0; $i < $whole; $i += $nuc) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
648 my $blocks = pack $out_pat,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
649 unpack $whole_pat,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
650 substr($str, $i, $nuc);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
651 $self->_print(sprintf(" $blocks%9d\n", $i + $nuc));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
652 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
653
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
654 # Print the last line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
655 if (my $last = substr($str, $i)) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
656 my $last_len = length($last);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
657 my $last_pat = 'a10' x int($last_len / 10) .'a'. $last_len % 10;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
658 my $blocks = pack $out_pat,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
659 unpack($last_pat, $last);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
660 $self->_print(sprintf(" $blocks%9d\n", $length)); # Add the length to the end
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
661 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
662
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
663 $self->_print( "//\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
664
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
665 $self->flush if $self->_flush_on_write && defined $self->_fh;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
666 return 1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
667 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
668 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
669
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
670 =head2 _print_EMBL_FTHelper
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
671
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
672 Title : _print_EMBL_FTHelper
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
673 Usage :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
674 Function: Internal function
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
675 Returns :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
676 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
677
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
678
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
679 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
680
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
681 sub _print_EMBL_FTHelper {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
682 my ($self,$fth,$always_quote) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
683 $always_quote ||= 0;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
684
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
685 if( ! ref $fth || ! $fth->isa('Bio::SeqIO::FTHelper') ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
686 $fth->warn("$fth is not a FTHelper class. Attempting to print, but there could be tears!");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
687 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
688
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
689
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
690 #$self->_print( "FH Key Location/Qualifiers\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
691 #$self->_print( sprintf("FT %-15s %s\n",$fth->key,$fth->loc));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
692 # let
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
693 if( $fth->key eq 'CONTIG' ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
694 $self->_print("XX\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
695 $self->_write_line_EMBL_regex("CO ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
696 "CO ",$fth->loc,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
697 '\,|$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
698 return;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
699 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
700 $self->_write_line_EMBL_regex(sprintf("FT %-15s ",$fth->key),
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
701 "FT ",$fth->loc,
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
702 '\,|$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
703 foreach my $tag ( keys %{$fth->field} ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
704 if( ! defined $fth->field->{$tag} ) { next; }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
705 foreach my $value ( @{$fth->field->{$tag}} ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
706 $value =~ s/\"/\"\"/g;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
707 if ($value eq "_no_value") {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
708 $self->_write_line_EMBL_regex("FT ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
709 "FT ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
710 "/$tag",'.|$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
711 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
712 elsif( $always_quote == 1 || $value !~ /^\d+$/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
713 my $pat = $value =~ /\s/ ? '\s|$' : '.|$';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
714 $self->_write_line_EMBL_regex("FT ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
715 "FT ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
716 "/$tag=\"$value\"",$pat,80);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
717 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
718 else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
719 $self->_write_line_EMBL_regex("FT ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
720 "FT ",
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
721 "/$tag=$value",'.|$',80); #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
722 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
723 # $self->_print( "FT /", $tag, "=\"", $value, "\"\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
724 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
725 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
726 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
727
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
728 #'
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
729 =head2 _read_EMBL_References
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
730
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
731 Title : _read_EMBL_References
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
732 Usage :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
733 Function: Reads references from EMBL format. Internal function really
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
734 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
735 Returns :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
736 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
737
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
738
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
739 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
740
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
741 sub _read_EMBL_References {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
742 my ($self,$buffer) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
743 my (@refs);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
744
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
745 # assumme things are starting with RN
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
746
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
747 if( $$buffer !~ /^RN/ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
748 warn("Not parsing line '$$buffer' which maybe important");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
749 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
750 my $b1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
751 my $b2;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
752 my $title;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
753 my $loc;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
754 my $au;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
755 my $med;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
756 my $pm;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
757 my $com;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
758
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
759 while( defined ($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
760 /^R/ || last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
761 /^RP (\d+)-(\d+)/ && do {$b1=$1;$b2=$2;};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
762 /^RX MEDLINE;\s+(\d+)/ && do {$med=$1};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
763 /^RX PUBMED;\s+(\d+)/ && do {$pm=$1};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
764 /^RA (.*)/ && do {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
765 $au = $self->_concatenate_lines($au,$1); next;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
766 };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
767 /^RT (.*)/ && do {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
768 $title = $self->_concatenate_lines($title,$1); next;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
769 };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
770 /^RL (.*)/ && do {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
771 $loc = $self->_concatenate_lines($loc,$1); next;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
772 };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
773 /^RC (.*)/ && do {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
774 $com = $self->_concatenate_lines($com,$1); next;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
775 };
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
776 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
777
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
778 my $ref = new Bio::Annotation::Reference;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
779 $au =~ s/;\s*$//g;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
780 $title =~ s/;\s*$//g;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
781
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
782 $ref->start($b1);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
783 $ref->end($b2);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
784 $ref->authors($au);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
785 $ref->title($title);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
786 $ref->location($loc);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
787 $ref->medline($med);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
788 $ref->comment($com);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
789 $ref->pubmed($pm);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
790
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
791 push(@refs,$ref);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
792 $$buffer = $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
793
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
794 return @refs;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
795 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
796
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
797 =head2 _read_EMBL_Species
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
798
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
799 Title : _read_EMBL_Species
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
800 Usage :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
801 Function: Reads the EMBL Organism species and classification
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
802 lines.
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
803 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
804 Returns : A Bio::Species object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
805 Args : a reference to the current line buffer
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
806
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
807 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
808
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
809 sub _read_EMBL_Species {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
810 my( $self, $buffer ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
811 my $org;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
812
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
813 $_ = $$buffer;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
814 my( $sub_species, $species, $genus, $common, @class );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
815 while (defined( $_ ||= $self->_readline )) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
816
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
817 if (/^OS\s+(\S+)(?:\s+([^\(]\S*))?(?:\s+([^\(]\S*))?(?:\s+\((.*)\))?/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
818 $genus = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
819 $species = $2 || 'sp.';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
820 $sub_species = $3 if $3;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
821 $common = $4 if $4;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
822 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
823 elsif (s/^OC\s+//) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
824 # only split on ';' or '.' so that
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
825 # classification that is 2 words will
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
826 # still get matched
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
827 # use map to remove trailing/leading spaces
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
828 chomp;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
829 push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
830 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
831 elsif (/^OG\s+(.*)/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
832 $org = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
833 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
834 else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
835 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
836 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
837
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
838 $_ = undef; # Empty $_ to trigger read of next line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
839 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
840
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
841 $$buffer = $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
842
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
843 # Don't make a species object if it is "Unknown" or "None"
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
844 return if $genus =~ /^(Unknown|None)$/i;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
845
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
846 # Bio::Species array needs array in Species -> Kingdom direction
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
847 if ($class[$#class] eq $genus) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
848 push( @class, $species );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
849 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
850 push( @class, $genus, $species );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
851 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
852 @class = reverse @class;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
853
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
854 my $make = Bio::Species->new();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
855 $make->classification( \@class, "FORCE" ); # no name validation please
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
856 $make->common_name( $common ) if $common;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
857 $make->sub_species( $sub_species ) if $sub_species;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
858 $make->organelle ( $org ) if $org;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
859 return $make;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
860 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
861
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
862 =head2 _read_EMBL_DBLink
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
863
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
864 Title : _read_EMBL_DBLink
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
865 Usage :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
866 Function: Reads the EMBL database cross reference ("DR") lines
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
867 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
868 Returns : A list of Bio::Annotation::DBLink objects
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
869 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
870
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
871 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
872
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
873 sub _read_EMBL_DBLink {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
874 my( $self,$buffer ) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
875 my( @db_link );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
876
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
877 $_ = $$buffer;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
878 while (defined( $_ ||= $self->_readline )) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
879
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
880 if (my($databse, $prim_id, $sec_id)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
881 = /^DR ([^\s;]+);\s*([^\s;]+);\s*([^\s;]+)?\.$/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
882 my $link = Bio::Annotation::DBLink->new();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
883 $link->database ( $databse );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
884 $link->primary_id ( $prim_id );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
885 $link->optional_id( $sec_id ) if $sec_id;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
886 push(@db_link, $link);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
887 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
888 else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
889 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
890 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
891
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
892 $_ = undef; # Empty $_ to trigger read of next line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
893 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
894
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
895 $$buffer = $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
896
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
897 return @db_link;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
898 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
899
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
900 =head2 _filehandle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
901
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
902 Title : _filehandle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
903 Usage : $obj->_filehandle($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
904 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
905 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
906 Returns : value of _filehandle
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
907 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
908
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
909
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
910 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
911
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
912 sub _filehandle{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
913 my ($obj,$value) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
914 if( defined $value) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
915 $obj->{'_filehandle'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
916 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
917 return $obj->{'_filehandle'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
918
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
919 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
920
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
921 =head2 _read_FTHelper_EMBL
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
922
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
923 Title : _read_FTHelper_EMBL
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
924 Usage : _read_FTHelper_EMBL($buffer)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
925 Function: reads the next FT key line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
926 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
927 Returns : Bio::SeqIO::FTHelper object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
928 Args : filehandle and reference to a scalar
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
929
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
930
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
931 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
932
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
933 sub _read_FTHelper_EMBL {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
934 my ($self,$buffer) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
935
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
936 my ($key, # The key of the feature
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
937 $loc, # The location line from the feature
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
938 @qual, # An arrray of lines making up the qualifiers
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
939 );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
940
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
941 if ($$buffer =~ /^FT\s{3}(\S+)\s+(\S+)/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
942 $key = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
943 $loc = $2;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
944 # Read all the lines up to the next feature
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
945 while ( defined($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
946 if (/^FT(\s+)(.+?)\s*$/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
947 # Lines inside features are preceeded by 19 spaces
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
948 # A new feature is preceeded by 3 spaces
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
949 if (length($1) > 4) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
950 # Add to qualifiers if we're in the qualifiers
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
951 if (@qual) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
952 push(@qual, $2);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
953 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
954 # Start the qualifier list if it's the first qualifier
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
955 elsif (substr($2, 0, 1) eq '/') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
956 @qual = ($2);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
957 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
958 # We're still in the location line, so append to location
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
959 else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
960 $loc .= $2;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
961 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
962 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
963 # We've reached the start of the next feature
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
964 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
965 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
966 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
967 # We're at the end of the feature table
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
968 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
969 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
970 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
971 } elsif( $$buffer =~ /^CO\s+(\S+)/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
972 $key = 'CONTIG';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
973 $loc = $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
974 # Read all the lines up to the next feature
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
975 while ( defined($_ = $self->_readline) ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
976 if (/^CO\s+(\S+)\s*$/) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
977 $loc .= $1;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
978 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
979 # We've reached the start of the next feature
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
980 last;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
981 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
982 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
983 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
984 # No feature key
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
985 return;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
986 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
987
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
988 # Put the first line of the next feature into the buffer
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
989 $$buffer = $_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
990
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
991 # Make the new FTHelper object
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
992 my $out = new Bio::SeqIO::FTHelper();
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
993 $out->verbose($self->verbose());
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
994 $out->key($key);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
995 $out->loc($loc);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
996
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
997 # Now parse and add any qualifiers. (@qual is kept
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
998 # intact to provide informative error messages.)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
999 QUAL: for (my $i = 0; $i < @qual; $i++) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1000 $_ = $qual[$i];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1001 my( $qualifier, $value ) = m{^/([^=]+)(?:=(.+))?}
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1002 or $self->throw("Can't see new qualifier in: $_\nfrom:\n"
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1003 . join('', map "$_\n", @qual));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1004 if (defined $value) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1005 # Do we have a quoted value?
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1006 if (substr($value, 0, 1) eq '"') {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1007 # Keep adding to value until we find the trailing quote
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1008 # and the quotes are balanced
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1009 while ($value !~ /"$/ or $value =~ tr/"/"/ % 2) { #"
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1010 $i++;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1011 my $next = $qual[$i];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1012 unless (defined($next)) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1013 warn("Unbalanced quote in:\n", map("$_\n", @qual),
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1014 "No further qualifiers will be added for this feature");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1015 last QUAL;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1016 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1017
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1018 # Join to value with space if value or next line contains a space
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1019 $value .= (grep /\s/, ($value, $next)) ? " $next" : $next;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1020 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1021 # Trim leading and trailing quotes
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1022 $value =~ s/^"|"$//g;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1023 # Undouble internal quotes
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1024 $value =~ s/""/"/g; #"
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1025 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1026 } else {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1027 $value = '_no_value';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1028 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1029
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1030 # Store the qualifier
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1031 $out->field->{$qualifier} ||= [];
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1032 push(@{$out->field->{$qualifier}},$value);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1033 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1034
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1035 return $out;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1036 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1037
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1038 =head2 _write_line_EMBL
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1039
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1040 Title : _write_line_EMBL
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1041 Usage :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1042 Function: internal function
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1043 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1044 Returns :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1045 Args :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1046
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1047
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1048 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1049
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1050 sub _write_line_EMBL {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1051 my ($self,$pre1,$pre2,$line,$length) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1052
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1053 $length || die "Miscalled write_line_EMBL without length. Programming error!";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1054 my $subl = $length - length $pre2;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1055 my $linel = length $line;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1056 my $i;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1057
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1058 my $sub = substr($line,0,$length - length $pre1);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1059
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1060 $self->_print( "$pre1$sub\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1061
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1062 for($i= ($length - length $pre1);$i < $linel;) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1063 $sub = substr($line,$i,($subl));
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1064 $self->_print( "$pre2$sub\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1065 $i += $subl;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1066 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1067
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1068 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1069
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1070 =head2 _write_line_EMBL_regex
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1071
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1072 Title : _write_line_EMBL_regex
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1073 Usage :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1074 Function: internal function for writing lines of specified
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1075 length, with different first and the next line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1076 left hand headers and split at specific points in the
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1077 text
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1078 Example :
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1079 Returns : nothing
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1080 Args : file handle, first header, second header, text-line, regex for line breaks, total line length
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1081
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1082
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1083 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1084
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1085 sub _write_line_EMBL_regex {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1086 my ($self,$pre1,$pre2,$line,$regex,$length) = @_;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1087
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1088 #print STDOUT "Going to print with $line!\n";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1089
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1090 $length || die "Programming error - called write_line_EMBL_regex without length.";
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1091
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1092 my $subl = $length - (length $pre1) -1 ;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1093
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1094
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1095
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1096 my( @lines );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1097 while(defined $line &&
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1098 $line =~ m/(.{1,$subl})($regex)/g) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1099 push(@lines, $1.$2);
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1100 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1101 foreach (@lines) { s/\s+$//; }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1102
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1103 # Print first line
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1104 my $s = shift(@lines) || '';
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1105 $self->_print( "$pre1$s\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1106
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1107 # Print the rest
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1108 foreach my $s ( @lines ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1109 $s = '' if( !defined $s );
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1110 $self->_print( "$pre2$s\n");
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1111 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1112 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1113
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1114 =head2 _post_sort
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1115
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1116 Title : _post_sort
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1117 Usage : $obj->_post_sort($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1118 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1119 Returns : value of _post_sort
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1120 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1121
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1122
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1123 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1124
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1125 sub _post_sort{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1126 my $obj = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1127 if( @_ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1128 my $value = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1129 $obj->{'_post_sort'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1130 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1131 return $obj->{'_post_sort'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1132
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1133 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1134
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1135 =head2 _show_dna
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1136
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1137 Title : _show_dna
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1138 Usage : $obj->_show_dna($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1139 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1140 Returns : value of _show_dna
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1141 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1142
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1143
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1144 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1145
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1146 sub _show_dna{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1147 my $obj = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1148 if( @_ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1149 my $value = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1150 $obj->{'_show_dna'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1151 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1152 return $obj->{'_show_dna'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1153
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1154 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1155
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1156 =head2 _id_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1157
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1158 Title : _id_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1159 Usage : $obj->_id_generation_func($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1160 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1161 Returns : value of _id_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1162 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1163
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1164
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1165 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1166
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1167 sub _id_generation_func{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1168 my $obj = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1169 if( @_ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1170 my $value = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1171 $obj->{'_id_generation_func'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1172 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1173 return $obj->{'_id_generation_func'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1174
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1175 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1176
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1177 =head2 _ac_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1178
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1179 Title : _ac_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1180 Usage : $obj->_ac_generation_func($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1181 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1182 Returns : value of _ac_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1183 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1184
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1185
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1186 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1187
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1188 sub _ac_generation_func{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1189 my $obj = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1190 if( @_ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1191 my $value = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1192 $obj->{'_ac_generation_func'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1193 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1194 return $obj->{'_ac_generation_func'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1195
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1196 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1197
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1198 =head2 _sv_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1199
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1200 Title : _sv_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1201 Usage : $obj->_sv_generation_func($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1202 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1203 Returns : value of _sv_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1204 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1205
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1206
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1207 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1208
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1209 sub _sv_generation_func{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1210 my $obj = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1211 if( @_ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1212 my $value = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1213 $obj->{'_sv_generation_func'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1214 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1215 return $obj->{'_sv_generation_func'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1216
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1217 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1218
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1219 =head2 _kw_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1220
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1221 Title : _kw_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1222 Usage : $obj->_kw_generation_func($newval)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1223 Function:
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1224 Returns : value of _kw_generation_func
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1225 Args : newvalue (optional)
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1226
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1227
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1228 =cut
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1229
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1230 sub _kw_generation_func{
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1231 my $obj = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1232 if( @_ ) {
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1233 my $value = shift;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1234 $obj->{'_kw_generation_func'} = $value;
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1235 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1236 return $obj->{'_kw_generation_func'};
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1237
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1238 }
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1239
21066c0abaf5 Uploaded
willmclaren
parents:
diff changeset
1240 1;