annotate variant_effect_predictor/Bio/SeqIO/embl.pm @ 0:2bc9b66ada89 draft default tip

Uploaded
author mahtabm
date Thu, 11 Apr 2013 06:29:17 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1 # $Id: embl.pm,v 1.57.2.6 2003/09/14 19:06:51 jason Exp $
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
2 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
3 # BioPerl module for Bio::SeqIO::EMBL
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
4 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
5 # Cared for by Ewan Birney <birney@ebi.ac.uk>
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
6 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
7 # Copyright Ewan Birney
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
8 #
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
9 # You may distribute this module under the same terms as perl itself
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
10
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
11 # POD documentation - main docs before the code
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
12
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
13 =head1 NAME
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
14
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
15 Bio::SeqIO::embl - EMBL sequence input/output stream
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
16
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
17 =head1 SYNOPSIS
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
18
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
19 It is probably best not to use this object directly, but
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
20 rather go through the AnnSeqIO handler system. Go:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
21
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
22 $stream = Bio::SeqIO->new(-file => $filename, -format => 'EMBL');
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
23
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
24 while ( (my $seq = $stream->next_seq()) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
25 # do something with $seq
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
26 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
27
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
28 =head1 DESCRIPTION
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
29
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
30 This object can transform Bio::Seq objects to and from EMBL flat
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
31 file databases.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
32
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
33 There is alot of flexibility here about how to dump things which I need
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
34 to document fully.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
35
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
36 There should be a common object that this and genbank share (probably
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
37 with swissprot). Too much of the magic is identical.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
38
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
39 =head2 Optional functions
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
40
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
41 =over 3
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
42
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
43 =item _show_dna()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
44
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
45 (output only) shows the dna or not
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
46
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
47 =item _post_sort()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
48
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
49 (output only) provides a sorting func which is applied to the FTHelpers
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
50 before printing
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
51
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
52 =item _id_generation_func()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
53
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
54 This is function which is called as
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
55
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
56 print "ID ", $func($annseq), "\n";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
57
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
58 To generate the ID line. If it is not there, it generates a sensible ID
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
59 line using a number of tools.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
60
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
61 If you want to output annotations in embl format they need to be
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
62 stored in a Bio::Annotation::Collection object which is accessible
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
63 through the Bio::SeqI interface method L<annotation()|annotation>.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
64
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
65 The following are the names of the keys which are polled from a
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
66 L<Bio::Annotation::Collection> object.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
67
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
68 reference - Should contain Bio::Annotation::Reference objects
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
69 comment - Should contain Bio::Annotation::Comment objects
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
70 dblink - Should contain Bio::Annotation::DBLink objects
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
71
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
72 =back
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
73
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
74 =head1 FEEDBACK
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
75
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
76 =head2 Mailing Lists
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
77
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
78 User feedback is an integral part of the evolution of this
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
79 and other Bioperl modules. Send your comments and suggestions preferably
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
80 to one of the Bioperl mailing lists.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
81 Your participation is much appreciated.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
82
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
83 bioperl-l@bioperl.org - General discussion
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
84 http://www.bioperl.org/MailList.shtml - About the mailing lists
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
85
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
86 =head2 Reporting Bugs
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
87
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
88 Report bugs to the Bioperl bug tracking system to help us keep track
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
89 the bugs and their resolution.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
90 Bug reports can be submitted via email or the web:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
91
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
92 bioperl-bugs@bio.perl.org
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
93 http://bugzilla.bioperl.org/
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
94
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
95 =head1 AUTHOR - Ewan Birney
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
96
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
97 Email birney@ebi.ac.uk
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
98
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
99 Describe contact details here
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
100
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
101 =head1 APPENDIX
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
102
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
103 The rest of the documentation details each of the object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
104 methods. Internal methods are usually preceded with a _
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
105
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
106 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
107
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
108
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
109 # Let the code begin...
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
110
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
111
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
112 package Bio::SeqIO::embl;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
113 use vars qw(@ISA);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
114 use strict;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
115 use Bio::SeqIO::FTHelper;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
116 use Bio::SeqFeature::Generic;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
117 use Bio::Species;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
118 use Bio::Seq::SeqFactory;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
119 use Bio::Annotation::Collection;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
120 use Bio::Annotation::Comment;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
121 use Bio::Annotation::Reference;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
122 use Bio::Annotation::DBLink;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
123
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
124 @ISA = qw(Bio::SeqIO);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
125
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
126 sub _initialize {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
127 my($self,@args) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
128
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
129 $self->SUPER::_initialize(@args);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
130 # hash for functions for decoding keys.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
131 $self->{'_func_ftunit_hash'} = {};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
132 $self->_show_dna(1); # sets this to one by default. People can change it
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
133 if( ! defined $self->sequence_factory ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
134 $self->sequence_factory(new Bio::Seq::SeqFactory
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
135 (-verbose => $self->verbose(),
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
136 -type => 'Bio::Seq::RichSeq'));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
137 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
138 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
139
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
140 =head2 next_seq
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
141
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
142 Title : next_seq
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
143 Usage : $seq = $stream->next_seq()
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
144 Function: returns the next sequence in the stream
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
145 Returns : Bio::Seq object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
146 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
147
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
148 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
149
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
150 sub next_seq {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
151 my ($self,@args) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
152 my ($pseq,$c,$line,$name,$desc,$acc,$seqc,$mol,$div,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
153 $date, $comment, @date_arr);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
154
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
155 my ($annotation, %params, @features) = ( new Bio::Annotation::Collection);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
156
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
157 $line = $self->_readline; # This needs to be before the first eof() test
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
158
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
159 if( !defined $line ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
160 return undef; # no throws - end of file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
161 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
162
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
163 if( $line =~ /^\s+$/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
164 while( defined ($line = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
165 $line =~/^\S/ && last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
166 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
167 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
168 if( !defined $line ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
169 return undef; # end of file
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
170 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
171 $line =~ /^ID\s+\S+/ || $self->throw("EMBL stream with no ID. Not embl in my book");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
172 $line =~ /^ID\s+(\S+)\s+\S+\;\s+([^;]+)\;\s+(\S+)\;/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
173 $name = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
174 $mol = $2;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
175 $div = $3;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
176 if(! $name) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
177 $name = "unknown id";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
178 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
179 my $alphabet;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
180
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
181 # this is important to have the id for display in e.g. FTHelper, otherwise
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
182 # you won't know which entry caused an error
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
183 if($mol) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
184 if ( $mol =~ /circular/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
185 $params{'-is_circular'} = 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
186 $mol =~ s|circular ||;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
187 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
188 if (defined $mol ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
189 if ($mol =~ /DNA/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
190 $alphabet='dna';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
191 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
192 elsif ($mol =~ /RNA/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
193 $alphabet='rna';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
194 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
195 elsif ($mol =~ /AA/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
196 $alphabet='protein';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
197 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
198 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
199
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
200 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
201
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
202 # $self->warn("not parsing upper annotation in EMBL file yet!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
203 my $buffer = $line;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
204
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
205 local $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
206
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
207 BEFORE_FEATURE_TABLE :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
208 until( !defined $buffer ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
209 $_ = $buffer;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
210
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
211 # Exit at start of Feature table
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
212 last if /^F[HT]/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
213
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
214 # Description line(s)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
215 if (/^DE\s+(\S.*\S)/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
216 $desc .= $desc ? " $1" : $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
217 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
218
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
219 #accession number
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
220 if( /^AC\s+(.*)?/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
221 my @accs = split(/[; ]+/, $1); # allow space in addition
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
222 $params{'-accession_number'} = shift @accs
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
223 unless defined $params{'-accession_number'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
224 push @{$params{'-secondary_accessions'}}, @accs;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
225 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
226
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
227 #version number
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
228 if( /^SV\s+\S+\.(\d+);?/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
229 my $sv = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
230 #$sv =~ s/\;//;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
231 $params{'-seq_version'} = $sv;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
232 $params{'-version'} = $sv;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
233 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
234
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
235 #date (NOTE: takes last date line)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
236 if( /^DT\s+(.+)$/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
237 my $date = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
238 push @{$params{'-dates'}}, $date;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
239 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
240
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
241 #keywords
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
242 if( /^KW\s+(.*)\S*$/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
243 my @kw = split(/\s*\;\s*/,$1);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
244 push @{$params{'-keywords'}}, @kw;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
245 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
246
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
247 # Organism name and phylogenetic information
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
248 elsif (/^O[SC]/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
249 my $species = $self->_read_EMBL_Species(\$buffer);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
250 $params{'-species'}= $species;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
251 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
252
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
253 # References
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
254 elsif (/^R/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
255 my @refs = $self->_read_EMBL_References(\$buffer);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
256 foreach my $ref ( @refs ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
257 $annotation->add_Annotation('reference',$ref);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
258 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
259 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
260
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
261 # DB Xrefs
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
262 elsif (/^DR/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
263 my @links = $self->_read_EMBL_DBLink(\$buffer);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
264 foreach my $dblink ( @links ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
265 $annotation->add_Annotation('dblink',$dblink);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
266 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
267 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
268
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
269 # Comments
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
270 elsif (/^CC\s+(.*)/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
271 $comment .= $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
272 $comment .= " ";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
273 while (defined ($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
274 if (/^CC\s+(.*)/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
275 $comment .= $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
276 $comment .= " ";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
277 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
278 else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
279 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
280 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
281 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
282 my $commobj = Bio::Annotation::Comment->new();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
283 $commobj->text($comment);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
284 $annotation->add_Annotation('comment',$commobj);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
285 $comment = "";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
286 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
287
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
288 # Get next line.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
289 $buffer = $self->_readline;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
290 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
291
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
292 while( defined ($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
293 /^FT \w/ && last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
294 /^SQ / && last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
295 /^CO / && last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
296 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
297 $buffer = $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
298
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
299 if (defined($buffer) && $buffer =~ /^FT /) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
300 until( !defined ($buffer) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
301 my $ftunit = $self->_read_FTHelper_EMBL(\$buffer);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
302 # process ftunit
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
303
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
304 push(@features,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
305 $ftunit->_generic_seqfeature($self->location_factory(), $name));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
306
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
307 if( $buffer !~ /^FT/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
308 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
309 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
310 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
311 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
312
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
313
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
314 # skip comments
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
315 while( defined ($buffer) && $buffer =~ /^XX/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
316 $buffer = $self->_readline();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
317 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
318
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
319 if( $buffer =~ /^CO/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
320 until( !defined ($buffer) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
321 my $ftunit = $self->_read_FTHelper_EMBL(\$buffer);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
322 # process ftunit
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
323 push(@features,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
324 $ftunit->_generic_seqfeature($self->location_factory(),
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
325 $name));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
326
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
327 if( $buffer !~ /^CO/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
328 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
329 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
330 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
331 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
332 if( $buffer !~ /^SQ/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
333 while( defined ($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
334 /^SQ/ && last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
335 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
336 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
337
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
338 $seqc = "";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
339 while( defined ($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
340 /^\/\// && last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
341 $_ = uc($_);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
342 s/[^A-Za-z]//g;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
343 $seqc .= $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
344 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
345 my $seq = $self->sequence_factory->create
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
346 (-verbose => $self->verbose(),
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
347 -division => $div,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
348 -seq => $seqc,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
349 -desc => $desc,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
350 -display_id => $name,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
351 -annotation => $annotation,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
352 -molecule => $mol,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
353 -alphabet => $alphabet,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
354 -features => \@features,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
355 %params);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
356
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
357 return $seq;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
358 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
359
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
360 =head2 write_seq
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
361
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
362 Title : write_seq
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
363 Usage : $stream->write_seq($seq)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
364 Function: writes the $seq object (must be seq) to the stream
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
365 Returns : 1 for success and 0 for error
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
366 Args : array of 1 to n Bio::SeqI objects
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
367
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
368
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
369 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
370
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
371 sub write_seq {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
372 my ($self,@seqs) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
373
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
374 foreach my $seq ( @seqs ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
375 $self->throw("Attempting to write with no seq!") unless defined $seq;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
376 unless ( ref $seq && $seq->isa('Bio::SeqI' ) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
377 $self->warn("$seq is not a SeqI compliant sequence object!")
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
378 if $self->verbose >= 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
379 unless ( ref $seq && $seq->isa('Bio::PrimarySeqI' ) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
380 $self->throw("$seq is not a PrimarySeqI compliant sequence object!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
381 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
382 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
383 my $str = $seq->seq || '';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
384
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
385 my $mol;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
386 my $div;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
387 my $len = $seq->length();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
388
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
389 if ($seq->can('division') && defined $seq->division) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
390 $div = $seq->division();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
391 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
392 $div ||= 'UNK';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
393
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
394 if ($seq->can('molecule')) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
395 $mol = $seq->molecule();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
396 $mol = 'RNA' if defined $mol && $mol =~ /RNA/; # no 'mRNA'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
397 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
398 elsif ($seq->can('primary_seq') && defined $seq->primary_seq->alphabet) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
399 my $alphabet =$seq->primary_seq->alphabet;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
400 if ($alphabet eq 'dna') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
401 $mol ='DNA';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
402 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
403 elsif ($alphabet eq 'rna') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
404 $mol='RNA';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
405 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
406 elsif ($alphabet eq 'protein') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
407 $mol='AA';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
408 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
409 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
410 $mol ||= 'XXX';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
411 $mol = "circular $mol" if $seq->is_circular;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
412
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
413 my $temp_line;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
414 if( $self->_id_generation_func ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
415 $temp_line = &{$self->_id_generation_func}($seq);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
416 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
417 $temp_line = sprintf("%-11sstandard; $mol; $div; %d BP.", $seq->id(), $len);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
418 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
419
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
420 $self->_print( "ID $temp_line\n","XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
421
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
422 # Write the accession line if present
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
423 my( $acc );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
424 {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
425 if( my $func = $self->_ac_generation_func ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
426 $acc = &{$func}($seq);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
427 } elsif( $seq->isa('Bio::Seq::RichSeqI') &&
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
428 defined($seq->accession_number) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
429 $acc = $seq->accession_number;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
430 $acc = join(";", $acc, $seq->get_secondary_accessions);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
431 } elsif ( $seq->can('accession_number') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
432 $acc = $seq->accession_number;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
433 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
434
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
435 if (defined $acc) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
436 $self->_print("AC $acc;\n",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
437 "XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
438 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
439 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
440
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
441 # Write the sv line if present
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
442 {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
443 my( $sv );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
444 if (my $func = $self->_sv_generation_func) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
445 $sv = &{$func}($seq);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
446 } elsif($seq->isa('Bio::Seq::RichSeqI') &&
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
447 defined($seq->seq_version)) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
448 $sv = "$acc.". $seq->seq_version();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
449 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
450 if (defined $sv) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
451 $self->_print( "SV $sv\n",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
452 "XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
453 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
454 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
455
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
456 # Date lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
457 my $switch=0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
458 if( $seq->can('get_dates') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
459 foreach my $dt ( $seq->get_dates() ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
460 $self->_write_line_EMBL_regex("DT ","DT ",$dt,'\s+|$',80);#'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
461 $switch=1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
462 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
463 if ($switch == 1) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
464 $self->_print("XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
465 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
466 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
467
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
468 # Description lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
469 $self->_write_line_EMBL_regex("DE ","DE ",$seq->desc(),'\s+|$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
470 $self->_print( "XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
471
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
472 # if there, write the kw line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
473 {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
474 my( $kw );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
475 if( my $func = $self->_kw_generation_func ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
476 $kw = &{$func}($seq);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
477 } elsif( $seq->can('keywords') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
478 $kw = $seq->keywords;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
479 if( ref($kw) =~ /ARRAY/i ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
480 $kw = join("; ", @$kw);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
481 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
482 $kw .= '.' if( defined $kw && $kw !~ /\.$/ );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
483 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
484 if (defined $kw) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
485 $self->_write_line_EMBL_regex("KW ", "KW ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
486 $kw, '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
487 $self->_print( "XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
488
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
489 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
490 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
491
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
492 # Organism lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
493
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
494 if ($seq->can('species') && (my $spec = $seq->species)) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
495 my($species, @class) = $spec->classification();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
496 my $genus = $class[0];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
497 my $OS = "$genus $species";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
498 if (my $ssp = $spec->sub_species) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
499 $OS .= " $ssp";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
500 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
501 if (my $common = $spec->common_name) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
502 $OS .= " ($common)";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
503 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
504 $self->_print("OS $OS\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
505 my $OC = join('; ', reverse(@class)) .'.';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
506 $self->_write_line_EMBL_regex("OC ","OC ",$OC,'; |$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
507 if ($spec->organelle) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
508 $self->_write_line_EMBL_regex("OG ","OG ",$spec->organelle,'; |$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
509 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
510 $self->_print("XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
511 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
512
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
513 # Reference lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
514 my $t = 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
515 if ( $seq->can('annotation') && defined $seq->annotation ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
516 foreach my $ref ( $seq->annotation->get_Annotations('reference') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
517 $self->_print( "RN [$t]\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
518
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
519 # Having no RP line is legal, but we need both
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
520 # start and end for a valid location.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
521 my $start = $ref->start;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
522 my $end = $ref->end;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
523 if ($start and $end) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
524 $self->_print( "RP $start-$end\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
525 } elsif ($start or $end) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
526 $self->throw("Both start and end are needed for a valid RP line. Got: start='$start' end='$end'");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
527 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
528
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
529 if (my $med = $ref->medline) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
530 $self->_print( "RX MEDLINE; $med.\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
531 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
532 if (my $pm = $ref->pubmed) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
533 $self->_print( "RX PUBMED; $pm.\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
534 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
535 $self->_write_line_EMBL_regex("RA ", "RA ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
536 $ref->authors . ";",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
537 '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
538
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
539 # If there is no title to the reference, it appears
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
540 # as a single semi-colon. All titles must end in
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
541 # a semi-colon.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
542 my $ref_title = $ref->title || '';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
543 $ref_title =~ s/[\s;]*$/;/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
544 $self->_write_line_EMBL_regex("RT ", "RT ", $ref_title, '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
545 $self->_write_line_EMBL_regex("RL ", "RL ", $ref->location, '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
546 if ($ref->comment) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
547 $self->_write_line_EMBL_regex("RC ", "RC ", $ref->comment, '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
548 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
549 $self->_print("XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
550 $t++;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
551 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
552
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
553
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
554 # DB Xref lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
555 if (my @db_xref = $seq->annotation->get_Annotations('dblink') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
556 foreach my $dr (@db_xref) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
557 my $db_name = $dr->database;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
558 my $prim = $dr->primary_id;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
559 my $opt = $dr->optional_id || '';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
560
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
561 my $line = "$db_name; $prim; $opt.";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
562 $self->_write_line_EMBL_regex("DR ", "DR ", $line, '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
563 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
564 $self->_print("XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
565 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
566
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
567 # Comment lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
568 foreach my $comment ( $seq->annotation->get_Annotations('comment') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
569 $self->_write_line_EMBL_regex("CC ", "CC ", $comment->text, '\s+|$', 80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
570 $self->_print("XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
571 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
572 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
573 # "\\s\+\|\$"
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
574
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
575 ## FEATURE TABLE
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
576
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
577 $self->_print("FH Key Location/Qualifiers\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
578 $self->_print("FH\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
579
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
580 my @feats = $seq->can('top_SeqFeatures') ? $seq->top_SeqFeatures : ();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
581 if ($feats[0]) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
582 if( defined $self->_post_sort ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
583 # we need to read things into an array.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
584 # Process. Sort them. Print 'em
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
585
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
586 my $post_sort_func = $self->_post_sort();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
587 my @fth;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
588
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
589 foreach my $sf ( @feats ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
590 push(@fth,Bio::SeqIO::FTHelper::from_SeqFeature($sf,$seq));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
591 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
592
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
593 @fth = sort { &$post_sort_func($a,$b) } @fth;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
594
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
595 foreach my $fth ( @fth ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
596 $self->_print_EMBL_FTHelper($fth);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
597 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
598 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
599 # not post sorted. And so we can print as we get them.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
600 # lower memory load...
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
601
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
602 foreach my $sf ( @feats ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
603 my @fth = Bio::SeqIO::FTHelper::from_SeqFeature($sf,$seq);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
604 foreach my $fth ( @fth ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
605 if( $fth->key eq 'CONTIG') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
606 $self->_show_dna(0);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
607 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
608 $self->_print_EMBL_FTHelper($fth);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
609 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
610 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
611 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
612 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
613
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
614 if( $self->_show_dna() == 0 ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
615 $self->_print( "//\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
616 return;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
617 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
618 $self->_print( "XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
619
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
620 # finished printing features.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
621
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
622 $str =~ tr/A-Z/a-z/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
623
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
624 # Count each nucleotide
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
625 my $alen = $str =~ tr/a/a/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
626 my $clen = $str =~ tr/c/c/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
627 my $glen = $str =~ tr/g/g/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
628 my $tlen = $str =~ tr/t/t/;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
629
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
630 my $olen = $len - ($alen + $tlen + $clen + $glen);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
631 if( $olen < 0 ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
632 $self->warn("Weird. More atgc than bases. Problem!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
633 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
634
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
635 $self->_print("SQ Sequence $len BP; $alen A; $clen C; $glen G; $tlen T; $olen other;\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
636
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
637 my $nuc = 60; # Number of nucleotides per line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
638 my $whole_pat = 'a10' x 6; # Pattern for unpacking a whole line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
639 my $out_pat = 'A11' x 6; # Pattern for packing a line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
640 my $length = length($str);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
641
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
642 # Calculate the number of nucleotides which fit on whole lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
643 my $whole = int($length / $nuc) * $nuc;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
644
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
645 # Print the whole lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
646 my( $i );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
647 for ($i = 0; $i < $whole; $i += $nuc) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
648 my $blocks = pack $out_pat,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
649 unpack $whole_pat,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
650 substr($str, $i, $nuc);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
651 $self->_print(sprintf(" $blocks%9d\n", $i + $nuc));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
652 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
653
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
654 # Print the last line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
655 if (my $last = substr($str, $i)) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
656 my $last_len = length($last);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
657 my $last_pat = 'a10' x int($last_len / 10) .'a'. $last_len % 10;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
658 my $blocks = pack $out_pat,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
659 unpack($last_pat, $last);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
660 $self->_print(sprintf(" $blocks%9d\n", $length)); # Add the length to the end
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
661 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
662
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
663 $self->_print( "//\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
664
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
665 $self->flush if $self->_flush_on_write && defined $self->_fh;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
666 return 1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
667 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
668 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
669
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
670 =head2 _print_EMBL_FTHelper
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
671
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
672 Title : _print_EMBL_FTHelper
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
673 Usage :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
674 Function: Internal function
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
675 Returns :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
676 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
677
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
678
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
679 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
680
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
681 sub _print_EMBL_FTHelper {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
682 my ($self,$fth,$always_quote) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
683 $always_quote ||= 0;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
684
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
685 if( ! ref $fth || ! $fth->isa('Bio::SeqIO::FTHelper') ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
686 $fth->warn("$fth is not a FTHelper class. Attempting to print, but there could be tears!");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
687 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
688
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
689
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
690 #$self->_print( "FH Key Location/Qualifiers\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
691 #$self->_print( sprintf("FT %-15s %s\n",$fth->key,$fth->loc));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
692 # let
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
693 if( $fth->key eq 'CONTIG' ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
694 $self->_print("XX\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
695 $self->_write_line_EMBL_regex("CO ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
696 "CO ",$fth->loc,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
697 '\,|$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
698 return;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
699 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
700 $self->_write_line_EMBL_regex(sprintf("FT %-15s ",$fth->key),
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
701 "FT ",$fth->loc,
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
702 '\,|$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
703 foreach my $tag ( keys %{$fth->field} ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
704 if( ! defined $fth->field->{$tag} ) { next; }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
705 foreach my $value ( @{$fth->field->{$tag}} ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
706 $value =~ s/\"/\"\"/g;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
707 if ($value eq "_no_value") {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
708 $self->_write_line_EMBL_regex("FT ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
709 "FT ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
710 "/$tag",'.|$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
711 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
712 elsif( $always_quote == 1 || $value !~ /^\d+$/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
713 my $pat = $value =~ /\s/ ? '\s|$' : '.|$';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
714 $self->_write_line_EMBL_regex("FT ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
715 "FT ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
716 "/$tag=\"$value\"",$pat,80);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
717 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
718 else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
719 $self->_write_line_EMBL_regex("FT ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
720 "FT ",
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
721 "/$tag=$value",'.|$',80); #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
722 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
723 # $self->_print( "FT /", $tag, "=\"", $value, "\"\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
724 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
725 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
726 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
727
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
728 #'
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
729 =head2 _read_EMBL_References
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
730
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
731 Title : _read_EMBL_References
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
732 Usage :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
733 Function: Reads references from EMBL format. Internal function really
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
734 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
735 Returns :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
736 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
737
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
738
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
739 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
740
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
741 sub _read_EMBL_References {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
742 my ($self,$buffer) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
743 my (@refs);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
744
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
745 # assumme things are starting with RN
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
746
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
747 if( $$buffer !~ /^RN/ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
748 warn("Not parsing line '$$buffer' which maybe important");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
749 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
750 my $b1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
751 my $b2;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
752 my $title;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
753 my $loc;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
754 my $au;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
755 my $med;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
756 my $pm;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
757 my $com;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
758
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
759 while( defined ($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
760 /^R/ || last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
761 /^RP (\d+)-(\d+)/ && do {$b1=$1;$b2=$2;};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
762 /^RX MEDLINE;\s+(\d+)/ && do {$med=$1};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
763 /^RX PUBMED;\s+(\d+)/ && do {$pm=$1};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
764 /^RA (.*)/ && do {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
765 $au = $self->_concatenate_lines($au,$1); next;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
766 };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
767 /^RT (.*)/ && do {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
768 $title = $self->_concatenate_lines($title,$1); next;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
769 };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
770 /^RL (.*)/ && do {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
771 $loc = $self->_concatenate_lines($loc,$1); next;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
772 };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
773 /^RC (.*)/ && do {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
774 $com = $self->_concatenate_lines($com,$1); next;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
775 };
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
776 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
777
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
778 my $ref = new Bio::Annotation::Reference;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
779 $au =~ s/;\s*$//g;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
780 $title =~ s/;\s*$//g;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
781
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
782 $ref->start($b1);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
783 $ref->end($b2);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
784 $ref->authors($au);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
785 $ref->title($title);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
786 $ref->location($loc);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
787 $ref->medline($med);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
788 $ref->comment($com);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
789 $ref->pubmed($pm);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
790
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
791 push(@refs,$ref);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
792 $$buffer = $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
793
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
794 return @refs;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
795 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
796
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
797 =head2 _read_EMBL_Species
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
798
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
799 Title : _read_EMBL_Species
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
800 Usage :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
801 Function: Reads the EMBL Organism species and classification
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
802 lines.
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
803 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
804 Returns : A Bio::Species object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
805 Args : a reference to the current line buffer
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
806
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
807 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
808
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
809 sub _read_EMBL_Species {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
810 my( $self, $buffer ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
811 my $org;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
812
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
813 $_ = $$buffer;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
814 my( $sub_species, $species, $genus, $common, @class );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
815 while (defined( $_ ||= $self->_readline )) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
816
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
817 if (/^OS\s+(\S+)(?:\s+([^\(]\S*))?(?:\s+([^\(]\S*))?(?:\s+\((.*)\))?/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
818 $genus = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
819 $species = $2 || 'sp.';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
820 $sub_species = $3 if $3;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
821 $common = $4 if $4;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
822 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
823 elsif (s/^OC\s+//) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
824 # only split on ';' or '.' so that
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
825 # classification that is 2 words will
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
826 # still get matched
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
827 # use map to remove trailing/leading spaces
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
828 chomp;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
829 push(@class, map { s/^\s+//; s/\s+$//; $_; } split /[;\.]+/);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
830 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
831 elsif (/^OG\s+(.*)/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
832 $org = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
833 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
834 else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
835 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
836 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
837
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
838 $_ = undef; # Empty $_ to trigger read of next line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
839 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
840
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
841 $$buffer = $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
842
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
843 # Don't make a species object if it is "Unknown" or "None"
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
844 return if $genus =~ /^(Unknown|None)$/i;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
845
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
846 # Bio::Species array needs array in Species -> Kingdom direction
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
847 if ($class[$#class] eq $genus) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
848 push( @class, $species );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
849 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
850 push( @class, $genus, $species );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
851 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
852 @class = reverse @class;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
853
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
854 my $make = Bio::Species->new();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
855 $make->classification( \@class, "FORCE" ); # no name validation please
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
856 $make->common_name( $common ) if $common;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
857 $make->sub_species( $sub_species ) if $sub_species;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
858 $make->organelle ( $org ) if $org;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
859 return $make;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
860 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
861
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
862 =head2 _read_EMBL_DBLink
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
863
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
864 Title : _read_EMBL_DBLink
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
865 Usage :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
866 Function: Reads the EMBL database cross reference ("DR") lines
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
867 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
868 Returns : A list of Bio::Annotation::DBLink objects
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
869 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
870
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
871 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
872
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
873 sub _read_EMBL_DBLink {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
874 my( $self,$buffer ) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
875 my( @db_link );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
876
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
877 $_ = $$buffer;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
878 while (defined( $_ ||= $self->_readline )) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
879
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
880 if (my($databse, $prim_id, $sec_id)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
881 = /^DR ([^\s;]+);\s*([^\s;]+);\s*([^\s;]+)?\.$/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
882 my $link = Bio::Annotation::DBLink->new();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
883 $link->database ( $databse );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
884 $link->primary_id ( $prim_id );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
885 $link->optional_id( $sec_id ) if $sec_id;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
886 push(@db_link, $link);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
887 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
888 else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
889 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
890 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
891
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
892 $_ = undef; # Empty $_ to trigger read of next line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
893 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
894
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
895 $$buffer = $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
896
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
897 return @db_link;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
898 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
899
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
900 =head2 _filehandle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
901
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
902 Title : _filehandle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
903 Usage : $obj->_filehandle($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
904 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
905 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
906 Returns : value of _filehandle
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
907 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
908
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
909
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
910 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
911
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
912 sub _filehandle{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
913 my ($obj,$value) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
914 if( defined $value) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
915 $obj->{'_filehandle'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
916 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
917 return $obj->{'_filehandle'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
918
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
919 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
920
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
921 =head2 _read_FTHelper_EMBL
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
922
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
923 Title : _read_FTHelper_EMBL
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
924 Usage : _read_FTHelper_EMBL($buffer)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
925 Function: reads the next FT key line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
926 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
927 Returns : Bio::SeqIO::FTHelper object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
928 Args : filehandle and reference to a scalar
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
929
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
930
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
931 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
932
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
933 sub _read_FTHelper_EMBL {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
934 my ($self,$buffer) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
935
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
936 my ($key, # The key of the feature
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
937 $loc, # The location line from the feature
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
938 @qual, # An arrray of lines making up the qualifiers
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
939 );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
940
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
941 if ($$buffer =~ /^FT\s{3}(\S+)\s+(\S+)/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
942 $key = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
943 $loc = $2;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
944 # Read all the lines up to the next feature
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
945 while ( defined($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
946 if (/^FT(\s+)(.+?)\s*$/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
947 # Lines inside features are preceeded by 19 spaces
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
948 # A new feature is preceeded by 3 spaces
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
949 if (length($1) > 4) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
950 # Add to qualifiers if we're in the qualifiers
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
951 if (@qual) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
952 push(@qual, $2);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
953 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
954 # Start the qualifier list if it's the first qualifier
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
955 elsif (substr($2, 0, 1) eq '/') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
956 @qual = ($2);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
957 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
958 # We're still in the location line, so append to location
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
959 else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
960 $loc .= $2;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
961 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
962 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
963 # We've reached the start of the next feature
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
964 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
965 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
966 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
967 # We're at the end of the feature table
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
968 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
969 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
970 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
971 } elsif( $$buffer =~ /^CO\s+(\S+)/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
972 $key = 'CONTIG';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
973 $loc = $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
974 # Read all the lines up to the next feature
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
975 while ( defined($_ = $self->_readline) ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
976 if (/^CO\s+(\S+)\s*$/) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
977 $loc .= $1;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
978 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
979 # We've reached the start of the next feature
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
980 last;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
981 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
982 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
983 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
984 # No feature key
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
985 return;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
986 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
987
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
988 # Put the first line of the next feature into the buffer
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
989 $$buffer = $_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
990
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
991 # Make the new FTHelper object
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
992 my $out = new Bio::SeqIO::FTHelper();
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
993 $out->verbose($self->verbose());
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
994 $out->key($key);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
995 $out->loc($loc);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
996
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
997 # Now parse and add any qualifiers. (@qual is kept
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
998 # intact to provide informative error messages.)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
999 QUAL: for (my $i = 0; $i < @qual; $i++) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1000 $_ = $qual[$i];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1001 my( $qualifier, $value ) = m{^/([^=]+)(?:=(.+))?}
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1002 or $self->throw("Can't see new qualifier in: $_\nfrom:\n"
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1003 . join('', map "$_\n", @qual));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1004 if (defined $value) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1005 # Do we have a quoted value?
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1006 if (substr($value, 0, 1) eq '"') {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1007 # Keep adding to value until we find the trailing quote
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1008 # and the quotes are balanced
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1009 while ($value !~ /"$/ or $value =~ tr/"/"/ % 2) { #"
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1010 $i++;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1011 my $next = $qual[$i];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1012 unless (defined($next)) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1013 warn("Unbalanced quote in:\n", map("$_\n", @qual),
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1014 "No further qualifiers will be added for this feature");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1015 last QUAL;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1016 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1017
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1018 # Join to value with space if value or next line contains a space
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1019 $value .= (grep /\s/, ($value, $next)) ? " $next" : $next;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1020 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1021 # Trim leading and trailing quotes
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1022 $value =~ s/^"|"$//g;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1023 # Undouble internal quotes
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1024 $value =~ s/""/"/g; #"
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1025 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1026 } else {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1027 $value = '_no_value';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1028 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1029
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1030 # Store the qualifier
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1031 $out->field->{$qualifier} ||= [];
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1032 push(@{$out->field->{$qualifier}},$value);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1033 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1034
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1035 return $out;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1036 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1037
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1038 =head2 _write_line_EMBL
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1039
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1040 Title : _write_line_EMBL
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1041 Usage :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1042 Function: internal function
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1043 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1044 Returns :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1045 Args :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1046
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1047
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1048 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1049
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1050 sub _write_line_EMBL {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1051 my ($self,$pre1,$pre2,$line,$length) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1052
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1053 $length || die "Miscalled write_line_EMBL without length. Programming error!";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1054 my $subl = $length - length $pre2;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1055 my $linel = length $line;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1056 my $i;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1057
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1058 my $sub = substr($line,0,$length - length $pre1);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1059
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1060 $self->_print( "$pre1$sub\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1061
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1062 for($i= ($length - length $pre1);$i < $linel;) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1063 $sub = substr($line,$i,($subl));
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1064 $self->_print( "$pre2$sub\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1065 $i += $subl;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1066 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1067
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1068 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1069
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1070 =head2 _write_line_EMBL_regex
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1071
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1072 Title : _write_line_EMBL_regex
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1073 Usage :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1074 Function: internal function for writing lines of specified
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1075 length, with different first and the next line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1076 left hand headers and split at specific points in the
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1077 text
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1078 Example :
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1079 Returns : nothing
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1080 Args : file handle, first header, second header, text-line, regex for line breaks, total line length
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1081
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1082
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1083 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1084
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1085 sub _write_line_EMBL_regex {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1086 my ($self,$pre1,$pre2,$line,$regex,$length) = @_;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1087
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1088 #print STDOUT "Going to print with $line!\n";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1089
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1090 $length || die "Programming error - called write_line_EMBL_regex without length.";
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1091
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1092 my $subl = $length - (length $pre1) -1 ;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1093
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1094
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1095
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1096 my( @lines );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1097 while(defined $line &&
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1098 $line =~ m/(.{1,$subl})($regex)/g) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1099 push(@lines, $1.$2);
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1100 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1101 foreach (@lines) { s/\s+$//; }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1102
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1103 # Print first line
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1104 my $s = shift(@lines) || '';
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1105 $self->_print( "$pre1$s\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1106
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1107 # Print the rest
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1108 foreach my $s ( @lines ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1109 $s = '' if( !defined $s );
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1110 $self->_print( "$pre2$s\n");
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1111 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1112 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1113
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1114 =head2 _post_sort
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1115
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1116 Title : _post_sort
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1117 Usage : $obj->_post_sort($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1118 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1119 Returns : value of _post_sort
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1120 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1121
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1122
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1123 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1124
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1125 sub _post_sort{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1126 my $obj = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1127 if( @_ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1128 my $value = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1129 $obj->{'_post_sort'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1130 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1131 return $obj->{'_post_sort'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1132
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1133 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1134
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1135 =head2 _show_dna
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1136
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1137 Title : _show_dna
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1138 Usage : $obj->_show_dna($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1139 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1140 Returns : value of _show_dna
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1141 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1142
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1143
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1144 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1145
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1146 sub _show_dna{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1147 my $obj = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1148 if( @_ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1149 my $value = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1150 $obj->{'_show_dna'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1151 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1152 return $obj->{'_show_dna'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1153
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1154 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1155
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1156 =head2 _id_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1157
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1158 Title : _id_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1159 Usage : $obj->_id_generation_func($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1160 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1161 Returns : value of _id_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1162 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1163
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1164
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1165 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1166
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1167 sub _id_generation_func{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1168 my $obj = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1169 if( @_ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1170 my $value = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1171 $obj->{'_id_generation_func'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1172 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1173 return $obj->{'_id_generation_func'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1174
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1175 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1176
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1177 =head2 _ac_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1178
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1179 Title : _ac_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1180 Usage : $obj->_ac_generation_func($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1181 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1182 Returns : value of _ac_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1183 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1184
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1185
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1186 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1187
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1188 sub _ac_generation_func{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1189 my $obj = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1190 if( @_ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1191 my $value = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1192 $obj->{'_ac_generation_func'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1193 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1194 return $obj->{'_ac_generation_func'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1195
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1196 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1197
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1198 =head2 _sv_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1199
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1200 Title : _sv_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1201 Usage : $obj->_sv_generation_func($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1202 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1203 Returns : value of _sv_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1204 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1205
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1206
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1207 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1208
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1209 sub _sv_generation_func{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1210 my $obj = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1211 if( @_ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1212 my $value = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1213 $obj->{'_sv_generation_func'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1214 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1215 return $obj->{'_sv_generation_func'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1216
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1217 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1218
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1219 =head2 _kw_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1220
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1221 Title : _kw_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1222 Usage : $obj->_kw_generation_func($newval)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1223 Function:
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1224 Returns : value of _kw_generation_func
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1225 Args : newvalue (optional)
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1226
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1227
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1228 =cut
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1229
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1230 sub _kw_generation_func{
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1231 my $obj = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1232 if( @_ ) {
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1233 my $value = shift;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1234 $obj->{'_kw_generation_func'} = $value;
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1235 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1236 return $obj->{'_kw_generation_func'};
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1237
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1238 }
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1239
2bc9b66ada89 Uploaded
mahtabm
parents:
diff changeset
1240 1;