annotate test-data/cd_hit_protein_in.fasta @ 11:75fde37f69e5

Add cd-hit to protein fastas
author Jim Johnson <jj@umn.edu>
date Thu, 27 Jun 2013 21:27:06 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
11
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
1 >sp|P00325|ADH1B_HUMAN Alcohol dehydrogenase 1B OS=Homo sapiens GN=ADH1B PE=1 SV=2
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
2 MSTAGKVIKCKAAVLWEVKKPFSIEDVEVAPPKAYEVRIKMVAVGICRTDDHVVSGNLVT
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
3 PLPVILGHEAAGIVESVGEGVTTVKPGDKVIPLFTPQCGKCRVCKNPESNYCLKNDLGNP
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
4 RGTLQDGTRRFTCRGKPIHHFLGTSTFSQYTVVDENAVAKIDAASPLEKVCLIGCGFSTG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
5 YGSAVNVAKVTPGSTCAVFGLGGVGLSAVMGCKAAGAARIIAVDINKDKFAKAKELGATE
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
6 CINPQDYKKPIQEVLKEMTDGGVDFSFEVIGRLDTMMASLLCCHEACGTSVIVGVPPASQ
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
7 NLSINPMLLLTGRTWKGAVYGGFKSKEGIPKLVADFMAKKFSLDALITHVLPFEKINEGF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
8 DLLHSGKSIRTVLTF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
9 >tr|K7D361|K7D361_PANTR Alcohol dehydrogenase 1B (Class I), beta polypeptide OS=Pan troglodytes GN=ADH1B PE=2 SV=1
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
10 MSTAGKVIKCKAAVLWEVKKPFSIEDVEVAPPKAYEVRIKMVAVGICRTDDHVVSGNLVT
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
11 PLPAILGHEAAGIVESVGEGVTTVKPGDKVIPLFTPQCGKCRVCKNPESNYCLKNDLGNP
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
12 RGTLQDGTRRFTCRGKPIHHFLGTSTFSQYTVVDENAVAKIDAASPLEKVCLIGCGFSTG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
13 YGSAVNVAKVTPGSTCAVFGLGGVGLSAVMGCKAAGAARIIAVDINKDKFAKAKELGATE
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
14 CINPQDYKKPIQEVLKEMTDGGVDFSFEVIGRLDTMMASLLCCHEACGTSVIVGVPPASQ
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
15 NLSINPMLLLTGRTWKGAVYGGFKSKEGIPKLVADFMAKKFSLDALITHVLPFEKINEGF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
16 DLLHSGKSIRTVLTF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
17 >sp|P00329|ADH1_MOUSE Alcohol dehydrogenase 1 OS=Mus musculus GN=Adh1 PE=2 SV=2
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
18 MSTAGKVIKCKAAVLWELHKPFTIEDIEVAPPKAHEVRIKMVATGVCRSDDHVVSGTLVT
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
19 PLPAVLGHEGAGIVESVGEGVTCVKPGDKVIPLFSPQCGECRICKHPESNFCSRSDLLMP
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
20 RGTLREGTSRFSCKGKQIHNFISTSTFSQYTVVDDIAVAKIDGASPLDKVCLIGCGFSTG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
21 YGSAVKVAKVTPGSTCAVFGLGGVGLSVIIGCKAAGAARIIAVDINKDKFAKAKELGATE
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
22 CINPQDYSKPIQEVLQEMTDGGVDFSFEVIGRLDTMTSALLSCHAACGVSVVVGVPPNAQ
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
23 NLSMNPMLLLLGRTWKGAIFGGFKSKDSVPKLVADFMAKKFPLDPLITHVLPFEKINEAF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
24 DLLRSGKSIRTVLTF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
25 >sp|P00338-2|LDHA_HUMAN Isoform 2 of L-lactate dehydrogenase A chain OS=Homo sapiens GN=LDHA
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
26 MATLKDQLIYNLLKEEQTPQNKITVVGVGAVGMACAISILMKDLADELALVDVIEDKLKG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
27 EMMDLQHGSLFLRTPKIVSGKDYNVTANSKLVIITAGARQQEGESRLNLVQRNVNIFKFI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
28 IPNVVKYSPNCKLLIVSNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGV
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
29 HPLSCHGWVLGEHGDSSVPVWSGMNVAGVSLKTLHPDLGTDKDKEQWKECRYTLGDPKGA
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
30 AILKSSDVISFHCLGYNRILGGGCACCPFYLICD
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
31 >sp|P00338-5|LDHA_HUMAN Isoform 5 of L-lactate dehydrogenase A chain OS=Homo sapiens GN=LDHA
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
32 MATLKDQLIYNLLKEEQTPQNKITVVGVGAVGMACAISILMKDLADELALVDVIEDKLKG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
33 EMMDLQHGSLFLRTPKIVSGKDYNVTANSKLVIITAGARQQEGESRLNLVQRNVNIFKFI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
34 IPNVVKYSPNCKLLIVSNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGV
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
35 HPLSCHGWVLGEHGDSSVPVWSGMNVAGVSLKTLHPDLGTDKDKEQWKEVHKQVVERVFT
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
36 E
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
37 >sp|P00340|LDHA_CHICK L-lactate dehydrogenase A chain OS=Gallus gallus GN=LDHA PE=1 SV=3
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
38 MSLKDHLIHNVHKEEHAHAHNKISVVGVGAVGMACAISILMKDLADELTLVDVVEDKLKG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
39 EMLDLQHGSLFLKTPKIISGKDYSVTAHSKLVIVTAGARQQEGESRLNLVQRNVNIFKFI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
40 IPNVVKYSPDCKLLIVSNPVDILTYVAWKISGFPKHRVIGSGCNLDSARFRHLMGERLGI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
41 HPLSCHGWIVGEHGDSSVPVWSGVNVAGVSLKALHPDMGTDADKEHWKEVHKQVVDSAYE
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
42 VIKLKGYTSWAIGLSVADLAETIMKNLRRVHPISTAVKGMHGIKDDVFLSVPCVLGSSGI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
43 TDVVKMILKPDEEEKIKKSADTLWGIQKELQF
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
44 >sp|P19858|LDHA_BOVIN L-lactate dehydrogenase A chain OS=Bos taurus GN=LDHA PE=2 SV=2
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
45 MATLKDQLIQNLLKEEHVPQNKITIVGVGAVGMACAISILMKDLADEVALVDVMEDKLKG
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
46 EMMDLQHGSLFLRTPKIVSGKDYNVTANSRLVIITAGARQQEGESRLNLVQRNVNIFKFI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
47 IPNIVKYSPNCKLLVVSNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGV
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
48 HPLSCHGWILGEHGDSSVPVWSGVNVAGVSLKNLHPELGTDADKEQWKAVHKQVVDSAYE
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
49 VIKLKGYTSWAIGLSVADLAESIMKNLRRVHPISTMIKGLYGIKEDVFLSVPCILGQNGI
75fde37f69e5 Add cd-hit to protein fastas
Jim Johnson <jj@umn.edu>
parents:
diff changeset
50 SDVVKVTLTHEEEACLKKSADTLWGIQKELQF