# HG changeset patch # User dcorreia # Date 1486389995 18000 # Node ID baeab8664657db81f920781645c9c38671bc21d3 # Parent 814b453382cf23882a1446602b2690774ca7a543 planemo upload commit e0ca504b3313992020acf8ab7aed0a261237766e-dirty diff -r 814b453382cf -r baeab8664657 gblocks.xml --- a/gblocks.xml Mon Jan 23 07:42:56 2017 -0500 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,112 +0,0 @@ - - cleaning aligned sequences - - operation_0368 - - - gblocks - - - ln -s ${input} input.fasta; - Gblocks - input.fasta - #if not 'default' in $b1: - -b1=$b1 - #end if - #if not 'default' in $b2: - -b2=$b2 - #end if - -b3=$b3 - -b4=$b4 - -b5=$b5 - - $datatype - - > $gblocks_log; - mv input.fasta-gb $output_aln; - mv input.fasta-gb.htm $output_htm; - - - - - - - - - - - - - - - - - - - - - - - - display_gblocks_log == True - - - - - - - - - - - - - -.. class:: infomark - -**GBlocks version 0.91b, 2000** - ------ - -============== - Please cite: -============== - -"Improvement_ of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments." - - -.. _Improvement: http://sysbio.oxfordjournals.org/content/56/4/564.full - -**Talavera G., and Castresana J.** - -Systematic Biology 56, 564-577, 2007. - - -"Selection_ of conserved blocks from multiple alignments for their use in phylogenetic analysis." - - -.. _Selection: http://mbe.oxfordjournals.org/content/17/4/540 - -**Castresana J.** - -Molecular Biology and Evolution 17, 540-552, 2000. - ------ - -========== - Overview -========== - -Gblocks is a computer program written in ANSI C language that eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences. These positions may not be homologous or may have been saturated by multiple substitutions and it is convenient to eliminate them prior to phylogenetic analysis. Gblocks selects blocks in a similar way as it is usually done by hand but following a reproducible set of conditions. The selected blocks must fulfill certain requirements with respect to the lack of large segments of contiguous nonconserved positions, lack of gap positions and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. Gblocks outputs several files to visualize the selected blocks. - -The use of a program such as Gblocks reduces the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible and, finally, facilitates the reproduction of the alignments and subsequent phylogenetic analysis by other researchers. Gblocks is very fast in processing alignments and it is therefore highly suitable for large-scale phylogenetic analyses. Several parameters can be modified to make the selection of blocks more or less stringent. In general, a relaxed selection of blocks is better for short alignments, whereas a stringent selection is more adequate for longer ones. Be aware that the default options of Gblocks are stringent. - ------ - -For further informations, please visite the Gblocks_ website. - - -.. _Gblocks: http://molevol.cmima.csic.es/castresana/Gblocks.html - - - diff -r 814b453382cf -r baeab8664657 gblocks/gblocks.xml --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/gblocks/gblocks.xml Mon Feb 06 09:06:35 2017 -0500 @@ -0,0 +1,112 @@ + + cleaning aligned sequences + + operation_0368 + + + gblocks + + + ln -s ${input} input.fasta; + Gblocks + input.fasta + #if not 'default' in $b1: + -b1=$b1 + #end if + #if not 'default' in $b2: + -b2=$b2 + #end if + -b3=$b3 + -b4=$b4 + -b5=$b5 + + $datatype + + > $gblocks_log; + mv input.fasta-gb $output_aln; + mv input.fasta-gb.htm $output_htm; + + + + + + + + + + + + + + + + + + + + + + + + display_gblocks_log == True + + + + + + + + + + + + + +.. class:: infomark + +**GBlocks version 0.91b, 2000** + +----- + +============== + Please cite: +============== + +"Improvement_ of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments." + + +.. _Improvement: http://sysbio.oxfordjournals.org/content/56/4/564.full + +**Talavera G., and Castresana J.** + +Systematic Biology 56, 564-577, 2007. + + +"Selection_ of conserved blocks from multiple alignments for their use in phylogenetic analysis." + + +.. _Selection: http://mbe.oxfordjournals.org/content/17/4/540 + +**Castresana J.** + +Molecular Biology and Evolution 17, 540-552, 2000. + +----- + +========== + Overview +========== + +Gblocks is a computer program written in ANSI C language that eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences. These positions may not be homologous or may have been saturated by multiple substitutions and it is convenient to eliminate them prior to phylogenetic analysis. Gblocks selects blocks in a similar way as it is usually done by hand but following a reproducible set of conditions. The selected blocks must fulfill certain requirements with respect to the lack of large segments of contiguous nonconserved positions, lack of gap positions and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. Gblocks outputs several files to visualize the selected blocks. + +The use of a program such as Gblocks reduces the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible and, finally, facilitates the reproduction of the alignments and subsequent phylogenetic analysis by other researchers. Gblocks is very fast in processing alignments and it is therefore highly suitable for large-scale phylogenetic analyses. Several parameters can be modified to make the selection of blocks more or less stringent. In general, a relaxed selection of blocks is better for short alignments, whereas a stringent selection is more adequate for longer ones. Be aware that the default options of Gblocks are stringent. + +----- + +For further informations, please visite the Gblocks_ website. + + +.. _Gblocks: http://molevol.cmima.csic.es/castresana/Gblocks.html + + + diff -r 814b453382cf -r baeab8664657 gblocks/test-data/nad3.pir --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/gblocks/test-data/nad3.pir Mon Feb 06 09:06:35 2017 -0500 @@ -0,0 +1,101 @@ +>P1;nad3_parde + +------MEYLLQEYLPILVFLGMASALAIVLILAAAVIAVRN--PDPEKVSAYECGFNAF +D-DARMKFDVRFYLVSILFIIFDLEVAFLFPWAVSFASLS-DVAFWGLMVFLAVLTVGFA +YEWKKGALEWA----------------------* + +>P1;nad3_acaca + +---------MTLEYIYIFIFFWGAFFISCLLIFLSYFLVYQE--SDIEKNSAYECGFQPF +E-DTRSKFNVRYYLIAILFMIFDLEIMYLFPWSISISTGS-FFGVWAIFLFLIILTVGFI +YEWQKGALEWD----------------------* + +>P1;nad3_allma + +--------------MTYLVYIVFTIVLTVGLILVSYLLSQAQ--PDSEKVSAYECGFSPL +G-DARQKFDVSFYLIAILFIIFDLEVVFILPFASVIHNVS-LLGGWITIIFLVILTIGFI +YEFVSGAITDSF---------------------* + +>P1;nad3_apec + +-----------IFNFLTLFVSILIFLITTLITFAAHFLPSRN-TD-SEKSSPYECGFDPL +N-SARVPFSFRFFLVAILFLLFDLEIALLFPLPFSVFFH--P--IHTP----LILTVGLI +FEWVQGGLDWAE---------------------* + +>P1;nad3_arath + +---------MMSEFAPISIYLVISLLVSLILLGVPFPFASNS-STYPEKLSAYECGFDPS +G-DARSRFDIRFYLVSILFLIPDLEVTFFFPWAVPPNKID-LFGFWSMMAFLFILTIGFL +YEWKRGASDRE----------------------* + +>P1;nad3_balca + +-------------MNSFLIYLLIAITLSFILSIVGHRLPTRN-MD-QEKLSPYECGFDPQ +A-SARLPFSLRFFLVAILFLLFDLEIALLLPFPAALSARDPQLSFTLAFLILLILTIGLI +YEWMEGGLEWAE---------------------* + +>P1;nad3_chocr + +------MKLIFTEYSAILIFFAISSLLSSVIFLLSYFLIPQK--PDQEKVSAYECGFNPF +D-DARATFDIRFYLVAILFLIFDLEISFLFPWSLVLGEIS-IIGFWSMIVFLVILTIGFI +YEWYKGALEWE----------------------* + +>P1;nad3_drome + +-------------MFSIIFIALLILLITTIVMFLASILSKKA-LIDREKSSPFECGFDPK +S-SSRLPFSLRFFLITIIFLIFDVEIALILPMIIIMKYSNIMIWTITSIIFILILLIGLY +HEWNQGMLNWSN---------------------* + +>P1;nad3_human + +-------------MN-FALILMINTLLALLLMIITFWLPQLN-GY-MEKSTPYECGFDPM +S-PARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMVMSSLLLIIILALSLA +YEWLQKGLDWTE---------------------* + +>P1;nad3_ktun + +-------------MFFVLSLVLFTFLLSLVLLSVSLSLTKKK-MMNREKSSPFECGFDPK +S-SARLPFSMRFFLITVVFLVFDVEIVLLLPYLFSSGWSIDVFSLVGSMMILVILIIGVL +HEWSEGSLEWFSSSN------------------* + +>P1;nad3_lter + +-------------MILTALSSAIALLVPIIILGAAWVLASRS-TEDREKSSPFECGFDPK +S-TARIPFSTRFFLLAIIFIVFDIEIVLLMPLPTILHTSDVFTTVTTSVLFLMILLIGLI +HEWKEGSLDWSS---------------------* + +>P1;nad3_marpo + +-----------MEFAPIFVYLVISLLLSLILIGVSFLFASSSSLAYPEKLSAYECGFDPF +D-DARSRFDIRFYLVSILFIIFDLEVTFLFPWAVSLNKIG-LFGFWSMMVFLFILTIGFV +YEWKKGALDWE----------------------* + +>P1;nad3_metse + +---------MYTEFYGILVLLIFSVVLSAIISGASYILGDKQ--PDREKVSAYECGFDPF +G-TPGRPFSIRFFLIGILFLIFDLEISFLFPWCVVCNQVF-PFGYWTMIVFLAVLTLGLV +YEWLKGGLEWE----------------------* + +>P1;nad3_picca + +MLNYFVYPYGIENDMGMKFYMMLVPMMSMVLMMINYMMTNKS-DNNMNKTGPYECGFDSF +R-QSRTTYSIKFILIAILFLPFDLELTSILPYTLSMYNTN-IYGLFILLYFLLPLIIGFI +IEINTKAIYMTKMFNRNVKSMTSYVKYNNKI--* + +>P1;nad3_podan + +-------------MSSMTLFILFVSIIALLFLFINLIFAPHN--PYQEKYSIFECGFHSF +LGQNRTQFGVKFFIFALVYLLLDLEILLTFPFAVSEYVNN-IYGLIILLGFITIITIGFV +YELGKSALKIDSRQVITMTRFNYSSTIEYLGKI* + +>P1;nad3_prowi + +----------MYEFLGILIYFFIALALSLLLLGLPFLVSTRK--ADPEKISAYECGFDPF +D-DARGRFDIQFYLVAILFIIFDLEVAFLFPWALTLNKIG-YFGFWSMMLFLFILTVGFI +YEWRKGALDWS----------------------* + +>P1;nad3_recam + +-----MNTMILSEYLSVLIFFIFSFGLSCIILGLSYVLATQN--ADTEKLSPYECGFNPF +D-DARGAFDVRFYLVAILFIIFDLEVAFLFPWAVALSDVT-IFGFWTMFIFLLILTVGFI +YEWKKGALDWE----------------------* diff -r 814b453382cf -r baeab8664657 gblocks/test-data/nad3.pir-gb --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/gblocks/test-data/nad3.pir-gb Mon Feb 06 09:06:35 2017 -0500 @@ -0,0 +1,68 @@ +>P1;nad3_parde + +EKVSAYECGF NAFARMKFDV RFYLVSILFI IFDLEVAFLF PVLTVGFAYE WKKGAL* + +>P1;nad3_acaca + +EKNSAYECGF QPFTRSKFNV RYYLIAILFM IFDLEIMYLF PILTVGFIYE WQKGAL* + +>P1;nad3_allma + +EKVSAYECGF SPLARQKFDV SFYLIAILFI IFDLEVVFIL PILTIGFIYE FVSGAI* + +>P1;nad3_apec + +EKSSPYECGF DPLARVPFSF RFFLVAILFL LFDLEIALLF PILTVGLIFE WVQGGL* + +>P1;nad3_arath + +EKLSAYECGF DPSARSRFDI RFYLVSILFL IPDLEVTFFF PILTIGFLYE WKRGAS* + +>P1;nad3_balca + +EKLSPYECGF DPQARLPFSL RFFLVAILFL LFDLEIALLL PILTIGLIYE WMEGGL* + +>P1;nad3_chocr + +EKVSAYECGF NPFARATFDI RFYLVAILFL IFDLEISFLF PILTIGFIYE WYKGAL* + +>P1;nad3_drome + +EKSSPFECGF DPKSRLPFSL RFFLITIIFL IFDVEIALIL PILLIGLYHE WNQGML* + +>P1;nad3_human + +EKSTPYECGF DPMARVPFSM KFFLVAITFL LFDLEIALLL PILALSLAYE WLQKGL* + +>P1;nad3_ktun + +EKSSPFECGF DPKARLPFSM RFFLITVVFL VFDVEIVLLL PILIIGVLHE WSEGSL* + +>P1;nad3_lter + +EKSSPFECGF DPKARIPFST RFFLLAIIFI VFDIEIVLLM PILLIGLIHE WKEGSL* + +>P1;nad3_marpo + +EKLSAYECGF DPFARSRFDI RFYLVSILFI IFDLEVTFLF PILTIGFVYE WKKGAL* + +>P1;nad3_metse + +EKVSAYECGF DPFPGRPFSI RFFLIGILFL IFDLEISFLF PVLTLGLVYE WLKGGL* + +>P1;nad3_picca + +NKTGPYECGF DSFSRTTYSI KFILIAILFL PFDLELTSIL PPLIIGFIIE INTKAI* + +>P1;nad3_podan + +EKYSIFECGF HSFNRTQFGV KFFIFALVYL LLDLEILLTF PIITIGFVYE LGKSAL* + +>P1;nad3_prowi + +EKISAYECGF DPFARGRFDI QFYLVAILFI IFDLEVAFLF PILTVGFIYE WRKGAL* + +>P1;nad3_recam + +EKLSPYECGF NPFARGAFDV RFYLVAILFI IFDLEVAFLF PILTVGFIYE WKKGAL* + diff -r 814b453382cf -r baeab8664657 gblocks/test-data/nad3.pir-gb.htm --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/gblocks/test-data/nad3.pir-gb.htm Mon Feb 06 09:06:35 2017 -0500 @@ -0,0 +1,124 @@ + + + +nad3.pir + + + + +

Gblocks 0.91b Results

+

+Processed file: nad3.pir
+Number of sequences: 17
Alignment assumed to be: Protein
+New number of positions: 56 (selected positions are underlined in blue) +

+
+                         10        20        30        40        50        60
+                 =========+=========+=========+=========+=========+=========+
+nad3_parde       ------MEYLLQEYLPILVFLGMASALAIVLILAAAVIAVRN--PDPEKVSAYECGFNAF
+nad3_acaca       ---------MTLEYIYIFIFFWGAFFISCLLIFLSYFLVYQE--SDIEKNSAYECGFQPF
+nad3_allma       --------------MTYLVYIVFTIVLTVGLILVSYLLSQAQ--PDSEKVSAYECGFSPL
+nad3_apec        -----------IFNFLTLFVSILIFLITTLITFAAHFLPSRN-TD-SEKSSPYECGFDPL
+nad3_arath       ---------MMSEFAPISIYLVISLLVSLILLGVPFPFASNS-STYPEKLSAYECGFDPS
+nad3_balca       -------------MNSFLIYLLIAITLSFILSIVGHRLPTRN-MD-QEKLSPYECGFDPQ
+nad3_chocr       ------MKLIFTEYSAILIFFAISSLLSSVIFLLSYFLIPQK--PDQEKVSAYECGFNPF
+nad3_drome       -------------MFSIIFIALLILLITTIVMFLASILSKKA-LIDREKSSPFECGFDPK
+nad3_human       -------------MN-FALILMINTLLALLLMIITFWLPQLN-GY-MEKSTPYECGFDPM
+nad3_ktun        -------------MFFVLSLVLFTFLLSLVLLSVSLSLTKKK-MMNREKSSPFECGFDPK
+nad3_lter        -------------MILTALSSAIALLVPIIILGAAWVLASRS-TEDREKSSPFECGFDPK
+nad3_marpo       -----------MEFAPIFVYLVISLLLSLILIGVSFLFASSSSLAYPEKLSAYECGFDPF
+nad3_metse       ---------MYTEFYGILVLLIFSVVLSAIISGASYILGDKQ--PDREKVSAYECGFDPF
+nad3_picca       MLNYFVYPYGIENDMGMKFYMMLVPMMSMVLMMINYMMTNKS-DNNMNKTGPYECGFDSF
+nad3_podan       -------------MSSMTLFILFVSIIALLFLFINLIFAPHN--PYQEKYSIFECGFHSF
+nad3_prowi       ----------MYEFLGILIYFFIALALSLLLLGLPFLVSTRK--ADPEKISAYECGFDPF
+nad3_recam       -----MNTMILSEYLSVLIFFIFSFGLSCIILGLSYVLATQN--ADTEKLSPYECGFNPF
+                                                                #############
+
+
+                         70        80        90       100       110       120
+                 =========+=========+=========+=========+=========+=========+
+nad3_parde       D-DARMKFDVRFYLVSILFIIFDLEVAFLFPWAVSFASLS-DVAFWGLMVFLAVLTVGFA
+nad3_acaca       E-DTRSKFNVRYYLIAILFMIFDLEIMYLFPWSISISTGS-FFGVWAIFLFLIILTVGFI
+nad3_allma       G-DARQKFDVSFYLIAILFIIFDLEVVFILPFASVIHNVS-LLGGWITIIFLVILTIGFI
+nad3_apec        N-SARVPFSFRFFLVAILFLLFDLEIALLFPLPFSVFFH--P--IHTP----LILTVGLI
+nad3_arath       G-DARSRFDIRFYLVSILFLIPDLEVTFFFPWAVPPNKID-LFGFWSMMAFLFILTIGFL
+nad3_balca       A-SARLPFSLRFFLVAILFLLFDLEIALLLPFPAALSARDPQLSFTLAFLILLILTIGLI
+nad3_chocr       D-DARATFDIRFYLVAILFLIFDLEISFLFPWSLVLGEIS-IIGFWSMIVFLVILTIGFI
+nad3_drome       S-SSRLPFSLRFFLITIIFLIFDVEIALILPMIIIMKYSNIMIWTITSIIFILILLIGLY
+nad3_human       S-PARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMVMSSLLLIIILALSLA
+nad3_ktun        S-SARLPFSMRFFLITVVFLVFDVEIVLLLPYLFSSGWSIDVFSLVGSMMILVILIIGVL
+nad3_lter        S-TARIPFSTRFFLLAIIFIVFDIEIVLLMPLPTILHTSDVFTTVTTSVLFLMILLIGLI
+nad3_marpo       D-DARSRFDIRFYLVSILFIIFDLEVTFLFPWAVSLNKIG-LFGFWSMMVFLFILTIGFV
+nad3_metse       G-TPGRPFSIRFFLIGILFLIFDLEISFLFPWCVVCNQVF-PFGYWTMIVFLAVLTLGLV
+nad3_picca       R-QSRTTYSIKFILIAILFLPFDLELTSILPYTLSMYNTN-IYGLFILLYFLLPLIIGFI
+nad3_podan       LGQNRTQFGVKFFIFALVYLLLDLEILLTFPFAVSEYVNN-IYGLIILLGFITIITIGFV
+nad3_prowi       D-DARGRFDIQFYLVAILFIIFDLEVAFLFPWALTLNKIG-YFGFWSMMLFLFILTVGFI
+nad3_recam       D-DARGAFDVRFYLVAILFIIFDLEVAFLFPWAVALSDVT-IFGFWTMFIFLLILTVGFI
+                    ############################                      #######
+
+
+                        130       140       150
+                 =========+=========+=========+===
+nad3_parde       YEWKKGALEWA----------------------
+nad3_acaca       YEWQKGALEWD----------------------
+nad3_allma       YEFVSGAITDSF---------------------
+nad3_apec        FEWVQGGLDWAE---------------------
+nad3_arath       YEWKRGASDRE----------------------
+nad3_balca       YEWMEGGLEWAE---------------------
+nad3_chocr       YEWYKGALEWE----------------------
+nad3_drome       HEWNQGMLNWSN---------------------
+nad3_human       YEWLQKGLDWTE---------------------
+nad3_ktun        HEWSEGSLEWFSSSN------------------
+nad3_lter        HEWKEGSLDWSS---------------------
+nad3_marpo       YEWKKGALDWE----------------------
+nad3_metse       YEWLKGGLEWE----------------------
+nad3_picca       IEINTKAIYMTKMFNRNVKSMTSYVKYNNKI--
+nad3_podan       YELGKSALKIDSRQVITMTRFNYSSTIEYLGKI
+nad3_prowi       YEWRKGALDWS----------------------
+nad3_recam       YEWKKGALDWE----------------------
+                 ########                         
+
+
+
+
+
+
+
+
Parameters used +Minimum Number Of Sequences For A Conserved Position: 9 +Minimum Number Of Sequences For A Flanking Position: 14 +Maximum Number Of Contiguous Nonconserved Positions: 8 +Minimum Length Of A Block: 5 +Allowed Gap Positions: None +Use Similarity Matrices: Yes + +
Flank positions of the 3 selected block(s)
+Flanks: [48  60]  [64  91]  [114  128]  
+
+New number of positions in nad3.pir-gb1:  56  (36% of the original 153 positions)
+
+
+
+
diff -r 814b453382cf -r baeab8664657 gblocks/tool_dependencies.xml
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/gblocks/tool_dependencies.xml	Mon Feb 06 09:06:35 2017 -0500
@@ -0,0 +1,6 @@
+
+
+    
+        
+    
+
diff -r 814b453382cf -r baeab8664657 test-data/nad3.pir
--- a/test-data/nad3.pir	Mon Jan 23 07:42:56 2017 -0500
+++ /dev/null	Thu Jan 01 00:00:00 1970 +0000
@@ -1,101 +0,0 @@
->P1;nad3_parde
-
-------MEYLLQEYLPILVFLGMASALAIVLILAAAVIAVRN--PDPEKVSAYECGFNAF
-D-DARMKFDVRFYLVSILFIIFDLEVAFLFPWAVSFASLS-DVAFWGLMVFLAVLTVGFA
-YEWKKGALEWA----------------------*
-
->P1;nad3_acaca
-
----------MTLEYIYIFIFFWGAFFISCLLIFLSYFLVYQE--SDIEKNSAYECGFQPF
-E-DTRSKFNVRYYLIAILFMIFDLEIMYLFPWSISISTGS-FFGVWAIFLFLIILTVGFI
-YEWQKGALEWD----------------------*
-
->P1;nad3_allma
-
---------------MTYLVYIVFTIVLTVGLILVSYLLSQAQ--PDSEKVSAYECGFSPL
-G-DARQKFDVSFYLIAILFIIFDLEVVFILPFASVIHNVS-LLGGWITIIFLVILTIGFI
-YEFVSGAITDSF---------------------*
-
->P1;nad3_apec
-
------------IFNFLTLFVSILIFLITTLITFAAHFLPSRN-TD-SEKSSPYECGFDPL
-N-SARVPFSFRFFLVAILFLLFDLEIALLFPLPFSVFFH--P--IHTP----LILTVGLI
-FEWVQGGLDWAE---------------------*
-
->P1;nad3_arath
-
----------MMSEFAPISIYLVISLLVSLILLGVPFPFASNS-STYPEKLSAYECGFDPS
-G-DARSRFDIRFYLVSILFLIPDLEVTFFFPWAVPPNKID-LFGFWSMMAFLFILTIGFL
-YEWKRGASDRE----------------------*
-
->P1;nad3_balca
-
--------------MNSFLIYLLIAITLSFILSIVGHRLPTRN-MD-QEKLSPYECGFDPQ
-A-SARLPFSLRFFLVAILFLLFDLEIALLLPFPAALSARDPQLSFTLAFLILLILTIGLI
-YEWMEGGLEWAE---------------------*
-
->P1;nad3_chocr
-
-------MKLIFTEYSAILIFFAISSLLSSVIFLLSYFLIPQK--PDQEKVSAYECGFNPF
-D-DARATFDIRFYLVAILFLIFDLEISFLFPWSLVLGEIS-IIGFWSMIVFLVILTIGFI
-YEWYKGALEWE----------------------*
-
->P1;nad3_drome
-
--------------MFSIIFIALLILLITTIVMFLASILSKKA-LIDREKSSPFECGFDPK
-S-SSRLPFSLRFFLITIIFLIFDVEIALILPMIIIMKYSNIMIWTITSIIFILILLIGLY
-HEWNQGMLNWSN---------------------*
-
->P1;nad3_human
-
--------------MN-FALILMINTLLALLLMIITFWLPQLN-GY-MEKSTPYECGFDPM
-S-PARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMVMSSLLLIIILALSLA
-YEWLQKGLDWTE---------------------*
-
->P1;nad3_ktun
-
--------------MFFVLSLVLFTFLLSLVLLSVSLSLTKKK-MMNREKSSPFECGFDPK
-S-SARLPFSMRFFLITVVFLVFDVEIVLLLPYLFSSGWSIDVFSLVGSMMILVILIIGVL
-HEWSEGSLEWFSSSN------------------*
-
->P1;nad3_lter
-
--------------MILTALSSAIALLVPIIILGAAWVLASRS-TEDREKSSPFECGFDPK
-S-TARIPFSTRFFLLAIIFIVFDIEIVLLMPLPTILHTSDVFTTVTTSVLFLMILLIGLI
-HEWKEGSLDWSS---------------------*
-
->P1;nad3_marpo
-
------------MEFAPIFVYLVISLLLSLILIGVSFLFASSSSLAYPEKLSAYECGFDPF
-D-DARSRFDIRFYLVSILFIIFDLEVTFLFPWAVSLNKIG-LFGFWSMMVFLFILTIGFV
-YEWKKGALDWE----------------------*
-
->P1;nad3_metse
-
----------MYTEFYGILVLLIFSVVLSAIISGASYILGDKQ--PDREKVSAYECGFDPF
-G-TPGRPFSIRFFLIGILFLIFDLEISFLFPWCVVCNQVF-PFGYWTMIVFLAVLTLGLV
-YEWLKGGLEWE----------------------*
-
->P1;nad3_picca
-
-MLNYFVYPYGIENDMGMKFYMMLVPMMSMVLMMINYMMTNKS-DNNMNKTGPYECGFDSF
-R-QSRTTYSIKFILIAILFLPFDLELTSILPYTLSMYNTN-IYGLFILLYFLLPLIIGFI
-IEINTKAIYMTKMFNRNVKSMTSYVKYNNKI--*
-
->P1;nad3_podan
-
--------------MSSMTLFILFVSIIALLFLFINLIFAPHN--PYQEKYSIFECGFHSF
-LGQNRTQFGVKFFIFALVYLLLDLEILLTFPFAVSEYVNN-IYGLIILLGFITIITIGFV
-YELGKSALKIDSRQVITMTRFNYSSTIEYLGKI*
-
->P1;nad3_prowi
-
-----------MYEFLGILIYFFIALALSLLLLGLPFLVSTRK--ADPEKISAYECGFDPF
-D-DARGRFDIQFYLVAILFIIFDLEVAFLFPWALTLNKIG-YFGFWSMMLFLFILTVGFI
-YEWRKGALDWS----------------------*
-
->P1;nad3_recam
-
------MNTMILSEYLSVLIFFIFSFGLSCIILGLSYVLATQN--ADTEKLSPYECGFNPF
-D-DARGAFDVRFYLVAILFIIFDLEVAFLFPWAVALSDVT-IFGFWTMFIFLLILTVGFI
-YEWKKGALDWE----------------------*
diff -r 814b453382cf -r baeab8664657 test-data/nad3.pir-gb
--- a/test-data/nad3.pir-gb	Mon Jan 23 07:42:56 2017 -0500
+++ /dev/null	Thu Jan 01 00:00:00 1970 +0000
@@ -1,68 +0,0 @@
->P1;nad3_parde
- 
-EKVSAYECGF NAFARMKFDV RFYLVSILFI IFDLEVAFLF PVLTVGFAYE WKKGAL*
-
->P1;nad3_acaca
- 
-EKNSAYECGF QPFTRSKFNV RYYLIAILFM IFDLEIMYLF PILTVGFIYE WQKGAL*
-
->P1;nad3_allma
- 
-EKVSAYECGF SPLARQKFDV SFYLIAILFI IFDLEVVFIL PILTIGFIYE FVSGAI*
-
->P1;nad3_apec
- 
-EKSSPYECGF DPLARVPFSF RFFLVAILFL LFDLEIALLF PILTVGLIFE WVQGGL*
-
->P1;nad3_arath
- 
-EKLSAYECGF DPSARSRFDI RFYLVSILFL IPDLEVTFFF PILTIGFLYE WKRGAS*
-
->P1;nad3_balca
- 
-EKLSPYECGF DPQARLPFSL RFFLVAILFL LFDLEIALLL PILTIGLIYE WMEGGL*
-
->P1;nad3_chocr
- 
-EKVSAYECGF NPFARATFDI RFYLVAILFL IFDLEISFLF PILTIGFIYE WYKGAL*
-
->P1;nad3_drome
- 
-EKSSPFECGF DPKSRLPFSL RFFLITIIFL IFDVEIALIL PILLIGLYHE WNQGML*
-
->P1;nad3_human
- 
-EKSTPYECGF DPMARVPFSM KFFLVAITFL LFDLEIALLL PILALSLAYE WLQKGL*
-
->P1;nad3_ktun
- 
-EKSSPFECGF DPKARLPFSM RFFLITVVFL VFDVEIVLLL PILIIGVLHE WSEGSL*
-
->P1;nad3_lter
- 
-EKSSPFECGF DPKARIPFST RFFLLAIIFI VFDIEIVLLM PILLIGLIHE WKEGSL*
-
->P1;nad3_marpo
- 
-EKLSAYECGF DPFARSRFDI RFYLVSILFI IFDLEVTFLF PILTIGFVYE WKKGAL*
-
->P1;nad3_metse
- 
-EKVSAYECGF DPFPGRPFSI RFFLIGILFL IFDLEISFLF PVLTLGLVYE WLKGGL*
-
->P1;nad3_picca
- 
-NKTGPYECGF DSFSRTTYSI KFILIAILFL PFDLELTSIL PPLIIGFIIE INTKAI*
-
->P1;nad3_podan
- 
-EKYSIFECGF HSFNRTQFGV KFFIFALVYL LLDLEILLTF PIITIGFVYE LGKSAL*
-
->P1;nad3_prowi
- 
-EKISAYECGF DPFARGRFDI QFYLVAILFI IFDLEVAFLF PILTVGFIYE WRKGAL*
-
->P1;nad3_recam
- 
-EKLSPYECGF NPFARGAFDV RFYLVAILFI IFDLEVAFLF PILTVGFIYE WKKGAL*
-
diff -r 814b453382cf -r baeab8664657 test-data/nad3.pir-gb.htm
--- a/test-data/nad3.pir-gb.htm	Mon Jan 23 07:42:56 2017 -0500
+++ /dev/null	Thu Jan 01 00:00:00 1970 +0000
@@ -1,124 +0,0 @@
-
-
-
-nad3.pir
-
-
-
-
-

Gblocks 0.91b Results

-

-Processed file: nad3.pir
-Number of sequences: 17
Alignment assumed to be: Protein
-New number of positions: 56 (selected positions are underlined in blue) -

-
-                         10        20        30        40        50        60
-                 =========+=========+=========+=========+=========+=========+
-nad3_parde       ------MEYLLQEYLPILVFLGMASALAIVLILAAAVIAVRN--PDPEKVSAYECGFNAF
-nad3_acaca       ---------MTLEYIYIFIFFWGAFFISCLLIFLSYFLVYQE--SDIEKNSAYECGFQPF
-nad3_allma       --------------MTYLVYIVFTIVLTVGLILVSYLLSQAQ--PDSEKVSAYECGFSPL
-nad3_apec        -----------IFNFLTLFVSILIFLITTLITFAAHFLPSRN-TD-SEKSSPYECGFDPL
-nad3_arath       ---------MMSEFAPISIYLVISLLVSLILLGVPFPFASNS-STYPEKLSAYECGFDPS
-nad3_balca       -------------MNSFLIYLLIAITLSFILSIVGHRLPTRN-MD-QEKLSPYECGFDPQ
-nad3_chocr       ------MKLIFTEYSAILIFFAISSLLSSVIFLLSYFLIPQK--PDQEKVSAYECGFNPF
-nad3_drome       -------------MFSIIFIALLILLITTIVMFLASILSKKA-LIDREKSSPFECGFDPK
-nad3_human       -------------MN-FALILMINTLLALLLMIITFWLPQLN-GY-MEKSTPYECGFDPM
-nad3_ktun        -------------MFFVLSLVLFTFLLSLVLLSVSLSLTKKK-MMNREKSSPFECGFDPK
-nad3_lter        -------------MILTALSSAIALLVPIIILGAAWVLASRS-TEDREKSSPFECGFDPK
-nad3_marpo       -----------MEFAPIFVYLVISLLLSLILIGVSFLFASSSSLAYPEKLSAYECGFDPF
-nad3_metse       ---------MYTEFYGILVLLIFSVVLSAIISGASYILGDKQ--PDREKVSAYECGFDPF
-nad3_picca       MLNYFVYPYGIENDMGMKFYMMLVPMMSMVLMMINYMMTNKS-DNNMNKTGPYECGFDSF
-nad3_podan       -------------MSSMTLFILFVSIIALLFLFINLIFAPHN--PYQEKYSIFECGFHSF
-nad3_prowi       ----------MYEFLGILIYFFIALALSLLLLGLPFLVSTRK--ADPEKISAYECGFDPF
-nad3_recam       -----MNTMILSEYLSVLIFFIFSFGLSCIILGLSYVLATQN--ADTEKLSPYECGFNPF
-                                                                #############
-
-
-                         70        80        90       100       110       120
-                 =========+=========+=========+=========+=========+=========+
-nad3_parde       D-DARMKFDVRFYLVSILFIIFDLEVAFLFPWAVSFASLS-DVAFWGLMVFLAVLTVGFA
-nad3_acaca       E-DTRSKFNVRYYLIAILFMIFDLEIMYLFPWSISISTGS-FFGVWAIFLFLIILTVGFI
-nad3_allma       G-DARQKFDVSFYLIAILFIIFDLEVVFILPFASVIHNVS-LLGGWITIIFLVILTIGFI
-nad3_apec        N-SARVPFSFRFFLVAILFLLFDLEIALLFPLPFSVFFH--P--IHTP----LILTVGLI
-nad3_arath       G-DARSRFDIRFYLVSILFLIPDLEVTFFFPWAVPPNKID-LFGFWSMMAFLFILTIGFL
-nad3_balca       A-SARLPFSLRFFLVAILFLLFDLEIALLLPFPAALSARDPQLSFTLAFLILLILTIGLI
-nad3_chocr       D-DARATFDIRFYLVAILFLIFDLEISFLFPWSLVLGEIS-IIGFWSMIVFLVILTIGFI
-nad3_drome       S-SSRLPFSLRFFLITIIFLIFDVEIALILPMIIIMKYSNIMIWTITSIIFILILLIGLY
-nad3_human       S-PARVPFSMKFFLVAITFLLFDLEIALLLPLPWALQTTNLPLMVMSSLLLIIILALSLA
-nad3_ktun        S-SARLPFSMRFFLITVVFLVFDVEIVLLLPYLFSSGWSIDVFSLVGSMMILVILIIGVL
-nad3_lter        S-TARIPFSTRFFLLAIIFIVFDIEIVLLMPLPTILHTSDVFTTVTTSVLFLMILLIGLI
-nad3_marpo       D-DARSRFDIRFYLVSILFIIFDLEVTFLFPWAVSLNKIG-LFGFWSMMVFLFILTIGFV
-nad3_metse       G-TPGRPFSIRFFLIGILFLIFDLEISFLFPWCVVCNQVF-PFGYWTMIVFLAVLTLGLV
-nad3_picca       R-QSRTTYSIKFILIAILFLPFDLELTSILPYTLSMYNTN-IYGLFILLYFLLPLIIGFI
-nad3_podan       LGQNRTQFGVKFFIFALVYLLLDLEILLTFPFAVSEYVNN-IYGLIILLGFITIITIGFV
-nad3_prowi       D-DARGRFDIQFYLVAILFIIFDLEVAFLFPWALTLNKIG-YFGFWSMMLFLFILTVGFI
-nad3_recam       D-DARGAFDVRFYLVAILFIIFDLEVAFLFPWAVALSDVT-IFGFWTMFIFLLILTVGFI
-                    ############################                      #######
-
-
-                        130       140       150
-                 =========+=========+=========+===
-nad3_parde       YEWKKGALEWA----------------------
-nad3_acaca       YEWQKGALEWD----------------------
-nad3_allma       YEFVSGAITDSF---------------------
-nad3_apec        FEWVQGGLDWAE---------------------
-nad3_arath       YEWKRGASDRE----------------------
-nad3_balca       YEWMEGGLEWAE---------------------
-nad3_chocr       YEWYKGALEWE----------------------
-nad3_drome       HEWNQGMLNWSN---------------------
-nad3_human       YEWLQKGLDWTE---------------------
-nad3_ktun        HEWSEGSLEWFSSSN------------------
-nad3_lter        HEWKEGSLDWSS---------------------
-nad3_marpo       YEWKKGALDWE----------------------
-nad3_metse       YEWLKGGLEWE----------------------
-nad3_picca       IEINTKAIYMTKMFNRNVKSMTSYVKYNNKI--
-nad3_podan       YELGKSALKIDSRQVITMTRFNYSSTIEYLGKI
-nad3_prowi       YEWRKGALDWS----------------------
-nad3_recam       YEWKKGALDWE----------------------
-                 ########                         
-
-
-
-
-
-
-
-
Parameters used -Minimum Number Of Sequences For A Conserved Position: 9 -Minimum Number Of Sequences For A Flanking Position: 14 -Maximum Number Of Contiguous Nonconserved Positions: 8 -Minimum Length Of A Block: 5 -Allowed Gap Positions: None -Use Similarity Matrices: Yes - -
Flank positions of the 3 selected block(s)
-Flanks: [48  60]  [64  91]  [114  128]  
-
-New number of positions in nad3.pir-gb1:  56  (36% of the original 153 positions)
-
-
-
-
diff -r 814b453382cf -r baeab8664657 tool_dependencies.xml
--- a/tool_dependencies.xml	Mon Jan 23 07:42:56 2017 -0500
+++ /dev/null	Thu Jan 01 00:00:00 1970 +0000
@@ -1,6 +0,0 @@
-
-
-    
-        
-    
-