Mercurial > repos > devteam > emboss_5
comparison emboss_needle.xml @ 10:9b98d3d903c6 draft
planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/tools/emboss_5 commit fc158bfe5f5927dc199321a2cf43310373cbc8ba
author | devteam |
---|---|
date | Fri, 12 Aug 2016 19:17:10 -0400 |
parents | |
children | 0e2484b6829b |
comparison
equal
deleted
inserted
replaced
9:18a63c800a4d | 10:9b98d3d903c6 |
---|---|
1 <tool id="EMBOSS: needle56" name="needle" version="5.0.0"> | |
2 <description>Needleman-Wunsch global alignment</description> | |
3 <requirements><requirement type="package" version="5.0.0">emboss</requirement></requirements> | |
4 <command>needle -asequence $input1 -bsequence $input2 -outfile $out_file1 -gapopen $gapopen -gapextend $gapextend -brief $brief -aformat3 $out_format1 -auto</command> | |
5 <inputs> | |
6 <param format="fasta" name="input1" type="data"> | |
7 <label>Sequence 1</label> | |
8 </param> | |
9 <param format="fasta" name="input2" type="data"> | |
10 <label>Sequence 2</label> | |
11 </param> | |
12 <param name="gapopen" type="text" value="10.0"> | |
13 <label>Gap open penalty</label> | |
14 </param> | |
15 <param name="gapextend" type="text" value="0.5"> | |
16 <label>Gap extension penalty</label> | |
17 </param> | |
18 <param name="brief" type="select"> | |
19 <label>Brief identity and similarity</label> | |
20 <option value="yes">Yes</option> | |
21 <option value="no">No</option> | |
22 </param> | |
23 <param name="out_format1" type="select"> | |
24 <label>Output Alignment File Format</label> | |
25 <option value="srspair">SRS pair (p)</option> | |
26 <option value="simple">Simple (m)</option> | |
27 <option value="fasta">FASTA (m)</option> | |
28 <option value="msf">MSF (m)</option> | |
29 <option value="srs">SRS (m)</option> | |
30 <option value="pair">Pair (p)</option> | |
31 <option value="markx0">Markx0 (p)</option> | |
32 <option value="markx1">Markx1 (p)</option> | |
33 <option value="markx2">Markx2 (p)</option> | |
34 <option value="markx3">Markx3 (p)</option> | |
35 <option value="markx10">Markx10 (p)</option> | |
36 <option value="score">Score (p)</option> | |
37 </param> | |
38 </inputs> | |
39 <outputs> | |
40 <data format="needle" name="out_file1" /> | |
41 </outputs> | |
42 <tests> | |
43 <test> | |
44 <param name="input1" value="2.fasta"/> | |
45 <param name="input2" value="1.fasta"/> | |
46 <param name="gapopen" value="10"/> | |
47 <param name="gapextend" value="0.5"/> | |
48 <param name="brief" value="yes"/> | |
49 <param name="out_format1" value="score"/> | |
50 <output name="out_file1" file="emboss_needle_out.score"/> | |
51 </test> | |
52 </tests> | |
53 <code file="emboss_format_corrector.py" /> | |
54 <help> | |
55 | |
56 .. class:: warningmark | |
57 | |
58 needle reads any two sequences of the same type (DNA or protein). | |
59 | |
60 ----- | |
61 | |
62 **Syntax** | |
63 | |
64 This tool uses the Needleman-Wunsch global alignment algorithm to find the optimum alignment (including gaps) of two sequences when considering their entire length. | |
65 | |
66 - **Optimal alignment:** Dynamic programming methods ensure the optimal global alignment by exploring all possible alignments and choosing the best. | |
67 | |
68 - **The Needleman-Wunsch algorithm** is a member of the class of algorithms that can calculate the best score and alignment in the order of mn steps, (where 'n' and 'm' are the lengths of the two sequences). | |
69 | |
70 - **Gap open penalty:** [10.0 for any sequence] The gap open penalty is the score taken away when a gap is created. The best value depends on the choice of comparison matrix. The default value assumes you are using the EBLOSUM62 matrix for protein sequences, and the EDNAFULL matrix for nucleotide sequences. (Floating point number from 1.0 to 100.0) | |
71 | |
72 - **Gap extension penalty:** [0.5 for any sequence] The gap extension, penalty is added to the standard gap penalty for each base or residue in the gap. This is how long gaps are penalized. Usually you will expect a few long gaps rather than many short gaps, so the gap extension penalty should be lower than the gap penalty. An exception is where one or both sequences are single reads with possible sequencing errors in which case you would expect many single base gaps. You can get this result by setting the gap open penalty to zero (or very low) and using the gap extension penalty to control gap scoring. (Floating point number from 0.0 to 10.0) | |
73 | |
74 You can view the original documentation here_. | |
75 | |
76 .. _here: http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/needle.html | |
77 | |
78 ----- | |
79 | |
80 **Example** | |
81 | |
82 - Input File:: | |
83 | |
84 >hg18_dna range=chrX:151073054-151073136 5'pad=0 3'pad=0 revComp=FALSE strand=? repeatMasking=none | |
85 TTTATGTCTATAATCCTTACCAAAAGTTACCTTGGAATAAGAAGAAGTCA | |
86 GTAAAAAGAAGGCTGTTGTTCCGTGAAATACTG | |
87 | |
88 - If both Sequence1 and Sequence2 take the above file as input, Gap open penalty equals 10.0, Gap extension penalty equals 0.5, Brief identity and similarity is set to Yes, Output Alignment File Format is set to SRS pairs, the output file is:: | |
89 | |
90 ######################################## | |
91 # Program: needle | |
92 # Rundate: Mon Apr 02 2007 14:23:16 | |
93 # Align_format: srspair | |
94 # Report_file: ./database/files/dataset_7.dat | |
95 ######################################## | |
96 | |
97 #======================================= | |
98 # | |
99 # Aligned_sequences: 2 | |
100 # 1: hg18_dna | |
101 # 2: hg18_dna | |
102 # Matrix: EDNAFULL | |
103 # Gap_penalty: 10.0 | |
104 # Extend_penalty: 0.5 | |
105 # | |
106 # Length: 83 | |
107 # Identity: 83/83 (100.0%) | |
108 # Similarity: 83/83 (100.0%) | |
109 # Gaps: 0/83 ( 0.0%) | |
110 # Score: 415.0 | |
111 # | |
112 #======================================= | |
113 | |
114 hg18_dna 1 TTTATGTCTATAATCCTTACCAAAAGTTACCTTGGAATAAGAAGAAGTCA 50 | |
115 |||||||||||||||||||||||||||||||||||||||||||||||||| | |
116 hg18_dna 1 TTTATGTCTATAATCCTTACCAAAAGTTACCTTGGAATAAGAAGAAGTCA 50 | |
117 | |
118 hg18_dna 51 GTAAAAAGAAGGCTGTTGTTCCGTGAAATACTG 83 | |
119 ||||||||||||||||||||||||||||||||| | |
120 hg18_dna 51 GTAAAAAGAAGGCTGTTGTTCCGTGAAATACTG 83 | |
121 | |
122 #--------------------------------------- | |
123 #--------------------------------------- | |
124 | |
125 | |
126 ------ | |
127 | |
128 **Citation** | |
129 | |
130 For the underlying tool, please cite `Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000 Jun;16(6):276-7. <http://www.ncbi.nlm.nih.gov/pubmed/10827456>`_ | |
131 | |
132 If you use this tool in Galaxy, please cite `Blankenberg D, Taylor J, Schenck I, He J, Zhang Y, Ghent M, Veeraraghavan N, Albert I, Miller W, Makova KD, Hardison RC, Nekrutenko A. A framework for collaborative analysis of ENCODE data: making large-scale analyses biologist-friendly. Genome Res. 2007 Jun;17(6):960-4. <http://www.ncbi.nlm.nih.gov/pubmed/17568012>`_ | |
133 </help> | |
134 </tool> |