annotate convert.xml @ 0:74993cc4af07 draft default tip

Uploaded
author wolma
date Tue, 09 Dec 2014 17:33:18 -0500
<tool id="convert" name="Convert">
  <description>between different sequence data formats</description>
  <requirements>
    <requirement type="package" version="0.1.5">mimodd</requirement>
  </requirements>
  <version_command>mimodd version -q</version_command>
  <command>
    mimodd convert

    #for $i in $mode.input_list
      ${i.file1}
      #if $str($mode.iformat) in ("fastq_pe", "gz_pe"):
        ${i.file2}
      #end if
    #end for
    #if $str($mode.header) != "None":
      --header $(mode.header)
    #end if
    --ofile $outputname
    --iformat $(mode.iformat)
    --oformat $(mode.oformat)
  </command>

  <inputs>
    <conditional name="mode">
      <param name="iformat" type="select" label="input file format" help="Your choice will update the interface to display further choices appropriate for your type of input data.">
        <option value="fastq">fastq: single-end (one file)</option>
        <option value="fastq_pe">fastq: paired-end (two files)</option>
        <option value="gz">gzip compressed fastq: single-end (one file)</option>
        <option value="gz_pe">gzip compressed fastq: paired-end (two files)</option>
        <option value="sam">sam</option>
        <option value="bam">bam</option>
      </param>
      <when value="fastq">
        <param name="oformat" type="select" label="output file format">
          <option value="sam">sam</option>
          <option value="bam">bam</option>
        </param>
        <repeat name="input_list" title="fastq input dataset" default="1" min="1">
          <param name="file1" format="fastq" type="data" label="inputfile"/>
        </repeat>
        <param name="header" type="data" format="sam" label="Use Header File" help="A SAM file with header information, as generated, for example, by the NGS Run Annotation Tool, that will be used to attach metainformation to the results file."/>
      </when>
      <when value="fastq_pe">
        <param name="oformat" type="select" label="output file format">
          <option value="sam">sam</option>
          <option value="bam">bam</option>
        </param>
        <repeat name="input_list" title="fastq input datasets" default="1" min="1">
          <param format="fastq" name="file1" type="data" label="inputfile with the first set of reads of paired-end data"/>
          <param format="fastq" name="file2" type="data" label="inputfile with the second set of reads of paired-end data"/>
        </repeat>
        <param name="header" type="data" format="sam" label="Use Header File" help="A SAM file with header information, as generated, for example, by the NGS Run Annotation Tool, that will be used to attach metainformation to the results file."/>
      </when>
      <when value="gz">
        <param name="oformat" type="select" label="output file format">
          <option value="sam">sam</option>
          <option value="bam">bam</option>
        </param>
        <repeat name="input_list" title="fastq.gz input dataset" default="1" min="1">
          <param name="file1" format="data" type="data" label="inputfile"/>
        </repeat>
        <param name="header" type="data" format="sam" label="Use Header File" help="A SAM file with header information, as generated, for example, by the NGS Run Annotation Tool, that will be used to attach metainformation to the results file."/>
      </when>
      <when value="gz_pe">
        <param name="oformat" type="select" label="output file format">
          <option value="sam">sam</option>
          <option value="bam">bam</option>
        </param>
        <repeat name="input_list" title="fastq.gz input datasets" default="1" min="1">
          <param format="data" name="file1" type="data" label="inputfile with the first set of reads of paired-end data"/>
          <param format="data" name="file2" type="data" label="inputfile with the second set of reads of paired-end data"/>
        </repeat>
        <param name="header" type="data" format="sam" label="Use Header File" help="A SAM file with header information, as generated, for example, by the NGS Run Annotation Tool, that will be used to attach metainformation to the results file."/>
      </when>
      <when value="sam">
        <param name="oformat" type="select" label="output file format">
          <option value="bam">bam</option>
        </param>
        <repeat name="input_list" title="sam input dataset" default="1" min="1" max="1">
          <param name="file1" format="sam" type="data" label="inputfile"/>
        </repeat>
        <param name="header" type="hidden" value="None"/>
      </when>
      <when value="bam">
        <param name="oformat" type="select" label="output file format">
          <option value="sam">sam</option>
        </param>
        <repeat name="input_list" title="bam input dataset" default="1" min="1" max="1">
          <param name="file1" format="bam" type="data" label="inputfile"/>
        </repeat>
        <param name="header" type="hidden" value="None"/>
      </when>
    </conditional>
  </inputs>

  <outputs>
    <data name="outputname" format="bam" label="Converted reads from MiModD ${tool.name} on ${on_string}">
      <change_format>
        <when input="mode.oformat" value="sam" format="sam" />
      </change_format>
    </data>
  </outputs>

  <help>
.. class:: infomark

**What it does**

The tool converts between different file formats used for storing next-generation sequencing data.

It accepts uncompressed or gzipped fastq, SAM, or BAM files as input and converts them to SAM or BAM format.

**Notes:**

1) In its standard configuration, Galaxy decompresses any .gz files during upload, so the option to convert gzipped fastq input is useful only with customized Galaxy instances or when using linked files, as explained in our `recipe for using gzipped fastq files in Galaxy`_ from the `MiModD user guide`_.

2) The tool can convert fastq files representing data from paired-end sequencing runs to appropriate SAM/BAM format, provided that the mate information is split over two fastq files in corresponding order.

**TIP:** If your paired-end data is arranged differently, you may look into the *fastq splitter* and *fastq de-interlacer* tools for Galaxy from the `Fastq Manipulation category`_ of the Galaxy Tool Shed to see if they can convert your files to the expected format.

3) Merging partial fastq (or gzipped fastq) files into a single SAM/BAM file is supported for both single-end and paired-end data. Simply add additional input datasets and select the appropriate files (pairs of files in the case of paired-end data).

Concatenation of SAM/BAM files during conversion is currently not supported.

4) For input in fastq format, a SAM header file providing run metadata **has to be specified**. The information in this file will be used as the header data of the new SAM/BAM file. You can use the *NGS Run Annotation* tool to generate a new header file for your data.

For input in SAM/BAM format, the tool will simply copy the existing header data to the new file. To modify the header of an existing SAM/BAM file, use the *Reheader BAM file* tool instead.
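
**Example:** As an illustrative sketch only, converting one pair of paired-end fastq files to BAM with a header file corresponds roughly to the command line below, assembled from the options this tool passes to ``mimodd convert``. The file names are placeholders assumed for illustration; Galaxy substitutes the actual dataset paths::

  mimodd convert reads_1.fastq reads_2.fastq --header header.sam --ofile converted.bam --iformat fastq_pe --oformat bam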

.. _Fastq Manipulation category: https://toolshed.g2.bx.psu.edu/repository/browse_repositories_in_category?id=310ff67d4caf6531
.. _recipe for using gzipped fastq files in Galaxy: http://mimodd.readthedocs.org/en/latest/recipes.html#use-gzipped-fastq-files-in-galaxy
.. _MiModD user guide: http://mimodd.readthedocs.org/en/latest

  </help>
</tool>