annotate sam_header.xml @ 0:7da2c9654a83 draft default tip

Uploaded
author wolma
date Tue, 12 Aug 2014 11:26:15 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
1 <tool id="sam_header" name="NGS Run Annotation">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
2 <description>Create a SAM format header from run metadata for sample annotation.</description>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
3 <requirements>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
4 <requirement type="package" version="3.4.1">python3</requirement>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
5 <requirement type="package" version="0.1.3_9af04e0e9125">MiModD</requirement>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
6 </requirements>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
7 <command>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
8 mimodd header
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
9
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
10 --rg_id "$rg_id"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
11 --rg_sm "$rg_sm"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
12
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
13 #if $str($rg_cn):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
14 --rg_cn "$rg_cn"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
15 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
16 #if $str($rg_ds):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
17 --rg_ds "$rg_ds"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
18 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
19 #if $str($anno) and $str($month) and $str($day):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
20 --rg_dt "$anno-$month-$day"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
21 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
22 #if $str($rg_lb):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
23 --rg_lb "$rg_lb"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
24 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
25 #if $str($rg_pl):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
26 --rg_pl "$rg_pl"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
27 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
28 #if $str($rg_ds):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
29 --rg_pi "$rg_pi"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
30 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
31 #if $str($rg_pu):
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
32 --rg_pu "$rg_pu"
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
33 #end if
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
34
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
35 --outputfile $outputfile
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
36
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
37 </command>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
38
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
39 <inputs>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
40 <param name="rg_id" type="text" size="80" label="read-group ID (mandatory)">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
41 <sanitizer invalid_char="">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
42 <valid initial="string.printable">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
43 <remove value="&quot;" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
44 </valid>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
45 <mapping initial="none">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
46 <add source="&quot;" target="\&quot;"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
47 </mapping>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
48 </sanitizer>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
49 </param>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
50 <param name="rg_sm" type="text" size="80" label="sample name (mandatory)">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
51 <sanitizer invalid_char="">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
52 <valid initial="string.printable">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
53 <remove value="&quot;" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
54 </valid>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
55 <mapping initial="none">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
56 <add source="&quot;" target="\&quot;"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
57 </mapping>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
58 </sanitizer>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
59 </param>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
60 <param name="rg_cn" type="text" size="80" label="name of sequencing center">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
61 <sanitizer invalid_char="">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
62 <valid initial="string.printable">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
63 <remove value="&quot;" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
64 </valid>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
65 <mapping initial="none">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
66 <add source="&quot;" target="\&quot;"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
67 </mapping>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
68 </sanitizer>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
69 </param>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
70 <param name="rg_ds" type="text" size="80" label="description">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
71 <sanitizer invalid_char="">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
72 <valid initial="string.printable">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
73 <remove value="&quot;" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
74 </valid>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
75 <mapping initial="none">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
76 <add source="&quot;" target="\&quot;"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
77 </mapping>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
78 </sanitizer>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
79 </param>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
80 <param name="anno" type="text" label="year (YYYY) the run was produced" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
81 <param name="month" type="text" label="month (MM) the run was produced" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
82 <param name="day" type="text" label="day (DD) the run was produced" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
83 <param name="rg_lb" type="text" size="80" label="read-group library">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
84 <sanitizer invalid_char="">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
85 <valid initial="string.printable">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
86 <remove value="&quot;" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
87 </valid>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
88 <mapping initial="none">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
89 <add source="&quot;" target="\&quot;"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
90 </mapping>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
91 </sanitizer>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
92 </param>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
93 <param name="rg_pl" type="text" label="platform/technology used to produce the reads" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
94 <param name="rg_pi" type="text" label="predicted median insert size" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
95 <param name="rg_pu" type="text" size="80" label="platform unit; unique identifier">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
96 <sanitizer invalid_char="">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
97 <valid initial="string.printable">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
98 <remove value="&quot;" />
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
99 </valid>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
100 <mapping initial="none">
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
101 <add source="&quot;" target="\&quot;"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
102 </mapping>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
103 </sanitizer>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
104 </param>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
105 </inputs>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
106
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
107 <outputs>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
108 <data name="outputfile" format="sam" label="${rg_sm} (${rg_id}) header information from MiModd ${tool.name} on ${on_string}"/>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
109 </outputs>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
110
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
111 <help>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
112 .. class:: infomark
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
113
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
114 **What it does**
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
115
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
116 This tool takes the user-provided information about a next-generation sequencing run and constructs a valid header in the SAM file format from it.
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
117
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
118 The result file can be used by the tools *Convert* and *Reheader* or in the *SNAP Read Alignment* step to add run metadata to sequenced reads files (or to overwrite pre-existing information).
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
119
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
120 **Note:**
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
121
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
122 **MiModD requires run metadata for every input file at the Alignment step !**
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
123
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
124 **Tip:**
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
125
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
126 While you can do Alignments from fastq file format by providing a custom header file directly to the *SNAP Read Alignment* tool, the **recommended approach** is to first convert all input files to and archive all datasets in SAM/BAM format with appropriate header information prior to any downstream analysis. Although a bit more time-consuming this practice protects against information loss and ensures that the input datasets will remain useful for others in the future.
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
127
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
128 </help>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
129 </tool>
7da2c9654a83 Uploaded
wolma
parents:
diff changeset
130