annotate sam_header.xml @ 0:b2bc4e918462 draft

Uploaded
author wolma
date Thu, 14 Aug 2014 09:57:49 -0400
parents
children 7bce49512bad
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
1 <tool id="sam_header" name="NGS Run Annotation">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
2 <description>Create a SAM format header from run metadata for sample annotation.</description>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
3 <requirements>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
4 <requirement type="package" version="0.1.3">mimodd</requirement>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
5 </requirements>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
6 <command>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
7 mimodd header
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
8
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
9 --rg_id "$rg_id"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
10 --rg_sm "$rg_sm"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
11
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
12 #if $str($rg_cn):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
13 --rg_cn "$rg_cn"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
14 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
15 #if $str($rg_ds):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
16 --rg_ds "$rg_ds"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
17 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
18 #if $str($anno) and $str($month) and $str($day):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
19 --rg_dt "$anno-$month-$day"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
20 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
21 #if $str($rg_lb):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
22 --rg_lb "$rg_lb"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
23 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
24 #if $str($rg_pl):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
25 --rg_pl "$rg_pl"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
26 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
27 #if $str($rg_ds):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
28 --rg_pi "$rg_pi"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
29 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
30 #if $str($rg_pu):
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
31 --rg_pu "$rg_pu"
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
32 #end if
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
33
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
34 --outputfile $outputfile
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
35
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
36 </command>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
37
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
38 <inputs>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
39 <param name="rg_id" type="text" size="80" label="read-group ID (mandatory)">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
40 <sanitizer invalid_char="">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
41 <valid initial="string.printable">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
42 <remove value="&quot;" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
43 </valid>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
44 <mapping initial="none">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
45 <add source="&quot;" target="\&quot;"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
46 </mapping>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
47 </sanitizer>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
48 </param>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
49 <param name="rg_sm" type="text" size="80" label="sample name (mandatory)">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
50 <sanitizer invalid_char="">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
51 <valid initial="string.printable">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
52 <remove value="&quot;" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
53 </valid>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
54 <mapping initial="none">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
55 <add source="&quot;" target="\&quot;"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
56 </mapping>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
57 </sanitizer>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
58 </param>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
59 <param name="rg_cn" type="text" size="80" label="name of sequencing center">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
60 <sanitizer invalid_char="">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
61 <valid initial="string.printable">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
62 <remove value="&quot;" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
63 </valid>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
64 <mapping initial="none">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
65 <add source="&quot;" target="\&quot;"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
66 </mapping>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
67 </sanitizer>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
68 </param>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
69 <param name="rg_ds" type="text" size="80" label="description">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
70 <sanitizer invalid_char="">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
71 <valid initial="string.printable">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
72 <remove value="&quot;" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
73 </valid>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
74 <mapping initial="none">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
75 <add source="&quot;" target="\&quot;"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
76 </mapping>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
77 </sanitizer>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
78 </param>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
79 <param name="anno" type="text" label="year (YYYY) the run was produced" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
80 <param name="month" type="text" label="month (MM) the run was produced" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
81 <param name="day" type="text" label="day (DD) the run was produced" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
82 <param name="rg_lb" type="text" size="80" label="read-group library">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
83 <sanitizer invalid_char="">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
84 <valid initial="string.printable">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
85 <remove value="&quot;" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
86 </valid>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
87 <mapping initial="none">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
88 <add source="&quot;" target="\&quot;"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
89 </mapping>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
90 </sanitizer>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
91 </param>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
92 <param name="rg_pl" type="text" label="platform/technology used to produce the reads" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
93 <param name="rg_pi" type="text" label="predicted median insert size" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
94 <param name="rg_pu" type="text" size="80" label="platform unit; unique identifier">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
95 <sanitizer invalid_char="">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
96 <valid initial="string.printable">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
97 <remove value="&quot;" />
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
98 </valid>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
99 <mapping initial="none">
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
100 <add source="&quot;" target="\&quot;"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
101 </mapping>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
102 </sanitizer>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
103 </param>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
104 </inputs>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
105
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
106 <outputs>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
107 <data name="outputfile" format="sam" label="${rg_sm} (${rg_id}) header information from MiModd ${tool.name} on ${on_string}"/>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
108 </outputs>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
109
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
110 <help>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
111 .. class:: infomark
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
112
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
113 **What it does**
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
114
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
115 This tool takes the user-provided information about a next-generation sequencing run and constructs a valid header in the SAM file format from it.
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
116
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
117 The result file can be used by the tools *Convert* and *Reheader* or in the *SNAP Read Alignment* step to add run metadata to sequenced reads files (or to overwrite pre-existing information).
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
118
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
119 **Note:**
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
120
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
121 **MiModD requires run metadata for every input file at the Alignment step !**
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
122
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
123 **Tip:**
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
124
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
125 While you can do Alignments from fastq file format by providing a custom header file directly to the *SNAP Read Alignment* tool, the **recommended approach** is to first convert all input files to and archive all datasets in SAM/BAM format with appropriate header information prior to any downstream analysis. Although a bit more time-consuming this practice protects against information loss and ensures that the input datasets will remain useful for others in the future.
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
126
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
127 </help>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
128 </tool>
b2bc4e918462 Uploaded
wolma
parents:
diff changeset
129