annotate edgeR_Differential_Gene_Expression.xml @ 89:875f080136b6 draft

Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
author yhoogstrate
date Wed, 28 Jan 2015 09:22:54 -0500
parents b2738b4d7c8c
children f87938c392bf
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
1 <?xml version="1.0" encoding="UTF-8"?>
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
2 <tool id="edger_dge" name="edgeR: Differential Gene(Expression) Analysis" version="3.0.3-latest.b">
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
3 <description>RNA-Seq gene expression analysis using edgeR (R package)</description>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
4
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
5 <requirements>
62
9384bc21272d Uploaded
yhoogstrate
parents: 60
diff changeset
6 <!--<requirement type="package" version="3.0.1">package_r3_withx</requirement>-->
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
7 <!--<requirement type="package" version="3.1.0">R</requirement>-->
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
8 <requirement type="package" version="3.0.3">R</requirement>
77
2ee2336d71cc Uploaded
yhoogstrate
parents: 75
diff changeset
9 <requirement type="package" version="latest">biocLite_edgeR_limma</requirement>
72
e75afef75c13 Uploaded
yhoogstrate
parents: 70
diff changeset
10 <requirement type="package" version="1.3.18">graphicsmagick</requirement>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
11 </requirements>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
12
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
13 <version_command>R --vanilla --slave -e "library(edgeR) ; cat(sessionInfo()\$otherPkgs\$edgeR\$Version)" 2&gt; /dev/null</version_command>
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
14
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
15 <command>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
16 <!--
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
17 The following script is written in the "Cheetah" language:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
18 http://www.cheetahtemplate.org/docs/users_guide_html_multipage/contents.html
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
19 -->
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
20
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
21 R --vanilla --slave -f $R_script '--args
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
22 $expression_matrix
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
23 $design_matrix
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
24 $contrast
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
26 $fdr
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
27
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
28 $output_count_edgeR
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
29 $output_cpm
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
30
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
31 /dev/null <!-- Calculation of FPKM/RPKM should come here -->
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
32
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
33 #if $output_raw_counts:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
34 $output_raw_counts
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
35 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
36 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
37 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
38
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
39 #if $output_MDSplot_logFC:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
40 $output_MDSplot_logFC
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
41 #else:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
42 /dev/null
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
43 #end if
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
44
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
45 #if $output_MDSplot_bcv:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
46 $output_MDSplot_bcv
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
47 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
48 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
49 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
50
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
51 #if $output_BCVplot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
52 $output_BCVplot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
53 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
54 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
55 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
56
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
57 #if $output_MAplot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
58 $output_MAplot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
59 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
60 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
61 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
62
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
63 #if $output_PValue_distribution_plot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
64 $output_PValue_distribution_plot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
65 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
66 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
67 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
68
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
69 #if $output_hierarchical_clustering_plot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
70 $output_hierarchical_clustering_plot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
71 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
72 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
73 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
74
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
75 #if $output_heatmap_plot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
76 $output_heatmap_plot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
77 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
78 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
79 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
80
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
81 #if $output_RData_obj:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
82 $output_RData_obj
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
83 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
84 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
85 #end if
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
86
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
87 $output_format_images
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
88 '
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
89 #if $output_R:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
90 > $output_R
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
91 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
92 > /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
93 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
94
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
95 2> stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
96
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
97 #if $output_format_images.value == "png":
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
98 echo "Converting PDF figures to PNG" ;
69
1dc20e3e11df Uploaded
yhoogstrate
parents: 67
diff changeset
99
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
100 #if $output_MDSplot_logFC:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
101 #set $output_MDSplot_logFC_tmp = str($output_MDSplot_logFC)+".png"
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
102
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
103 gm convert $output_MDSplot_logFC $output_MDSplot_logFC_tmp ;
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
104 mv $output_MDSplot_logFC_tmp $output_MDSplot_logFC ;
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
105 #end if
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
106
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
107 #if $output_MDSplot_bcv:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
108 #set $output_MDSplot_bcv_tmp = str($output_MDSplot_bcv)+".png"
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
109
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
110 gm convert $output_MDSplot_bcv $output_MDSplot_bcv_tmp ;
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
111 mv $output_MDSplot_bcv_tmp $output_MDSplot_bcv ;
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
112 #end if
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
113
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
114 #if $output_BCVplot:
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
115 #set $output_BCVplot_tmp = str($output_BCVplot)+".png"
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
116
72
e75afef75c13 Uploaded
yhoogstrate
parents: 70
diff changeset
117 gm convert $output_BCVplot $output_BCVplot_tmp ;
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
118 mv $output_BCVplot_tmp $output_BCVplot ;
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
119 #end if
69
1dc20e3e11df Uploaded
yhoogstrate
parents: 67
diff changeset
120
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
121 #if $output_MAplot:
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
122 #set $output_MAplot_tmp = str($output_MAplot)+".png"
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
123
72
e75afef75c13 Uploaded
yhoogstrate
parents: 70
diff changeset
124 gm convert $output_MAplot $output_MAplot_tmp ;
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
125 mv $output_MAplot_tmp $output_MAplot ;
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
126 #end if
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
127
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
128 #if $output_PValue_distribution_plot:
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
129 #set $output_PValue_distribution_plot_tmp = str($output_PValue_distribution_plot)+".png"
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
130
72
e75afef75c13 Uploaded
yhoogstrate
parents: 70
diff changeset
131 gm convert $output_PValue_distribution_plot $output_PValue_distribution_plot_tmp ;
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
132 mv $output_PValue_distribution_plot_tmp $output_PValue_distribution_plot ;
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
133 #end if
69
1dc20e3e11df Uploaded
yhoogstrate
parents: 67
diff changeset
134
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
135 #if $output_hierarchical_clustering_plot:
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
136 #set $output_hierarchical_clustering_plot_tmp = str($output_hierarchical_clustering_plot)+".png"
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
137
72
e75afef75c13 Uploaded
yhoogstrate
parents: 70
diff changeset
138 gm convert $output_hierarchical_clustering_plot $output_hierarchical_clustering_plot_tmp ;
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
139 mv $output_hierarchical_clustering_plot_tmp $output_hierarchical_clustering_plot ;
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
140 #end if
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
141
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
142 #if $output_heatmap_plot:
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
143 #set $output_heatmap_plot_tmp = str($output_heatmap_plot)+".png"
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
144
72
e75afef75c13 Uploaded
yhoogstrate
parents: 70
diff changeset
145 gm convert $output_heatmap_plot $output_heatmap_plot_tmp ;
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
146 mv $output_heatmap_plot_tmp $output_heatmap_plot ;
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
147 #end if
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
148 #end if
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
149
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
150 grep -v 'Calculating library sizes from column' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
151
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
152 ## Locale error messages:
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
153 grep -v 'During startup - Warning messages' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
154 grep -v 'Setting LC_TIME failed' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
155 grep -v 'Setting LC_MONETARY failed' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
156 grep -v 'Setting LC_PAPER failed' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
157 grep -v 'Setting LC_MEASUREMENT failed' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
158 grep -v 'Setting LC_CTYPE failed' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
159 grep -v 'Setting LC_COLLATE failed' stderr.txt > stderr2.txt ; rm stderr.txt ; mv stderr2.txt stderr.txt ;
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
160
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
161 cat stderr.txt >&amp;2
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
162 </command>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
163
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
164 <inputs>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
165 <param name="expression_matrix" type="data" format="tabular" label="Expression (read count) matrix" />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
166 <param name="design_matrix" type="data" format="tabular" label="Design matrix" hepl="Ensure your samplenames are identical to those in the expression matrix. Preferentially, create the contrast matrix using 'edgeR: Design- from Expression matrix'." />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
167
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
168 <param name="contrast" type="text" label="Contrast (biological question)" help="e.g. 'tumor-normal' or '(G1+G2)/2-G3' using the factors chosen in the design matrix. Read the 'makeContrasts' manual from Limma package for more info: http://www.bioconductor.org/packages/release/bioc/html/limma.html and http://www.bioconductor.org/packages/release/bioc/vignettes/limma/inst/doc/usersguide.pdf." />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
169
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
170 <param name="fdr" type="float" min="0" max="1" value="0.05" label="False Discovery Rate (FDR)" />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
171
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
172 <param name="outputs" type="select" label="Optional desired outputs" multiple="true" display="checkboxes">
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
173 <option value="make_output_raw_counts">Raw counts table</option>
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
174 <option value="make_output_MDSplot_logFC">MDS-plot (logFC-method)</option>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
175 <option value="make_output_MDSplot_bcv">MDS-plot (BCV-method; much slower)</option>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
176 <option value="make_output_BCVplot">BCV-plot</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
177 <option value="make_output_MAplot">MA-plot</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
178 <option value="make_output_PValue_distribution_plot">P-Value distribution plot</option>
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
179 <option value="make_output_hierarchical_clustering_plot">Hierarchical custering (under contstruction)</option>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
180 <option value="make_output_heatmap_plot">Heatmap</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
181
43
c6e787bb605c Uploaded
yhoogstrate
parents: 42
diff changeset
182 <option value="make_output_R_stdout">R stdout</option>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
183 <option value="make_output_RData_obj">R Data object</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
184 </param>
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
185
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
186 <param name="output_format_images" type="select" label="Output format of images" display="radio">
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
187 <option value="png">Portable network graphics (.png)</option>
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
188 <option value="pdf">Portable document format (.pdf)</option>
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
189 <option value="svg">Scalable vector graphics (.svg)</option>
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
190 </param>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
191 </inputs>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
192
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
193 <configfiles>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
194 <configfile name="R_script">
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
195 library(limma,quietly=TRUE) ## enable quietly to avoid unnecessaity stderr dumping
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
196 library(edgeR,quietly=TRUE) ## enable quietly to avoid unnecessaity stderr dumping
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
197 library(splines,quietly=TRUE) ## enable quietly to avoid unnecessaity stderr dumping
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
198
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
199 ## Fetch commandline arguments
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
200 args &lt;- commandArgs(trailingOnly = TRUE)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
201
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
202 expression_matrix_file = args[1]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
203 design_matrix_file = args[2]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
204 contrast = args[3]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
205
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
206 fdr = args[4]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
207
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
208 output_count_edgeR = args[5]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
209 output_cpm = args[6]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
210
43
c6e787bb605c Uploaded
yhoogstrate
parents: 42
diff changeset
211 output_xpkm = args[7] ##FPKM file - yet to be implemented
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
212
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
213 output_raw_counts = args[8]
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
214 output_MDSplot_logFC = args[9]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
215 output_MDSplot_bcv = args[10]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
216 output_BCVplot = args[11]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
217 output_MAplot = args[12]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
218 output_PValue_distribution_plot = args[13]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
219 output_hierarchical_clustering_plot = args[14]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
220 output_heatmap_plot = args[15]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
221 output_RData_obj = args[16]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
222 output_format_images = args[17]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
223
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
224
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
225 library(edgeR)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
226 ##raw_data &lt;- read.delim(designmatrix,header=T,stringsAsFactors=T)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
227 ## Obtain read-counts
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
228
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
229 expression_matrix &lt;- read.delim(expression_matrix_file,header=T,stringsAsFactors=F,row.names=1,check.names=FALSE,na.strings=c(""))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
230 design_matrix &lt;- read.delim(design_matrix_file,header=T,stringsAsFactors=F,row.names=1,check.names=FALSE,na.strings=c(""))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
231
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
232 colnames(design_matrix) &lt;- make.names(colnames(design_matrix))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
233
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
234 for(i in 1:ncol(design_matrix)) {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
235 old &lt;- design_matrix[,i]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
236 design_matrix[,i] &lt;- make.names(design_matrix[,i])
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
237 if(paste(design_matrix[,i],collapse="\t") != paste(old,collapse="\t")) {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
238 print("Renaming of factors:")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
239 print(old)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
240 print("To:")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
241 print(design_matrix[,i])
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
242 }
45
f710e5ed7cea Uploaded
yhoogstrate
parents: 44
diff changeset
243 ## The following line seems to malfunction the script:
f710e5ed7cea Uploaded
yhoogstrate
parents: 44
diff changeset
244 ##design_matrix[,i] &lt;- as.factor(design_matrix[,i])
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
245 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
246
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
247 ## 1) In the expression matrix, you only want to have the samples described in the design matrix
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
248 columns &lt;- match(rownames(design_matrix),colnames(expression_matrix))
43
c6e787bb605c Uploaded
yhoogstrate
parents: 42
diff changeset
249 columns &lt;- columns[!is.na(columns)]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
250 read_counts &lt;- expression_matrix[,columns]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
251
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
252 ## 2) In the design matrix, you only want to have samples of which you really have the counts
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
253 columns &lt;- match(colnames(read_counts),rownames(design_matrix))
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
254 columns &lt;- columns[!is.na(columns)]
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
255 design_matrix &lt;- design_matrix[columns,,drop=FALSE]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
256
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
257 ## Filter for HTSeq predifined counts:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
258 exclude_HTSeq &lt;- c("no_feature","ambiguous","too_low_aQual","not_aligned","alignment_not_unique")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
259 exclude_DEXSeq &lt;- c("_ambiguous","_empty","_lowaqual","_notaligned")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
260
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
261 exclude &lt;- match(c(exclude_HTSeq, exclude_DEXSeq),rownames(read_counts))
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
262 exclude &lt;- exclude[is.na(exclude)==0]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
263 if(length(exclude) != 0) {
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
264 read_counts &lt;- read_counts[-exclude,]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
265 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
266
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
267
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
268 ## sorting expression matrix with the order of the read_counts
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
269 ##order &lt;- match(colnames(read_counts) , rownames(design_matrix))
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
270 ##read_counts_ordered &lt;- read_counts[,order2]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
271
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
272 empty_samples &lt;- apply(read_counts,2,function(x) sum(x) == 0)
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
273 if(sum(empty_samples) > 0) {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
274 write(paste("There are ",sum(empty_samples)," empty samples found:",sep=""),stderr())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
275 write(colnames(read_counts)[empty_samples],stderr())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
276 } else {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
277
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
278 dge &lt;- DGEList(counts=read_counts,genes=rownames(read_counts))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
279
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
280 formula &lt;- paste(c("~0",make.names(colnames(design_matrix))),collapse = " + ")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
281 design_matrix_tmp &lt;- design_matrix
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
282 colnames(design_matrix_tmp) &lt;- make.names(colnames(design_matrix_tmp))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
283 design &lt;- model.matrix(as.formula(formula),design_matrix_tmp)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
284 rm(design_matrix_tmp)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
285
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
286 # Filter prefixes
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
287 prefixes = colnames(design_matrix)[attr(design,"assign")]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
288 avoid = nchar(prefixes) == nchar(colnames(design))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
289 replacements = substr(colnames(design),nchar(prefixes)+1,nchar(colnames(design)))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
290 replacements[avoid] = colnames(design)[avoid]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
291 colnames(design) = replacements
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
292
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
293 # Do normalization
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
294 write("Calculating normalization factors...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
295 dge &lt;- calcNormFactors(dge)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
296 write("Estimating common dispersion...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
297 dge &lt;- estimateGLMCommonDisp(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
298 write("Estimating trended dispersion...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
299 dge &lt;- estimateGLMTrendedDisp(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
300 write("Estimating tagwise dispersion...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
301 dge &lt;- estimateGLMTagwiseDisp(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
302
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
303
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
304 if(output_MDSplot_logFC != "/dev/null") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
305 write("Creating MDS plot (logFC method)",stdout())
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
306 points &lt;- plotMDS.DGEList(dge,top=500,labels=rep("",nrow(dge\$samples)))# Get coordinates of unflexible plot
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
307 dev.off()# Kill it
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
308
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
309 if(output_format_images == "pdf" || output_format_images == "png") {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
310 pdf(output_MDSplot_logFC,height=14,width=14)
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
311 } else if(output_format_images == "svg") {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
312 svg(output_MDSplot_logFC,height=14,width=14)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
313 }
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
314 ## else {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
315 ## png(output_MDSplot_logFC)
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
316 ##}
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
317 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
318
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
319 diff_x &lt;- abs(max(points\$x)-min(points\$x))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
320 diff_y &lt;-(max(points\$y)-min(points\$y))
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
321 plot(c(min(points\$x),max(points\$x) + 0.45 * diff_x), c(min(points\$y) - 0.05 * diff_y,max(points\$y) + 0.05 * diff_y), main="edgeR logFC-MDS Plot on top 500 genes",type="n", xlab="Leading logFC dim 1", ylab="Leading logFC dim 2")
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
322 points(points\$x,points\$y,pch=20)
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
323 text(points\$x, points\$y,rownames(dge\$samples),cex=1.25,col="gray",pos=4)
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
324 rm(diff_x,diff_y)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
325
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
326 dev.off()
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
327 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
328
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
329 if(output_MDSplot_bcv != "/dev/null") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
330 write("Creating MDS plot (bcv method)",stdout())
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
331 pdf("/home/youri/Desktop/bcvmds.pdf")
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
332 points &lt;- plotMDS.DGEList(dge,method="bcv",top=500,labels=rep("",nrow(dge\$samples)))# Get coordinates of unflexible plot
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
333 dev.off()# Kill it
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
334
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
335 if(output_format_images == "pdf" || output_format_images == "png") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
336 pdf(output_MDSplot_bcv,height=14,width=14)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
337 } else if(output_format_images == "svg") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
338 svg(output_MDSplot_bcv,height=14,width=14)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
339 }
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
340 ## else {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
341 ## png(output_MDSplot_bcv)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
342 ##}
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
343 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
344
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
345 diff_x &lt;- abs(max(points\$x)-min(points\$x))
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
346 diff_y &lt;-(max(points\$y)-min(points\$y))
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
347 plot(c(min(points\$x),max(points\$x) + 0.45 * diff_x), c(min(points\$y) - 0.05 * diff_y,max(points\$y) + 0.05 * diff_y), main="edgeR BCV-MDS Plot",type="n", xlab="Leading BCV dim 1", ylab="Leading BCV dim 2")
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
348 points(points\$x,points\$y,pch=20)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
349 text(points\$x, points\$y,rownames(dge\$samples),cex=1.25,col="gray",pos=4)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
350 rm(diff_x,diff_y)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
351
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
352 dev.off()
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
353 }
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
354
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
355
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
356 if(output_BCVplot != "/dev/null") {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
357 write("Creating Biological coefficient of variation plot",stdout())
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
358
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
359 if(output_format_images == "pdf" || output_format_images == "png") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
360 pdf(output_BCVplot)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
361 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
362 svg(output_BCVplot)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
363 }
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
364 ##else {
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
365 ## png(output_BCVplot)
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
366 ##}
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
367
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
368 plotBCV(dge, cex=0.4, main="edgeR: Biological coefficient of variation (BCV) vs abundance")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
369 dev.off()
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
370 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
371
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
372
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
373 write("Fitting GLM...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
374 fit &lt;- glmFit(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
375
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
376 write(paste("Performing likelihood ratio test: ",contrast,sep=""),stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
377 cont &lt;- c(contrast)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
378 cont &lt;- makeContrasts(contrasts=cont, levels=design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
379
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
380 lrt &lt;- glmLRT(fit, contrast=cont[,1])
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
381 write(paste("Exporting to file: ",output_count_edgeR,sep=""),stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
382 write.table(file=output_count_edgeR,topTags(lrt,n=nrow(read_counts))\$table,sep="\t",row.names=TRUE,col.names=NA)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
383 write.table(file=output_cpm,cpm(dge,normalized.lib.sizes=TRUE),sep="\t",row.names=TRUE,col.names=NA)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
384
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
385 ## todo EXPORT FPKM
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
386 write.table(file=output_raw_counts,dge\$counts,sep="\t",row.names=TRUE,col.names=NA)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
387
34
a36a9af266e5 Uploaded
yhoogstrate
parents: 33
diff changeset
388 if(output_MAplot != "/dev/null" || output_PValue_distribution_plot != "/dev/null") {
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
389 etable &lt;- topTags(lrt, n=nrow(dge))\$table
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
390 etable &lt;- etable[order(etable\$FDR), ]
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
391
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
392 if(output_MAplot != "/dev/null") {
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
393 write("Creating MA plot...",stdout())
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
394
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
395 if(output_format_images == "pdf" || output_format_images == "png") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
396 pdf(output_MAplot)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
397 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
398 svg(output_MAplot)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
399 }
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
400 ##else {
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
401 ## png(output_MAplot)
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
402 ##}
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
403
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
404 with(etable, plot(logCPM, logFC, pch=20, main="edgeR: Fold change vs abundance"))
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
405 with(subset(etable, FDR &lt; fdr), points(logCPM, logFC, pch=20, col="red"))
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
406 abline(h=c(-1,1), col="blue")
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
407 dev.off()
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
408 }
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
409
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
410 if(output_PValue_distribution_plot != "/dev/null") {
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
411 write("Creating P-value distribution plot...",stdout())
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
412
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
413 if(output_format_images == "pdf" || output_format_images == "png") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
414 pdf(output_PValue_distribution_plot)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
415 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
416 svg(output_PValue_distribution_plot)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
417 }
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
418 ##else {
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
419 ## png(output_PValue_distribution_plot)
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
420 ##}
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
421
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
422 expressed_genes &lt;- subset(etable, PValue &lt; 0.99)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
423 h &lt;- hist(expressed_genes\$PValue,breaks=nrow(expressed_genes)/15,main="Binned P-Values (&lt; 0.99)")
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
424 center &lt;- sum(h\$counts) / length(h\$counts)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
425 lines(c(0,1),c(center,center),lty=2,col="red",lwd=2)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
426 k &lt;- ksmooth(h\$mid, h\$counts)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
427 lines(k\$x,k\$y,col="red",lwd=2)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
428 rmsd &lt;- (h\$counts) - center
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
429 rmsd &lt;- rmsd^2
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
430 rmsd &lt;- sum(rmsd)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
431 rmsd &lt;- sqrt(rmsd)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
432 text(0,max(h\$counts),paste("e=",round(rmsd,2),sep=""),pos=4,col="blue")
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
433 ## change e into epsilon somehow
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
434 dev.off()
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
435 }
40
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
436 }
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
437
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
438 if(output_heatmap_plot != "/dev/null") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
439
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
440 if(output_format_images == "pdf" || output_format_images == "png") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
441 pdf(output_heatmap_plot,width=10.5)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
442 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
443 svg(output_heatmap_plot,width=10.5)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
444 }
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
445 ## else {
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
446 ## png(output_heatmap_plot,width=10.5)
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
447 ##}
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
448
40
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
449 etable2 &lt;- topTags(lrt, n=100)\$table
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
450 order &lt;- rownames(etable2)
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
451 cpm_sub &lt;- cpm(dge,normalized.lib.sizes=TRUE,log=TRUE)[as.numeric(order),]
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
452 heatmap(t(cpm_sub))
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
453 dev.off()
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
454 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
455
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
456 ##output_hierarchical_clustering_plot = args[13]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
457
35
fab87cecb3e5 Uploaded
yhoogstrate
parents: 34
diff changeset
458 if(output_RData_obj != "/dev/null") {
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
459 save.image(output_RData_obj)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
460 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
461
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
462 write("Done!",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
463 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
464 </configfile>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
465 </configfiles>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
466
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
467 <outputs>
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
468 <data format="tabular" name="output_count_edgeR" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - differentially expressed genes" />
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
469 <data format="tabular" name="output_cpm" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - CPM" />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
470
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
471 <data format="tabular" name="output_raw_counts" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - raw counts">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
472 <filter>outputs and ("make_output_raw_counts" in outputs)</filter>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
473 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
474
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
475 <data format="png" name="output_MDSplot_logFC" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - MDS-plot (logFC method)">
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
476 <filter>outputs and ("make_output_MDSplot_logFC" in outputs)</filter>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
477
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
478 <change_format>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
479 <when input="output_format_images" value="png" format="png" />
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
480 <when input="output_format_images" value="pdf" format="pdf" />
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
481 <when input="output_format_images" value="svg" format="svg" />
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
482 </change_format>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
483 </data>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
484
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
485 <data format="png" name="output_MDSplot_bcv" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - MDS-plot (bcv method)">
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
486 <filter>outputs and ("make_output_MDSplot_bcv" in outputs)</filter>
59
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
487
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
488 <change_format>
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
489 <when input="output_format_images" value="png" format="png" />
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
490 <when input="output_format_images" value="pdf" format="pdf" />
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
491 <when input="output_format_images" value="svg" format="svg" />
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
492 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
493 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
494
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
495 <data format="png" name="output_BCVplot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - BCV-plot">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
496 <filter>outputs and ("make_output_BCVplot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
497
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
498 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
499 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
500 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
501 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
502 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
503 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
504
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
505 <data format="png" name="output_MAplot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - MA-plot">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
506 <filter>outputs and ("make_output_MAplot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
507
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
508 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
509 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
510 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
511 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
512 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
513 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
514
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
515 <data format="png" name="output_PValue_distribution_plot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - P-Value distribution">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
516 <filter>outputs and ("make_output_PValue_distribution_plot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
517
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
518 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
519 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
520 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
521 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
522 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
523 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
524
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
525 <data format="png" name="output_hierarchical_clustering_plot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - Hierarchical custering">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
526 <filter>outputs and ("make_output_hierarchical_clustering_plot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
527
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
528 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
529 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
530 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
531 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
532 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
533 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
534
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
535 <data format="png" name="output_heatmap_plot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - Heatmap">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
536 <filter>outputs and ("make_output_heatmap_plot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
537
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
538 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
539 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
540 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
541 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
542 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
543 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
544
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
545 <data format="RData" name="output_RData_obj" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - R data object">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
546 <filter>outputs and ("make_output_RData_obj" in outputs)</filter>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
547 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
548
40
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
549 <data format="txt" name="output_R" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - R output (debug)" >
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
550 <filter>outputs and ("make_output_R_stdout" in outputs)</filter>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
551 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
552 </outputs>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
553
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
554 <help>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
555 edgeR: Differential Gene(Expression) Analysis
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
556 #############################################
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
557
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
558 Overview
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
559 --------
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
560 Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data [1].
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
561
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
562 For every experiment, the algorithm requires a design matrix. This matrix describes which samples belong to which groups.
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
563 More details on this are given in the edgeR manual: http://www.bioconductor.org/packages/2.12/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
564 and the limma manual.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
565
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
566 Because the creation of a design matrix can be complex and time consuming, especially if no GUI is used, this package comes with an alternative tool which can help you with it.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
567 This tool is called *edgeR Design Matrix Creator*.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
568 If the appropriate design matrix (with corresponding links to the files) is given,
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
569 the correct contrast ( http://en.wikipedia.org/wiki/Contrast_(statistics) ) has to be given.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
570
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
571 If you have for example two groups, with an equal weight, you would like to compare either
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
572 "g1-g2" or "normal-cancer".
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
573
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
574 The test function makes use of a MCF7 dataset used in a study that indicates that a higher sequencing depth is not neccesairily more important than a higher amount of replaciates[2].
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
575
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
576 Input
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
577 -----
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
578 Expression matrix
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
579 ^^^^^^^^^^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
580 ::
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
581
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
582 Geneid "\t" Sample-1 "\t" Sample-2 "\t" Sample-3 "\t" Sample-4 [...] "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
583 SMURF "\t" 123 "\t" 21 "\t" 34545 "\t" 98 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
584 BRCA1 "\t" 435 "\t" 6655 "\t" 45 "\t" 55 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
585 LINK33 "\t" 4 "\t" 645 "\t" 345 "\t" 1 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
586 SNORD78 "\t" 498 "\t" 65 "\t" 98 "\t" 27 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
587 [...]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
588
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
589 *Note: Make sure the number of columns in the header is identical to the number of columns in the body.*
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
590
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
591 Design matrix
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
592 ^^^^^^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
593 ::
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
594
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
595 Sample "\t" Condition "\t" Ethnicity "\t" Patient "\t" Batch "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
596 Sample-1 "\t" Tumor "\t" European "\t" 1 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
597 Sample-2 "\t" Normal "\t" European "\t" 1 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
598 Sample-3 "\t" Tumor "\t" European "\t" 2 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
599 Sample-4 "\t" Normal "\t" European "\t" 2 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
600 Sample-5 "\t" Tumor "\t" African "\t" 3 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
601 Sample-6 "\t" Normal "\t" African "\t" 3 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
602 Sample-7 "\t" Tumor "\t" African "\t" 4 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
603 Sample-8 "\t" Normal "\t" African "\t" 4 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
604 Sample-9 "\t" Tumor "\t" Asian "\t" 5 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
605 Sample-10 "\t" Normal "\t" Asian "\t" 5 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
606 Sample-11 "\t" Tumor "\t" Asian "\t" 6 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
607 Sample-12 "\t" Normal "\t" Asian "\t" 6 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
608
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
609 *Note: Avoid factor names that are (1) numerical, (2) contain mathematical symbols and preferebly only use letters.*
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
610
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
611 Contrast
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
612 ^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
613 The contrast represents the biological question. There can be many questions asked, e.g.:
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
614
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
615 - Tumor-Normal
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
616 - African-European
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
617 - 0.5*(Control+Placebo) / Treated
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
618
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
619 Installation
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
620 ------------
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
621
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
622 This tool requires no specific configurations. The following dependencies are installed automatically:
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
623
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
624 - R
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
625 - Bioconductor
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
626 - limma
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
627 - edgeR
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
628
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
629 License
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
630 -------
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
631 - R
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
632 - GPL 2 &amp; GPL 3
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
633 - limma
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
634 - GPL (&gt;=2)
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
635 - edgeR
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
636 - GPL (&gt;=2)
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
637
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
638 References
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
639 ----------
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
640
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
641 EdgeR
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
642 ^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
643 **[1] edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.**
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
644
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
645 *Mark D. Robinson, Davis J. McCarthy and Gordon K. Smyth* - Bioinformatics (2010) 26 (1): 139-140.
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
646
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
647 - http://www.bioconductor.org/packages/2.12/bioc/html/edgeR.html
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
648 - http://dx.doi.org/10.1093/bioinformatics/btp616
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
649 - http://www.bioconductor.org/packages/release/bioc/html/edgeR.html
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
650
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
651 Test-data (MCF7)
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
652 ^^^^^^^^^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
653 **[2] RNA-seq differential expression studies: more sequence or more replication?**
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
654
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
655 *Yuwen Liu, Jie Zhou and Kevin P. White* - Bioinformatics (2014) 30 (3): 301-304.
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
656
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
657 - http://www.ncbi.nlm.nih.gov/pubmed/24319002
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
658 - http://dx.doi.org/10.1093/bioinformatics/btt688
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
659
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
660 Contact
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
661 -------
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
662
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
663 The tool wrapper has been written by Youri Hoogstrate from the Erasmus
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
664 Medical Center (Rotterdam, Netherlands) on behalf of the Translational
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
665 Research IT (TraIT) project:
83
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
666
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
667 http://www.ctmm.nl/en/programmas/infrastructuren/traitprojecttranslationeleresearch
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
668
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
669 More tools by the Translational Research IT (TraIT) project can be found
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
670 in the following toolsheds:
83
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
671
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
672 http://toolshed.dtls.nl/
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
673
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
674 http://toolshed.g2.bx.psu.edu
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
675
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
676 http://testtoolshed.g2.bx.psu.edu/
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
677
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
678 I would like to thank Hina Riaz - Naz Khan for her helpful contribution.
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
679 </help>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
680 </tool>