annotate edgeR_Differential_Gene_Expression.xml @ 95:9dac2146b98c draft

Uploaded
author yhoogstrate
date Sun, 29 Mar 2015 02:41:02 -0400
parents 46745f5666ac
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
1 <?xml version="1.0" encoding="UTF-8"?>
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
2 <tool id="edger_dge" name="edgeR: Differential Gene(Expression) Analysis" version="3.0.3-latest.d">
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
3 <description>RNA-Seq gene expression analysis using edgeR (R package)</description>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
4
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
5 <requirements>
67
6eb298a5561f Uploaded
yhoogstrate
parents: 65
diff changeset
6 <requirement type="package" version="3.0.3">R</requirement>
77
2ee2336d71cc Uploaded
yhoogstrate
parents: 75
diff changeset
7 <requirement type="package" version="latest">biocLite_edgeR_limma</requirement>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
8 </requirements>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
9
95
9dac2146b98c Uploaded
yhoogstrate
parents: 94
diff changeset
10 <version_command>
9dac2146b98c Uploaded
yhoogstrate
parents: 94
diff changeset
11 echo $(R --version | grep version | grep -v GNU) " , EdgeR version" $(R --vanilla --slave -e "library(edgeR) ; cat(sessionInfo()\$otherPkgs\$edgeR\$Version)" 2&gt; /dev/null | grep -v "WARNING: ")</version_command>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
12 <command>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
13 <!--
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
14 The following script is written in the "Cheetah" language:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
15 http://www.cheetahtemplate.org/docs/users_guide_html_multipage/contents.html
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
16 -->
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
17
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
18 R --vanilla --slave -f $R_script '--args
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
19 $expression_matrix
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
20 $design_matrix
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
21 $contrast
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
22
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
23 $fdr
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
24
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
25 $output_count_edgeR
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
26 $output_cpm
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
27
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
28 /dev/null <!-- Calculation of FPKM/RPKM should come here -->
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
29
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
30 #if $output_raw_counts:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
31 $output_raw_counts
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
32 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
33 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
34 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
35
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
36 #if $output_MDSplot_logFC:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
37 $output_MDSplot_logFC
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
38 #else:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
39 /dev/null
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
40 #end if
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
41
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
42 #if $output_MDSplot_bcv:
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
43 $output_MDSplot_bcv
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
44 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
45 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
46 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
47
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
48 #if $output_BCVplot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
49 $output_BCVplot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
50 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
51 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
52 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
53
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
54 #if $output_MAplot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
55 $output_MAplot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
56 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
57 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
58 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
59
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
60 #if $output_PValue_distribution_plot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
61 $output_PValue_distribution_plot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
62 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
63 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
64 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
65
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
66 #if $output_hierarchical_clustering_plot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
67 $output_hierarchical_clustering_plot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
68 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
69 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
70 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
71
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
72 #if $output_heatmap_plot:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
73 $output_heatmap_plot
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
74 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
75 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
76 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
77
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
78 #if $output_RData_obj:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
79 $output_RData_obj
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
80 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
81 /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
82 #end if
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
83
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
84 $output_format_images
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
85 '
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
86 #if $output_R:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
87 > $output_R
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
88 #else:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
89 > /dev/null
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
90 #end if
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
91 </command>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
92
94
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
93 <stdio>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
94 <regex match="Calculating library sizes from column"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
95 source="stderr"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
96 level="log" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
97 <regex match="During startup - Warning messages"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
98 source="stderr"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
99 level="log" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
100 <regex match="Setting LC_[^ ]+ failed"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
101 source="stderr"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
102 level="warning"
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
103 description="LOCALE has not been set correctly" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
104 </stdio>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
105
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
106 <inputs>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
107 <param name="expression_matrix" type="data" format="tabular" label="Expression (read count) matrix" />
94
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
108 <param name="design_matrix" type="data" format="tabular" label="Design matrix" help="Ensure your samplenames are identical to those in the expression matrix. Preferentially, create the contrast matrix using 'edgeR: Design- from Expression matrix'." />
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
109
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
110 <param name="contrast" type="text" label="Contrast (biological question)" help="e.g. 'tumor-normal' or '(G1+G2)/2-G3' using the factors chosen in the design matrix. Read the 'makeContrasts' manual from Limma package for more info: http://www.bioconductor.org/packages/release/bioc/html/limma.html and http://www.bioconductor.org/packages/release/bioc/vignettes/limma/inst/doc/usersguide.pdf." />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
111
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
112 <param name="fdr" type="float" min="0" max="1" value="0.05" label="False Discovery Rate (FDR)" />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
113
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
114 <param name="outputs" type="select" label="Optional desired outputs" multiple="true" display="checkboxes">
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
115 <option value="make_output_raw_counts">Raw counts table</option>
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
116 <option value="make_output_MDSplot_logFC">MDS-plot (logFC-method)</option>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
117 <option value="make_output_MDSplot_bcv">MDS-plot (BCV-method; much slower)</option>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
118 <option value="make_output_BCVplot">BCV-plot</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
119 <option value="make_output_MAplot">MA-plot</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
120 <option value="make_output_PValue_distribution_plot">P-Value distribution plot</option>
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
121 <option value="make_output_hierarchical_clustering_plot">Hierarchical custering (under contstruction)</option>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
122 <option value="make_output_heatmap_plot">Heatmap</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
123
43
c6e787bb605c Uploaded
yhoogstrate
parents: 42
diff changeset
124 <option value="make_output_R_stdout">R stdout</option>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
125 <option value="make_output_RData_obj">R Data object</option>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
126 </param>
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
127
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
128 <param name="output_format_images" type="select" label="Output format of images" display="radio">
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
129 <option value="png">Portable network graphics (.png)</option>
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
130 <option value="pdf">Portable document format (.pdf)</option>
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
131 <option value="svg">Scalable vector graphics (.svg)</option>
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
132 </param>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
133 </inputs>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
134
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
135 <configfiles>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
136 <configfile name="R_script">
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
137 library(limma,quietly=TRUE) ## enable quietly to avoid unnecessaity stderr dumping
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
138 library(edgeR,quietly=TRUE) ## enable quietly to avoid unnecessaity stderr dumping
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
139 library(splines,quietly=TRUE) ## enable quietly to avoid unnecessaity stderr dumping
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
140
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
141 ## Fetch commandline arguments
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
142 args &lt;- commandArgs(trailingOnly = TRUE)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
143
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
144 expression_matrix_file = args[1]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
145 design_matrix_file = args[2]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
146 contrast = args[3]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
147
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
148 fdr = args[4]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
149
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
150 output_count_edgeR = args[5]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
151 output_cpm = args[6]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
152
43
c6e787bb605c Uploaded
yhoogstrate
parents: 42
diff changeset
153 output_xpkm = args[7] ##FPKM file - yet to be implemented
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
154
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
155 output_raw_counts = args[8]
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
156 output_MDSplot_logFC = args[9]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
157 output_MDSplot_bcv = args[10]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
158 output_BCVplot = args[11]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
159 output_MAplot = args[12]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
160 output_PValue_distribution_plot = args[13]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
161 output_hierarchical_clustering_plot = args[14]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
162 output_heatmap_plot = args[15]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
163 output_RData_obj = args[16]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
164 output_format_images = args[17]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
165
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
166
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
167 library(edgeR)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
168 ##raw_data &lt;- read.delim(designmatrix,header=T,stringsAsFactors=T)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
169 ## Obtain read-counts
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
170
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
171 expression_matrix &lt;- read.delim(expression_matrix_file,header=T,stringsAsFactors=F,row.names=1,check.names=FALSE,na.strings=c(""))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
172 design_matrix &lt;- read.delim(design_matrix_file,header=T,stringsAsFactors=F,row.names=1,check.names=FALSE,na.strings=c(""))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
173
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
174 colnames(design_matrix) &lt;- make.names(colnames(design_matrix))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
175
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
176 for(i in 1:ncol(design_matrix)) {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
177 old &lt;- design_matrix[,i]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
178 design_matrix[,i] &lt;- make.names(design_matrix[,i])
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
179 if(paste(design_matrix[,i],collapse="\t") != paste(old,collapse="\t")) {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
180 print("Renaming of factors:")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
181 print(old)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
182 print("To:")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
183 print(design_matrix[,i])
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
184 }
45
f710e5ed7cea Uploaded
yhoogstrate
parents: 44
diff changeset
185 ## The following line seems to malfunction the script:
f710e5ed7cea Uploaded
yhoogstrate
parents: 44
diff changeset
186 ##design_matrix[,i] &lt;- as.factor(design_matrix[,i])
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
187 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
188
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
189 ## 1) In the expression matrix, you only want to have the samples described in the design matrix
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
190 columns &lt;- match(rownames(design_matrix),colnames(expression_matrix))
43
c6e787bb605c Uploaded
yhoogstrate
parents: 42
diff changeset
191 columns &lt;- columns[!is.na(columns)]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
192 read_counts &lt;- expression_matrix[,columns]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
193
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
194 ## 2) In the design matrix, you only want to have samples of which you really have the counts
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
195 columns &lt;- match(colnames(read_counts),rownames(design_matrix))
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
196 columns &lt;- columns[!is.na(columns)]
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
197 design_matrix &lt;- design_matrix[columns,,drop=FALSE]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
198
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
199 ## Filter for HTSeq predifined counts:
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
200 exclude_HTSeq &lt;- c("no_feature","ambiguous","too_low_aQual","not_aligned","alignment_not_unique")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
201 exclude_DEXSeq &lt;- c("_ambiguous","_empty","_lowaqual","_notaligned")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
202
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
203 exclude &lt;- match(c(exclude_HTSeq, exclude_DEXSeq),rownames(read_counts))
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
204 exclude &lt;- exclude[is.na(exclude)==0]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
205 if(length(exclude) != 0) {
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
206 read_counts &lt;- read_counts[-exclude,]
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
207 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
208
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
209
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
210 ## sorting expression matrix with the order of the read_counts
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
211 ##order &lt;- match(colnames(read_counts) , rownames(design_matrix))
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
212 ##read_counts_ordered &lt;- read_counts[,order2]
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
213
44
261b997b241f Uploaded
yhoogstrate
parents: 43
diff changeset
214 empty_samples &lt;- apply(read_counts,2,function(x) sum(x) == 0)
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
215 if(sum(empty_samples) > 0) {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
216 write(paste("There are ",sum(empty_samples)," empty samples found:",sep=""),stderr())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
217 write(colnames(read_counts)[empty_samples],stderr())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
218 } else {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
219
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
220 dge &lt;- DGEList(counts=read_counts,genes=rownames(read_counts))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
221
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
222 formula &lt;- paste(c("~0",make.names(colnames(design_matrix))),collapse = " + ")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
223 design_matrix_tmp &lt;- design_matrix
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
224 colnames(design_matrix_tmp) &lt;- make.names(colnames(design_matrix_tmp))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
225 design &lt;- model.matrix(as.formula(formula),design_matrix_tmp)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
226 rm(design_matrix_tmp)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
227
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
228 # Filter prefixes
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
229 prefixes = colnames(design_matrix)[attr(design,"assign")]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
230 avoid = nchar(prefixes) == nchar(colnames(design))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
231 replacements = substr(colnames(design),nchar(prefixes)+1,nchar(colnames(design)))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
232 replacements[avoid] = colnames(design)[avoid]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
233 colnames(design) = replacements
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
234
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
235 # Do normalization
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
236 write("Calculating normalization factors...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
237 dge &lt;- calcNormFactors(dge)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
238 write("Estimating common dispersion...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
239 dge &lt;- estimateGLMCommonDisp(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
240 write("Estimating trended dispersion...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
241 dge &lt;- estimateGLMTrendedDisp(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
242 write("Estimating tagwise dispersion...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
243 dge &lt;- estimateGLMTagwiseDisp(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
244
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
245
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
246 if(output_MDSplot_logFC != "/dev/null") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
247 write("Creating MDS plot (logFC method)",stdout())
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
248 points &lt;- plotMDS.DGEList(dge,top=500,labels=rep("",nrow(dge\$samples)))# Get coordinates of unflexible plot
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
249 dev.off()# Kill it
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
250
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
251 if(output_format_images == "pdf") {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
252 pdf(output_MDSplot_logFC,height=14,width=14)
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
253 } else if(output_format_images == "svg") {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
254 svg(output_MDSplot_logFC,height=14,width=14)
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
255 } else {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
256 ## png(output_MDSplot_logFC)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
257 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings: https://biostar.usegalaxy.org/p/9170/
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
258
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
259 bitmap(output_MDSplot_logFC,type="png16m",height=14,width=14)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
260 }
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
261
55
8c63794c3d3e Uploaded
yhoogstrate
parents: 53
diff changeset
262
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
263 diff_x &lt;- abs(max(points\$x)-min(points\$x))
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
264 diff_y &lt;-(max(points\$y)-min(points\$y))
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
265 plot(c(min(points\$x),max(points\$x) + 0.45 * diff_x), c(min(points\$y) - 0.05 * diff_y,max(points\$y) + 0.05 * diff_y), main="edgeR logFC-MDS Plot on top 500 genes",type="n", xlab="Leading logFC dim 1", ylab="Leading logFC dim 2")
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
266 points(points\$x,points\$y,pch=20)
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
267 text(points\$x, points\$y,rownames(dge\$samples),cex=1.25,col="gray",pos=4)
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
268 rm(diff_x,diff_y)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
269
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
270 dev.off()
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
271 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
272
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
273 if(output_MDSplot_bcv != "/dev/null") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
274 write("Creating MDS plot (bcv method)",stdout())
93
31335aa52b2e Uploaded
yhoogstrate
parents: 91
diff changeset
275
31335aa52b2e Uploaded
yhoogstrate
parents: 91
diff changeset
276 ## 1. First create a virtual plot to obtain the desired coordinates
31335aa52b2e Uploaded
yhoogstrate
parents: 91
diff changeset
277 pdf("bcvmds.pdf")
31335aa52b2e Uploaded
yhoogstrate
parents: 91
diff changeset
278 points &lt;- plotMDS.DGEList(dge,method="bcv",top=500,labels=rep("",nrow(dge\$samples)))
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
279 dev.off()# Kill it
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
280
93
31335aa52b2e Uploaded
yhoogstrate
parents: 91
diff changeset
281 ## 2. Re-plot the coordinates in a new figure with the size and settings.
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
282 if(output_format_images == "pdf") {
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
283 pdf(output_MDSplot_bcv,height=14,width=14)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
284 } else if(output_format_images == "svg") {
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
285 svg(output_MDSplot_bcv,height=14,width=14)
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
286 } else {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
287 ## png(output_MDSplot_bcv)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
288 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings: https://biostar.usegalaxy.org/p/9170/
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
289
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
290 bitmap(output_MDSplot_bcv,type="png16m",height=14,width=14)
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
291 }
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
292
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
293 diff_x &lt;- abs(max(points\$x)-min(points\$x))
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
294 diff_y &lt;-(max(points\$y)-min(points\$y))
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
295 plot(c(min(points\$x),max(points\$x) + 0.45 * diff_x), c(min(points\$y) - 0.05 * diff_y,max(points\$y) + 0.05 * diff_y), main="edgeR BCV-MDS Plot",type="n", xlab="Leading BCV dim 1", ylab="Leading BCV dim 2")
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
296 points(points\$x,points\$y,pch=20)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
297 text(points\$x, points\$y,rownames(dge\$samples),cex=1.25,col="gray",pos=4)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
298 rm(diff_x,diff_y)
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
299
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
300 dev.off()
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
301 }
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
302
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
303
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
304 if(output_BCVplot != "/dev/null") {
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
305 write("Creating Biological coefficient of variation plot",stdout())
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
306
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
307 if(output_format_images == "pdf") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
308 pdf(output_BCVplot)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
309 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
310 svg(output_BCVplot)
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
311 } else {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
312 ## png(output_BCVplot)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
313 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings: https://biostar.usegalaxy.org/p/9170/
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
314
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
315 bitmap(output_BCVplot,type="png16m")
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
316 }
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
317
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
318 plotBCV(dge, cex=0.4, main="edgeR: Biological coefficient of variation (BCV) vs abundance")
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
319 dev.off()
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
320 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
321
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
322
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
323 write("Fitting GLM...",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
324 fit &lt;- glmFit(dge,design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
325
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
326 write(paste("Performing likelihood ratio test: ",contrast,sep=""),stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
327 cont &lt;- c(contrast)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
328 cont &lt;- makeContrasts(contrasts=cont, levels=design)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
329
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
330 lrt &lt;- glmLRT(fit, contrast=cont[,1])
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
331 write(paste("Exporting to file: ",output_count_edgeR,sep=""),stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
332 write.table(file=output_count_edgeR,topTags(lrt,n=nrow(read_counts))\$table,sep="\t",row.names=TRUE,col.names=NA)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
333 write.table(file=output_cpm,cpm(dge,normalized.lib.sizes=TRUE),sep="\t",row.names=TRUE,col.names=NA)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
334
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
335 ## todo EXPORT FPKM
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
336 write.table(file=output_raw_counts,dge\$counts,sep="\t",row.names=TRUE,col.names=NA)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
337
34
a36a9af266e5 Uploaded
yhoogstrate
parents: 33
diff changeset
338 if(output_MAplot != "/dev/null" || output_PValue_distribution_plot != "/dev/null") {
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
339 etable &lt;- topTags(lrt, n=nrow(dge))\$table
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
340 etable &lt;- etable[order(etable\$FDR), ]
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
341
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
342 if(output_MAplot != "/dev/null") {
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
343 write("Creating MA plot...",stdout())
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
344
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
345 if(output_format_images == "pdf") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
346 pdf(output_MAplot)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
347 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
348 svg(output_MAplot)
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
349 } else {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
350 ## png(output_MAplot)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
351 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings: https://biostar.usegalaxy.org/p/9170/
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
352
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
353 bitmap(output_MAplot,type="png16m")
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
354 }
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
355
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
356 with(etable, plot(logCPM, logFC, pch=20, main="edgeR: Fold change vs abundance"))
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
357 with(subset(etable, FDR &lt; fdr), points(logCPM, logFC, pch=20, col="red"))
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
358 abline(h=c(-1,1), col="blue")
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
359 dev.off()
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
360 }
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
361
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
362 if(output_PValue_distribution_plot != "/dev/null") {
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
363 write("Creating P-value distribution plot...",stdout())
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
364
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
365 if(output_format_images == "pdf") {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
366 pdf(output_PValue_distribution_plot,width=14,height=14)
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
367 } else if(output_format_images == "svg") {
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
368 svg(output_PValue_distribution_plot,width=14,height=14)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
369 } else {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
370 ## png(output_PValue_distribution_plot)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
371 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings: https://biostar.usegalaxy.org/p/9170/
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
372
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
373 bitmap(output_PValue_distribution_plot,type="png16m",width=14,height=14)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
374 }
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
375
32
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
376 expressed_genes &lt;- subset(etable, PValue &lt; 0.99)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
377 h &lt;- hist(expressed_genes\$PValue,breaks=nrow(expressed_genes)/15,main="Binned P-Values (&lt; 0.99)")
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
378 center &lt;- sum(h\$counts) / length(h\$counts)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
379 lines(c(0,1),c(center,center),lty=2,col="red",lwd=2)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
380 k &lt;- ksmooth(h\$mid, h\$counts)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
381 lines(k\$x,k\$y,col="red",lwd=2)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
382 rmsd &lt;- (h\$counts) - center
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
383 rmsd &lt;- rmsd^2
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
384 rmsd &lt;- sum(rmsd)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
385 rmsd &lt;- sqrt(rmsd)
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
386 text(0,max(h\$counts),paste("e=",round(rmsd,2),sep=""),pos=4,col="blue")
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
387 ## change e into epsilon somehow
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
388 dev.off()
87bf067cfc53 Uploaded
yhoogstrate
parents: 29
diff changeset
389 }
40
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
390 }
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
391
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
392 if(output_heatmap_plot != "/dev/null") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
393
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
394 if(output_format_images == "pdf") {
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
395 pdf(output_heatmap_plot,width=10.5)
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
396 } else if(output_format_images == "svg") {
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
397 svg(output_heatmap_plot,width=10.5)
91
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
398 } else {
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
399 ## png(output_heatmap_plot)
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
400 ## png does not work out of the box in the Galaxy Toolshed Version of R due to its compile settings: https://biostar.usegalaxy.org/p/9170/
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
401
c81da57fff20 Uploaded
yhoogstrate
parents: 90
diff changeset
402 bitmap(output_heatmap_plot,type="png16m",width=10.5)
70
fb64bf85361d Uploaded
yhoogstrate
parents: 69
diff changeset
403 }
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
404
40
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
405 etable2 &lt;- topTags(lrt, n=100)\$table
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
406 order &lt;- rownames(etable2)
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
407 cpm_sub &lt;- cpm(dge,normalized.lib.sizes=TRUE,log=TRUE)[as.numeric(order),]
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
408 heatmap(t(cpm_sub))
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
409 dev.off()
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
410 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
411
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
412 ##output_hierarchical_clustering_plot = args[13]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
413
35
fab87cecb3e5 Uploaded
yhoogstrate
parents: 34
diff changeset
414 if(output_RData_obj != "/dev/null") {
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
415 save.image(output_RData_obj)
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
416 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
417
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
418 write("Done!",stdout())
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
419 }
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
420 </configfile>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
421 </configfiles>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
422
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
423 <outputs>
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
424 <data format="tabular" name="output_count_edgeR" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - differentially expressed genes" />
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
425 <data format="tabular" name="output_cpm" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - CPM" />
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
426
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
427 <data format="tabular" name="output_raw_counts" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - raw counts">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
428 <filter>outputs and ("make_output_raw_counts" in outputs)</filter>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
429 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
430
89
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
431 <data format="png" name="output_MDSplot_logFC" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - MDS-plot (logFC method)">
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
432 <filter>outputs and ("make_output_MDSplot_logFC" in outputs)</filter>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
433
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
434 <change_format>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
435 <when input="output_format_images" value="png" format="png" />
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
436 <when input="output_format_images" value="pdf" format="pdf" />
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
437 <when input="output_format_images" value="svg" format="svg" />
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
438 </change_format>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
439 </data>
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
440
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
441 <data format="png" name="output_MDSplot_bcv" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - MDS-plot (bcv method)">
875f080136b6 Solved a very serious bug: if contrast and design matrix described samples not in the same order, statistical analysis goes wrong
yhoogstrate
parents: 83
diff changeset
442 <filter>outputs and ("make_output_MDSplot_bcv" in outputs)</filter>
59
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
443
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
444 <change_format>
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
445 <when input="output_format_images" value="png" format="png" />
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
446 <when input="output_format_images" value="pdf" format="pdf" />
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
447 <when input="output_format_images" value="svg" format="svg" />
f20dc31afd5e Uploaded
yhoogstrate
parents: 57
diff changeset
448 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
449 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
450
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
451 <data format="png" name="output_BCVplot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - BCV-plot">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
452 <filter>outputs and ("make_output_BCVplot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
453
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
454 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
455 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
456 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
457 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
458 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
459 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
460
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
461 <data format="png" name="output_MAplot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - MA-plot">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
462 <filter>outputs and ("make_output_MAplot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
463
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
464 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
465 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
466 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
467 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
468 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
469 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
470
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
471 <data format="png" name="output_PValue_distribution_plot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - P-Value distribution">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
472 <filter>outputs and ("make_output_PValue_distribution_plot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
473
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
474 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
475 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
476 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
477 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
478 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
479 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
480
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
481 <data format="png" name="output_hierarchical_clustering_plot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - Hierarchical custering">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
482 <filter>outputs and ("make_output_hierarchical_clustering_plot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
483
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
484 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
485 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
486 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
487 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
488 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
489 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
490
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
491 <data format="png" name="output_heatmap_plot" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - Heatmap">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
492 <filter>outputs and ("make_output_heatmap_plot" in outputs)</filter>
60
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
493
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
494 <change_format>
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
495 <when input="output_format_images" value="png" format="png" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
496 <when input="output_format_images" value="pdf" format="pdf" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
497 <when input="output_format_images" value="svg" format="svg" />
c5cc1b24f7d8 Uploaded
yhoogstrate
parents: 59
diff changeset
498 </change_format>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
499 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
500
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
501 <data format="RData" name="output_RData_obj" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - R data object">
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
502 <filter>outputs and ("make_output_RData_obj" in outputs)</filter>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
503 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
504
40
d9c9caf4e182 Uploaded
yhoogstrate
parents: 39
diff changeset
505 <data format="txt" name="output_R" label="edgeR DGE on ${design_matrix.hid}: ${design_matrix.name} - R output (debug)" >
53
b89788eead3c Uploaded
yhoogstrate
parents: 45
diff changeset
506 <filter>outputs and ("make_output_R_stdout" in outputs)</filter>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
507 </data>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
508 </outputs>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
509
94
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
510 <tests>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
511 <test>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
512 <param name="expression_matrix" value="Differential_Gene_Expression/expression_matrix.tabular.txt" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
513 <param name="design_matrix" value="Differential_Gene_Expression/design_matrix.tabular.txt" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
514
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
515 <param name="contrast" value="E-C"/>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
516
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
517 <param name="fdr" value="0.05" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
518
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
519 <param name="output_format_images" value="png" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
520
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
521 <output name="output_count_edgeR" file="Differential_Gene_Expression/differentially_expressed_genes.tabular.txt" />
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
522 </test>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
523 </tests>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
524
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
525 <help>
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
526 edgeR: Differential Gene(Expression) Analysis
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
527 #############################################
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
528
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
529 Overview
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
530 --------
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
531 Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data [1].
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
532
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
533 For every experiment, the algorithm requires a design matrix. This matrix describes which samples belong to which groups.
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
534 More details on this are given in the edgeR manual: http://www.bioconductor.org/packages/2.12/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
535 and the limma manual.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
536
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
537 Because the creation of a design matrix can be complex and time consuming, especially if no GUI is used, this package comes with an alternative tool which can help you with it.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
538 This tool is called *edgeR Design Matrix Creator*.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
539 If the appropriate design matrix (with corresponding links to the files) is given,
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
540 the correct contrast ( http://en.wikipedia.org/wiki/Contrast_(statistics) ) has to be given.
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
541
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
542 If you have for example two groups, with an equal weight, you would like to compare either
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
543 "g1-g2" or "normal-cancer".
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
544
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
545 The test function makes use of a MCF7 dataset used in a study that indicates that a higher sequencing depth is not neccesairily more important than a higher amount of replaciates[2].
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
546
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
547 Input
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
548 -----
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
549 Expression matrix
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
550 ^^^^^^^^^^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
551 ::
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
552
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
553 Geneid "\t" Sample-1 "\t" Sample-2 "\t" Sample-3 "\t" Sample-4 [...] "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
554 SMURF "\t" 123 "\t" 21 "\t" 34545 "\t" 98 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
555 BRCA1 "\t" 435 "\t" 6655 "\t" 45 "\t" 55 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
556 LINK33 "\t" 4 "\t" 645 "\t" 345 "\t" 1 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
557 SNORD78 "\t" 498 "\t" 65 "\t" 98 "\t" 27 ... "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
558 [...]
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
559
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
560 *Note: Make sure the number of columns in the header is identical to the number of columns in the body.*
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
561
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
562 Design matrix
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
563 ^^^^^^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
564 ::
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
565
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
566 Sample "\t" Condition "\t" Ethnicity "\t" Patient "\t" Batch "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
567 Sample-1 "\t" Tumor "\t" European "\t" 1 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
568 Sample-2 "\t" Normal "\t" European "\t" 1 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
569 Sample-3 "\t" Tumor "\t" European "\t" 2 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
570 Sample-4 "\t" Normal "\t" European "\t" 2 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
571 Sample-5 "\t" Tumor "\t" African "\t" 3 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
572 Sample-6 "\t" Normal "\t" African "\t" 3 "\t" 1 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
573 Sample-7 "\t" Tumor "\t" African "\t" 4 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
574 Sample-8 "\t" Normal "\t" African "\t" 4 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
575 Sample-9 "\t" Tumor "\t" Asian "\t" 5 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
576 Sample-10 "\t" Normal "\t" Asian "\t" 5 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
577 Sample-11 "\t" Tumor "\t" Asian "\t" 6 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
578 Sample-12 "\t" Normal "\t" Asian "\t" 6 "\t" 2 "\n"
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
579
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
580 *Note: Avoid factor names that are (1) numerical, (2) contain mathematical symbols and preferebly only use letters.*
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
581
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
582 Contrast
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
583 ^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
584 The contrast represents the biological question. There can be many questions asked, e.g.:
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
585
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
586 - Tumor-Normal
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
587 - African-European
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
588 - 0.5*(Control+Placebo) / Treated
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
589
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
590 Installation
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
591 ------------
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
592
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
593 This tool requires no specific configurations. The following dependencies are installed automatically:
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
594
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
595 - R
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
596 - Bioconductor
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
597 - limma
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
598 - edgeR
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
599
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
600 License
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
601 -------
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
602 - R
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
603 - GPL 2 &amp; GPL 3
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
604 - limma
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
605 - GPL (&gt;=2)
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
606 - edgeR
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
607 - GPL (&gt;=2)
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
608
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
609 References
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
610 ----------
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
611
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
612 EdgeR
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
613 ^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
614 **[1] edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.**
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
615
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
616 *Mark D. Robinson, Davis J. McCarthy and Gordon K. Smyth* - Bioinformatics (2010) 26 (1): 139-140.
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
617
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
618 - http://www.bioconductor.org/packages/2.12/bioc/html/edgeR.html
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
619 - http://dx.doi.org/10.1093/bioinformatics/btp616
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
620 - http://www.bioconductor.org/packages/release/bioc/html/edgeR.html
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
621
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
622 Test-data (MCF7)
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
623 ^^^^^^^^^^^^^^^^
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
624 **[2] RNA-seq differential expression studies: more sequence or more replication?**
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
625
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
626 *Yuwen Liu, Jie Zhou and Kevin P. White* - Bioinformatics (2014) 30 (3): 301-304.
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
627
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
628 - http://www.ncbi.nlm.nih.gov/pubmed/24319002
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
629 - http://dx.doi.org/10.1093/bioinformatics/btt688
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
630
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
631 Contact
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
632 -------
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
633
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
634 The tool wrapper has been written by Youri Hoogstrate from the Erasmus
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
635 Medical Center (Rotterdam, Netherlands) on behalf of the Translational
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
636 Research IT (TraIT) project:
83
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
637
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
638 http://www.ctmm.nl/en/programmas/infrastructuren/traitprojecttranslationeleresearch
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
639
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
640 More tools by the Translational Research IT (TraIT) project can be found
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
641 in the following toolsheds:
83
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
642
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
643 http://toolshed.dtls.nl/
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
644
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
645 http://toolshed.g2.bx.psu.edu
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
646
b2738b4d7c8c Uploaded
yhoogstrate
parents: 79
diff changeset
647 http://testtoolshed.g2.bx.psu.edu/
79
c86f1f86b3c1 Uploaded
yhoogstrate
parents: 77
diff changeset
648
36
185effd01633 Uploaded
yhoogstrate
parents: 35
diff changeset
649 I would like to thank Hina Riaz - Naz Khan for her helpful contribution.
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
650 </help>
94
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
651
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
652 <citations>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
653 <citation type="doi">10.1093/bioinformatics/btp616</citation>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
654 <citation type="doi">10.1093/bioinformatics/btt688</citation>
46745f5666ac Added test
yhoogstrate
parents: 93
diff changeset
655 </citations>
25
7cb518091b18 Uploaded
yhoogstrate
parents:
diff changeset
656 </tool>