comparison edgeR_Differential_Gene_Expression.xml @ 36:185effd01633 draft

Uploaded
author yhoogstrate
date Wed, 21 May 2014 08:50:36 -0400
parents fab87cecb3e5
children 00b9ceac4332
comparison
equal deleted inserted replaced
35:fab87cecb3e5 36:185effd01633
332 </data> 332 </data>
333 </outputs> 333 </outputs>
334 334
335 <help> 335 <help>
336 edgeR: Differential Gene(Expression) Analysis 336 edgeR: Differential Gene(Expression) Analysis
337 337 #############################################
338 **Overview** 338
339 339 Overview
340 Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data. 340 --------
341 Author: Mark Robinson, Davis McCarthy, Yunshun Chen, Aaron Lun &amp; Gordon Smyth 341 Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data [1].
342
343 http://www.bioconductor.org/packages/2.12/bioc/html/edgeR.html
344 http://dx.doi.org/10.1093/bioinformatics/btp616
345 342
346 For every experiment, the algorithm requires a design matrix. This matrix describes which samples belong to which groups. 343 For every experiment, the algorithm requires a design matrix. This matrix describes which samples belong to which groups.
347 More details on this are given in the edgeR manual: 344 More details on this are given in the edgeR manual: http://www.bioconductor.org/packages/2.12/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf
348 http://www.bioconductor.org/packages/2.12/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf
349 and the limma manual. 345 and the limma manual.
350 346
351 Because the creation of a design matrix can be complex and time consuming, especially if no GUI is used, this package comes with an alternative tool which can help you with it. 347 Because the creation of a design matrix can be complex and time consuming, especially if no GUI is used, this package comes with an alternative tool which can help you with it.
352 This tool is called *edgeR Design Matrix Creator*. 348 This tool is called *edgeR Design Matrix Creator*.
353 If the appropriate design matrix (with corresponding links to the files) is given, 349 If the appropriate design matrix (with corresponding links to the files) is given,
354 the correct contrast ( http://en.wikipedia.org/wiki/Contrast_(statistics) ) has to be given. 350 the correct contrast ( http://en.wikipedia.org/wiki/Contrast_(statistics) ) has to be given.
355 351
356 If you have for example two groups, with an equal weight, you would like to compare either 352 If you have for example two groups, with an equal weight, you would like to compare either
357 "g1~g2" or "normal~cancer". 353 "g1~g2" or "normal~cancer".
358 354
359 ** Input ** 355 The test function makes use of a MCF7 dataset used in a study that indicates that a higher sequencing depth is not neccesairily more important than a higher amount of replaciates[2].
360 356
361 Expression matrix:: 357 Input
358 -----
359 Expression matrix
360 ^^^^^^^^^^^^^^^^^
361 ::
362 362
363 Geneid "\t" Sample-1 "\t" Sample-2 "\t" Sample-3 "\t" Sample-4 [...] "\n" 363 Geneid "\t" Sample-1 "\t" Sample-2 "\t" Sample-3 "\t" Sample-4 [...] "\n"
364 SMURF "\t" 123 "\t" 21 "\t" 34545 "\t" 98 ... "\n" 364 SMURF "\t" 123 "\t" 21 "\t" 34545 "\t" 98 ... "\n"
365 BRCA1 "\t" 435 "\t" 6655 "\t" 45 "\t" 55 ... "\n" 365 BRCA1 "\t" 435 "\t" 6655 "\t" 45 "\t" 55 ... "\n"
366 LINK33 "\t" 4 "\t" 645 "\t" 345 "\t" 1 ... "\n" 366 LINK33 "\t" 4 "\t" 645 "\t" 345 "\t" 1 ... "\n"
367 SNORD78 "\t" 498 "\t" 65 "\t" 98 "\t" 27 ... "\n" 367 SNORD78 "\t" 498 "\t" 65 "\t" 98 "\t" 27 ... "\n"
368 [...] 368 [...]
369 369
370 Note: Make sure the number of columns in the header is identical to the number of columns in the body. 370 *Note: Make sure the number of columns in the header is identical to the number of columns in the body.*
371 371
372 Design matrix:: 372 Design matrix
373 ^^^^^^^^^^^^^
374 ::
373 375
374 Sample "\t" Condition "\t" Ethnicity "\t" Patient "\t" Batch "\n" 376 Sample "\t" Condition "\t" Ethnicity "\t" Patient "\t" Batch "\n"
375 Sample-1 "\t" Tumor "\t" European "\t" 1 "\t" 1 "\n" 377 Sample-1 "\t" Tumor "\t" European "\t" 1 "\t" 1 "\n"
376 Sample-2 "\t" Normal "\t" European "\t" 1 "\t" 1 "\n" 378 Sample-2 "\t" Normal "\t" European "\t" 1 "\t" 1 "\n"
377 Sample-3 "\t" Tumor "\t" European "\t" 2 "\t" 1 "\n" 379 Sample-3 "\t" Tumor "\t" European "\t" 2 "\t" 1 "\n"
383 Sample-9 "\t" Tumor "\t" Asian "\t" 5 "\t" 2 "\n" 385 Sample-9 "\t" Tumor "\t" Asian "\t" 5 "\t" 2 "\n"
384 Sample-10 "\t" Normal "\t" Asian "\t" 5 "\t" 2 "\n" 386 Sample-10 "\t" Normal "\t" Asian "\t" 5 "\t" 2 "\n"
385 Sample-11 "\t" Tumor "\t" Asian "\t" 6 "\t" 2 "\n" 387 Sample-11 "\t" Tumor "\t" Asian "\t" 6 "\t" 2 "\n"
386 Sample-12 "\t" Normal "\t" Asian "\t" 6 "\t" 2 "\n" 388 Sample-12 "\t" Normal "\t" Asian "\t" 6 "\t" 2 "\n"
387 389
388 Note: Avoid factor names that are (1) numerical, (2) contain mathematical symbols and preferebly only use letters. 390 *Note: Avoid factor names that are (1) numerical, (2) contain mathematical symbols and preferebly only use letters.*
389 391
390 Contrast:: 392 Contrast
391 393 ^^^^^^^^
392 Tumor-Normal 394 The contrast represents the biological question. There can be many questions asked, e.g.:
393 African-European 395
394 0.5*(Control+Placebo) / Treated 396 - Tumor-Normal
395 397 - African-European
396 398 - 0.5*(Control+Placebo) / Treated
397 **Installation** 399
400 Installation
401 ------------
398 402
399 This tool requires no specific configurations. The following dependencies are installed automatically: 403 This tool requires no specific configurations. The following dependencies are installed automatically:
400 * R 404
401 * Bioconductor 405 - R
406 - Bioconductor
402 - limma 407 - limma
408
403 - edgeR 409 - edgeR
404 410
405 **License** 411 License
406 - R - GPL-2 &amp; GPL-3 412 -------
407 - limma - GPL (&gt;=2) 413 - R
408 - edgeR - GPL (&gt;=2) 414 - GPL-2 &amp; GPL-3
409 415 - limma
410 **Contact** 416 - GPL (&gt;=2)
411 417 - edgeR
418 - GPL (&gt;=2)
419
420 References
421 ----------
422
423 EdgeR
424 ^^^^^
425 **[1] edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.**
426
427 *Mark D. Robinson, Davis J. McCarthy and Gordon K. Smyth* - Bioinformatics (2010) 26 (1): 139-140.
428
429 - http://www.bioconductor.org/packages/2.12/bioc/html/edgeR.html
430 - http://dx.doi.org/10.1093/bioinformatics/btp616
431 - http://www.bioconductor.org/packages/release/bioc/html/edgeR.html
432
433 Test-data (MCF7)
434 ^^^^^^^^^^^^^^^^
435 **[2] RNA-seq differential expression studies: more sequence or more replication?**
436
437 *Yuwen Liu, Jie Zhou and Kevin P. White* - Bioinformatics (2014) 30 (3): 301-304.
438
439 - http://www.ncbi.nlm.nih.gov/pubmed/24319002
440 - http://dx.doi.org/10.1093/bioinformatics/btt688
441
442 Contact
443 -------
412 The tool wrapper has been written by Youri Hoogstrate from the Erasmus Medical Center (Rotterdam, Netherlands) on behalf of the Translational Research IT (TraIT) project: 444 The tool wrapper has been written by Youri Hoogstrate from the Erasmus Medical Center (Rotterdam, Netherlands) on behalf of the Translational Research IT (TraIT) project:
413 http://www.ctmm.nl/en/programmas/infrastructuren/traitprojecttranslationeleresearch 445 http://www.ctmm.nl/en/programmas/infrastructuren/traitprojecttranslationeleresearch
414 446
447 I would like to thank Hina Riaz - Naz Khan for her helpful contribution.
448
415 More tools by the Translational Research IT (TraIT) project can be found in the following repository: 449 More tools by the Translational Research IT (TraIT) project can be found in the following repository:
416 http://toolshed.nbic.nl/ 450 http://testtoolshed.g2.bx.psu.edu/
417
418 **References**
419
420 The test data is coming from: doi: 10.1093/bioinformatics/btt688.
421 http://www.ncbi.nlm.nih.gov/pubmed/24319002
422
423 </help> 451 </help>
424 </tool> 452 </tool>