Mercurial > repos > yhoogstrate > edger_with_design_matrix
comparison edgeR_Differential_Gene_Expression.xml @ 36:185effd01633 draft
Uploaded
author | yhoogstrate |
---|---|
date | Wed, 21 May 2014 08:50:36 -0400 |
parents | fab87cecb3e5 |
children | 00b9ceac4332 |
comparison
equal
deleted
inserted
replaced
35:fab87cecb3e5 | 36:185effd01633 |
---|---|
332 </data> | 332 </data> |
333 </outputs> | 333 </outputs> |
334 | 334 |
335 <help> | 335 <help> |
336 edgeR: Differential Gene(Expression) Analysis | 336 edgeR: Differential Gene(Expression) Analysis |
337 | 337 ############################################# |
338 **Overview** | 338 |
339 | 339 Overview |
340 Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data. | 340 -------- |
341 Author: Mark Robinson, Davis McCarthy, Yunshun Chen, Aaron Lun & Gordon Smyth | 341 Differential expression analysis of RNA-seq and digital gene expression profiles with biological replication. Uses empirical Bayes estimation and exact tests based on the negative binomial distribution. Also useful for differential signal analysis with other types of genome-scale count data [1]. |
342 | |
343 http://www.bioconductor.org/packages/2.12/bioc/html/edgeR.html | |
344 http://dx.doi.org/10.1093/bioinformatics/btp616 | |
345 | 342 |
346 For every experiment, the algorithm requires a design matrix. This matrix describes which samples belong to which groups. | 343 For every experiment, the algorithm requires a design matrix. This matrix describes which samples belong to which groups. |
347 More details on this are given in the edgeR manual: | 344 More details on this are given in the edgeR manual: http://www.bioconductor.org/packages/2.12/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf |
348 http://www.bioconductor.org/packages/2.12/bioc/vignettes/edgeR/inst/doc/edgeRUsersGuide.pdf | |
349 and the limma manual. | 345 and the limma manual. |
350 | 346 |
351 Because the creation of a design matrix can be complex and time consuming, especially if no GUI is used, this package comes with an alternative tool which can help you with it. | 347 Because the creation of a design matrix can be complex and time consuming, especially if no GUI is used, this package comes with an alternative tool which can help you with it. |
352 This tool is called *edgeR Design Matrix Creator*. | 348 This tool is called *edgeR Design Matrix Creator*. |
353 If the appropriate design matrix (with corresponding links to the files) is given, | 349 If the appropriate design matrix (with corresponding links to the files) is given, |
354 the correct contrast ( http://en.wikipedia.org/wiki/Contrast_(statistics) ) has to be given. | 350 the correct contrast ( http://en.wikipedia.org/wiki/Contrast_(statistics) ) has to be given. |
355 | 351 |
356 If you have for example two groups, with an equal weight, you would like to compare either | 352 If you have for example two groups, with an equal weight, you would like to compare either |
357 "g1~g2" or "normal~cancer". | 353 "g1~g2" or "normal~cancer". |
358 | 354 |
359 ** Input ** | 355 The test function makes use of a MCF7 dataset used in a study that indicates that a higher sequencing depth is not neccesairily more important than a higher amount of replaciates[2]. |
360 | 356 |
361 Expression matrix:: | 357 Input |
358 ----- | |
359 Expression matrix | |
360 ^^^^^^^^^^^^^^^^^ | |
361 :: | |
362 | 362 |
363 Geneid "\t" Sample-1 "\t" Sample-2 "\t" Sample-3 "\t" Sample-4 [...] "\n" | 363 Geneid "\t" Sample-1 "\t" Sample-2 "\t" Sample-3 "\t" Sample-4 [...] "\n" |
364 SMURF "\t" 123 "\t" 21 "\t" 34545 "\t" 98 ... "\n" | 364 SMURF "\t" 123 "\t" 21 "\t" 34545 "\t" 98 ... "\n" |
365 BRCA1 "\t" 435 "\t" 6655 "\t" 45 "\t" 55 ... "\n" | 365 BRCA1 "\t" 435 "\t" 6655 "\t" 45 "\t" 55 ... "\n" |
366 LINK33 "\t" 4 "\t" 645 "\t" 345 "\t" 1 ... "\n" | 366 LINK33 "\t" 4 "\t" 645 "\t" 345 "\t" 1 ... "\n" |
367 SNORD78 "\t" 498 "\t" 65 "\t" 98 "\t" 27 ... "\n" | 367 SNORD78 "\t" 498 "\t" 65 "\t" 98 "\t" 27 ... "\n" |
368 [...] | 368 [...] |
369 | 369 |
370 Note: Make sure the number of columns in the header is identical to the number of columns in the body. | 370 *Note: Make sure the number of columns in the header is identical to the number of columns in the body.* |
371 | 371 |
372 Design matrix:: | 372 Design matrix |
373 ^^^^^^^^^^^^^ | |
374 :: | |
373 | 375 |
374 Sample "\t" Condition "\t" Ethnicity "\t" Patient "\t" Batch "\n" | 376 Sample "\t" Condition "\t" Ethnicity "\t" Patient "\t" Batch "\n" |
375 Sample-1 "\t" Tumor "\t" European "\t" 1 "\t" 1 "\n" | 377 Sample-1 "\t" Tumor "\t" European "\t" 1 "\t" 1 "\n" |
376 Sample-2 "\t" Normal "\t" European "\t" 1 "\t" 1 "\n" | 378 Sample-2 "\t" Normal "\t" European "\t" 1 "\t" 1 "\n" |
377 Sample-3 "\t" Tumor "\t" European "\t" 2 "\t" 1 "\n" | 379 Sample-3 "\t" Tumor "\t" European "\t" 2 "\t" 1 "\n" |
383 Sample-9 "\t" Tumor "\t" Asian "\t" 5 "\t" 2 "\n" | 385 Sample-9 "\t" Tumor "\t" Asian "\t" 5 "\t" 2 "\n" |
384 Sample-10 "\t" Normal "\t" Asian "\t" 5 "\t" 2 "\n" | 386 Sample-10 "\t" Normal "\t" Asian "\t" 5 "\t" 2 "\n" |
385 Sample-11 "\t" Tumor "\t" Asian "\t" 6 "\t" 2 "\n" | 387 Sample-11 "\t" Tumor "\t" Asian "\t" 6 "\t" 2 "\n" |
386 Sample-12 "\t" Normal "\t" Asian "\t" 6 "\t" 2 "\n" | 388 Sample-12 "\t" Normal "\t" Asian "\t" 6 "\t" 2 "\n" |
387 | 389 |
388 Note: Avoid factor names that are (1) numerical, (2) contain mathematical symbols and preferebly only use letters. | 390 *Note: Avoid factor names that are (1) numerical, (2) contain mathematical symbols and preferebly only use letters.* |
389 | 391 |
390 Contrast:: | 392 Contrast |
391 | 393 ^^^^^^^^ |
392 Tumor-Normal | 394 The contrast represents the biological question. There can be many questions asked, e.g.: |
393 African-European | 395 |
394 0.5*(Control+Placebo) / Treated | 396 - Tumor-Normal |
395 | 397 - African-European |
396 | 398 - 0.5*(Control+Placebo) / Treated |
397 **Installation** | 399 |
400 Installation | |
401 ------------ | |
398 | 402 |
399 This tool requires no specific configurations. The following dependencies are installed automatically: | 403 This tool requires no specific configurations. The following dependencies are installed automatically: |
400 * R | 404 |
401 * Bioconductor | 405 - R |
406 - Bioconductor | |
402 - limma | 407 - limma |
408 | |
403 - edgeR | 409 - edgeR |
404 | 410 |
405 **License** | 411 License |
406 - R - GPL-2 & GPL-3 | 412 ------- |
407 - limma - GPL (>=2) | 413 - R |
408 - edgeR - GPL (>=2) | 414 - GPL-2 & GPL-3 |
409 | 415 - limma |
410 **Contact** | 416 - GPL (>=2) |
411 | 417 - edgeR |
418 - GPL (>=2) | |
419 | |
420 References | |
421 ---------- | |
422 | |
423 EdgeR | |
424 ^^^^^ | |
425 **[1] edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.** | |
426 | |
427 *Mark D. Robinson, Davis J. McCarthy and Gordon K. Smyth* - Bioinformatics (2010) 26 (1): 139-140. | |
428 | |
429 - http://www.bioconductor.org/packages/2.12/bioc/html/edgeR.html | |
430 - http://dx.doi.org/10.1093/bioinformatics/btp616 | |
431 - http://www.bioconductor.org/packages/release/bioc/html/edgeR.html | |
432 | |
433 Test-data (MCF7) | |
434 ^^^^^^^^^^^^^^^^ | |
435 **[2] RNA-seq differential expression studies: more sequence or more replication?** | |
436 | |
437 *Yuwen Liu, Jie Zhou and Kevin P. White* - Bioinformatics (2014) 30 (3): 301-304. | |
438 | |
439 - http://www.ncbi.nlm.nih.gov/pubmed/24319002 | |
440 - http://dx.doi.org/10.1093/bioinformatics/btt688 | |
441 | |
442 Contact | |
443 ------- | |
412 The tool wrapper has been written by Youri Hoogstrate from the Erasmus Medical Center (Rotterdam, Netherlands) on behalf of the Translational Research IT (TraIT) project: | 444 The tool wrapper has been written by Youri Hoogstrate from the Erasmus Medical Center (Rotterdam, Netherlands) on behalf of the Translational Research IT (TraIT) project: |
413 http://www.ctmm.nl/en/programmas/infrastructuren/traitprojecttranslationeleresearch | 445 http://www.ctmm.nl/en/programmas/infrastructuren/traitprojecttranslationeleresearch |
414 | 446 |
447 I would like to thank Hina Riaz - Naz Khan for her helpful contribution. | |
448 | |
415 More tools by the Translational Research IT (TraIT) project can be found in the following repository: | 449 More tools by the Translational Research IT (TraIT) project can be found in the following repository: |
416 http://toolshed.nbic.nl/ | 450 http://testtoolshed.g2.bx.psu.edu/ |
417 | |
418 **References** | |
419 | |
420 The test data is coming from: doi: 10.1093/bioinformatics/btt688. | |
421 http://www.ncbi.nlm.nih.gov/pubmed/24319002 | |
422 | |
423 </help> | 451 </help> |
424 </tool> | 452 </tool> |