# HG changeset patch # User bgruening # Date 1375791647 14400 # Node ID 1f312af2f8db08a2466bd9f9f57f01212c667d76 # Parent c8a0dc48149331dc81313d39fd377ec9190599a5 Uploaded diff -r c8a0dc481493 -r 1f312af2f8db bamCompare.xml --- a/bamCompare.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/bamCompare.xml Tue Aug 06 08:20:47 2013 -0400 @@ -1,5 +1,5 @@ - Normalize and compare two BAM files to output ratio, log2ratio or difference. + normalizes and compares two BAM files to obtain the ratio, log2ratio or difference. deepTools numpy @@ -157,7 +157,7 @@ help="If set, reads that have the same orientation and start position will be considered only once. If reads are paired, the mate position also has to coincide to ignore a read." /> diff -r c8a0dc481493 -r 1f312af2f8db bamCorrelate.xml --- a/bamCorrelate.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/bamCorrelate.xml Tue Aug 06 08:20:47 2013 -0400 @@ -1,5 +1,5 @@ - corrlates pairs of bam files + correlates pairs of BAM files deepTools @@ -125,7 +125,7 @@ + help ="If set, then regions with zero counts for *all* BAM files given are included. The default behavior is to ignore those cases" /> @@ -156,15 +156,18 @@ **What it does** -Genomes are split into bins of given length. For each bin the number of reads -found for each of the bam files is counted. A correlation is computed for all -pairs of bam files. +This tool is useful to assess the overall similarity of different BAM files. A typical application +is to check the correlation between replicates or published data sets. + +The tool splits the genomes are into bins of given length. For each bin, the number of reads +found in each BAM file is counted and a correlation is computed for all +pairs of BAM files. ----- .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_. diff -r c8a0dc481493 -r 1f312af2f8db bamCoverage.xml --- a/bamCoverage.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/bamCoverage.xml Tue Aug 06 08:20:47 2013 -0400 @@ -129,7 +129,7 @@ .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_. diff -r c8a0dc481493 -r 1f312af2f8db bamFingerprint.xml --- a/bamFingerprint.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/bamFingerprint.xml Tue Aug 06 08:20:47 2013 -0400 @@ -141,15 +141,20 @@ **What it does** -Samples indexed bam files and plots a profile for each bam file. At each -sample position all reads overlaping a window (bin) of specified length are -counted. This counts are then sorted and the cumulative sum plotted +This tool is based on a method developed by Diaz et al. (2012). Stat Appl Genet Mol Biol 11(3). +The resulting plot can be used to assess the strength of a ChIP (for factors that bind to narrow regions). +The tool first samples indexed bam files and counts all reads overlapping a window (bin) of specified length. +These counts are then sorted according to their rank and the cumulative sum of read counts are plotted. An ideal input +with perfect uniform distribution of reads along the genome (i.e. without enrichments in open chromatin etc.) should +generate a straight diagonal line. A very specific and strong ChIP enrichment will be indicated by a prominent and steep +rise of the cumulative sum towards the highest rank. This means that a big chunk of reads from the ChIP sample is located in +few bins which corresponds to high, narrow enrichments seen for transcription factors. ----- .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_. diff -r c8a0dc481493 -r 1f312af2f8db bigwigCompare.xml --- a/bigwigCompare.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/bigwigCompare.xml Tue Aug 06 08:20:47 2013 -0400 @@ -1,5 +1,5 @@ - compares two bigwig files based on the number of mapped reads + normalizes and compares two bigWig files to obtain the ratio, log2ratio or difference deepTools ucsc_tools @@ -92,14 +92,14 @@ This tool compares two bigwig files based on the number of mapped reads. To compare the bigwig files the genome is partitioned into bins of equal size, then the number of reads found in each BAM file are counted for such bins and -finally a summarizing value is reported. This vaule can be the ratio of the -number of readsper bin, the log2 of the ratio, the sum or the difference. +finally a summarizing value is reported. This value can be the ratio of the +number of reads per bin, the log2 of the ratio, the sum or the difference. ----- .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_. diff -r c8a0dc481493 -r 1f312af2f8db computeGCBias.xml --- a/computeGCBias.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/computeGCBias.xml Tue Aug 06 08:20:47 2013 -0400 @@ -1,5 +1,6 @@ - + to see whether your samples should be normalized for GC bias + deepTools @@ -51,13 +52,17 @@ #end if - #if $output.showOutputSettings == "yes" - #if $output.saveBiasPlot: - --biasPlot biasPlot.png ; - mv biasPlot.png $biasPlot - #end if + #if $saveBiasPlot: + --biasPlot $biasPlot #end if +## #if $output.showOutputSettings == "yes" +## #if $output.saveBiasPlot: +## --biasPlot biasPlot.png ; +## mv biasPlot.png $biasPlot +## #end if +## #end if + ; rm $temp_dir -rf @@ -123,6 +128,8 @@ + + @@ -144,8 +152,8 @@ **What it does** -Computes the GC bias ussing Benjamini's method [citation]. The resulting GC -bias can later be used to plot the bias or to correct the bias. +This tool computes the GC bias ussing the method proposed by Benjamini and Speed (2012). Nucleic Acids Res. +The output is used to plot the bias and can also be used later on to correct the bias with the tool correctGCbias. ----- diff -r c8a0dc481493 -r 1f312af2f8db computeMatrix.xml --- a/computeMatrix.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/computeMatrix.xml Tue Aug 06 08:20:47 2013 -0400 @@ -4,14 +4,29 @@ deepTools + #import tempfile + + #set $temp_input_handle = tempfile.NamedTemporaryFile() + #set $temp_input_path = $temp_input_handle.name + #silent $temp_input_handle.close() + + #for $rf in $regionsFiles: + cat "$rf.regionsFile" >> $temp_input_path; + #if str($rf.label.value).strip(): + echo "\#$rf.label.value" >> $temp_input_path; + #else: + echo "\#$rf.regionsFile.name" >> $temp_input_path; + #end if + #end for + + computeMatrix $mode.mode_select - --regionsFileName '$regionsFile' + --regionsFileName '$temp_input_path' --scoreFileName '$scoreFile' --outFileName '$outFileName' - ##ToDo --numberOfProcessors 4 #if $output.showOutputSettings == "yes" @@ -61,11 +76,17 @@ #end if #end if + ; rm $temp_input_path - - + + + + + + + diff -r c8a0dc481493 -r 1f312af2f8db correctGCBias.xml --- a/correctGCBias.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/correctGCBias.xml Tue Aug 06 08:20:47 2013 -0400 @@ -1,6 +1,5 @@ - - + use the output from computeGCBias to obtain corrected sample files deepTools ucsc_tools @@ -106,14 +105,13 @@ **What it does** -Computes the GC bias ussing Benjamini's method [citation]. The resulting GC -bias can later be used to plot the bias or to correct the bias. - +This tool requires the output from computeGCBias to correct the given BAM files according to the method proposed by Benjamini and Speed (2012). Nucleic Acids Res. +The resulting BAM files can be used in any downstream analyses, but be aware that you should not filter out duplicates from here on. ----- .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_. diff -r c8a0dc481493 -r 1f312af2f8db heatmapper.xml --- a/heatmapper.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/heatmapper.xml Tue Aug 06 08:20:47 2013 -0400 @@ -144,7 +144,7 @@ - + @@ -353,7 +353,7 @@ .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_. diff -r c8a0dc481493 -r 1f312af2f8db profiler.xml --- a/profiler.xml Mon Aug 05 11:36:11 2013 -0400 +++ b/profiler.xml Tue Aug 06 08:20:47 2013 -0400 @@ -11,8 +11,8 @@ --matrixFile $matrixFile #if $output.showOutputSettings == "yes" - #set newoutFileName=str($outFileName)+"."+str($output.outFileFormat) - --outFileName $newoutFilename + #set newoutFileName = str($outFileName)+"."+str($output.outFileFormat) + --outFileName $newoutFileName #if $output.outFileNameData: --outFileNameData '$output.outFileNameData' #end if @@ -25,7 +25,7 @@ --outFileSortedRegions '$output.outFileSortedRegions' #end if #else - #set newoutFileName=str($outFileName)+".png" + #set newoutFileName = str($outFileName)+".png" --outFileName $newoutFileName #end if @@ -96,7 +96,7 @@ - + @@ -171,7 +171,7 @@ .. class:: infomark -Please acknowledge that this tool **is still in development** and we will be very happy to receive feedback from the users. If you run into any trouble please sent an email to `Fidel Ramirez`_. +If you would like to give us feedback or you run into any trouble, please sent an email to deeptools@googlegroups.com This tool is developed by the `Bioinformatics and Deep-Sequencing Unit`_ at the `Max Planck Institute for Immunobiology and Epigenetics`_.