view metagene_readthrough.xml @ 19:385fc64fa988 draft default tip

Uploaded
author rlegendre
date Fri, 12 Jun 2015 11:32:59 -0400
parents 7c944fd9907e
children
line wrap: on
line source

<tool id="readthrough_analysis" name="Stop_supp">
	<description>To analyse Ribo-seq alignments for the detection of stop codon readthrough events</description>
	<requirements>
	    <requirement type="package">samtools</requirement>
	    <requirement type="python-module">HTseq</requirement>
	    <requirement type="python-module">pysam</requirement>
	    <requirement type="python-module">csv</requirement>
	    <requirement type="python-module">Bio</requirement>
	</requirements>
	<command interpreter="python"> 
		metagene_readthrough.py --gff $gff --fasta $fasta --bam $mapping --dirout=$output,$output.files_path --extend $ext

	</command>

	<inputs>
		<param name="gff" type="data" label="Reference annotation file (GFF)" format="gff" />
		<param name="fasta" type="data" label="Reference genome in Fasta format" format="fasta" />
		<param name="mapping" type="data" label="Bam File" format="bam" />
		<param name="ext" type="integer" label="Length of 3’ UTR extension downstream the annotated stop codon (in bp)" value="300" />
	</inputs>
            
	<outputs>
		<data format="html"  name="output" label="[RP]Readthrough results on  ${on_string}"/>
	</outputs>
        
	<help>
Summary
-------          
This tool uses Ribo-seq data (BAM file) to extract genes displaying potential stop codon readthrough events from a reference annotation file (`GFF3`_).

C-terminal protein extensions were identified as previously described (Dunn J.G. et al, 2013). Only uniquely mapped footprints with a size in the 25 to 34 range are considered.
A gene is considered to display readthrough if:

 i) It is covered by more than 128 footprints.

 ii) There are footprints after the stop codon.

 iii) There are footprints overlapping the next in-frame stop codon.

 iv) There is no methionine codon in the next five codons downstream from the official stop codon of the CDS.

 v) Coverage is homogeneous within the extension.

Stop codon readthrough was estimated by calculating the ratio of the number of footprints in the C-terminal extension to that in the CDS. Ribosome density footprints were estimated in RPKM (reads per kilobase per million). We controlled for variability due to stop codon peaks, by excluding footprints mapping to stop codons from the calculation of RPKM.

.. _GFF3: http://gmod.org/wiki/GFF3
		
		
Output 
------- 
This tool produces an html file with plots for each gene displaying readthrough.
				

Dependances
------------

.. class:: warningmark

This tool depends on Python (>=2.7) and following packages : numpy 1.8.0, Biopython 1.58, matplotlib 1.3.1, HTSeq and pysam. Samtools is used for bam manipulation.

	</help>
</tool>