# HG changeset patch # User rlegendre # Date 1413817649 14400 # Node ID 0b8fa8b7f35613fa7911200115e2cc2bc32f9e8e # Parent eea5fec46e5c97544aec049890c3115df8de28ea Uploaded diff -r eea5fec46e5c -r 0b8fa8b7f356 metagene_frameshift_analysis.xml --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/metagene_frameshift_analysis.xml Mon Oct 20 11:07:29 2014 -0400 @@ -0,0 +1,73 @@ + + Analyse Ribo-seq alignment to extract translational ambiguities events + + samtools + matplotlib + numpy + PIL + Bio + + + metagene_frameshift_analysis.py --input $reference --bam $mapping --cutoff $cutoff --kmer $kmer --fasta $fasta --dirout $output,$output.files_path --box $boxplot> $log + + + + + + + + + + + + + + + + + + + +Summary +------- +This tool uses Ribo-seq data (bam file) to extract out-of-frame footprints in all genes from a reference annotation file (GFF3). Subprofile are plotted for each gene with dual coding events. + + +*- GFF3 file* : It must contain 9 tabulate-delimited columns : Chromosome, source, feature, start, stop, score, strand, phasing, note. The gene ID was retrieved in note field by "ID=" tag. + +*- Fasta file* : Reference fasta file. Be careful about the chromosome nomenclature used : it must be compatible with your GFF3 annotation file. + +*- BAM file* : It must be sorted. It can contain either multiples or unaligned footprints + +*- Kmer* : Lenght of the best phasing footprint. You can compute it running kmer_analysis + +*- Cutoff* : Integer value for selecting all genes that have less than 60 % (default) of footprints in coding frame. + + + +................................................................................................................................................................................................. + + + +Output +------- +This tool generates 2 output files : + +*- html file* : relative to translational ambiguities detection and visualization. + +*- Stat file* : statistiques about treated footprints and phasing. + +*- Boxplot* : Proportion of footprints in the three frames for all genes. + + +Dependances +------------ + +.. class:: warningmark + +This tool depends on Python (>=2.7) and following packages : numpy 1.8.0, Biopython 1.58, matplotlib 1.3.1. Samtools is used for bam manipulation. + + + + +