Mercurial > repos > george-weingart > graphlan_import
diff export2graphlan.xml @ 0:cac6247cb1d3 draft
graphlan_import
author | george-weingart |
---|---|
date | Tue, 26 Aug 2014 14:51:29 -0400 |
parents | |
children | 2c0d791fc950 |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/export2graphlan.xml Tue Aug 26 14:51:29 2014 -0400 @@ -0,0 +1,85 @@ +<tool id="export2graphlan" name="export2graphlan" version="1.0.0"> + <description>Export to Graphlan</description> + <command interpreter="python"> + export2graphlan.py + -i $inp_data + -o $out_data + -t $output_tree_file + -a $output_annot_file + --title $export_title + --annotations $export_annotations + --external_annotations $export_external_annotations + --skip_rows 1,2 + </command> + + <inputs> + <param name="export_title" type="text" format="text" label="Title" value="Title"/> + <param name="export_annotations" type="text" format="text" label="Annotations" value="2,3"/> + <param name="export_external_annotations" type="text" format="text" label="External Annotations" value="4,5,6"/> + <param format="tabular" name="inp_data" type="data" label="Input used to run Lefse - See samples below - Please use Galaxy Get-Data/Upload-File. Use File-Type = Tabular" help="This is the file that was used as input for Lefse"/> + <param format="lefse_res" name="out_data" type="data" label="Output of Lefse" help="This is the Lefse output file"/> + </inputs> + <outputs> + <data name="output_annot_file" format="circl" /> + <data name="output_tree_file" format="circl" /> + </outputs> + + <help> +**export2graphlan** is an automatic conversion script from **LEfSe**, **MetaPhlAn2**, and **HUMAnN** input and/or output files, to **GraPhlAn**. Input file can be also given in BIOM 2.0 format. + +The aim of this tool is to support biologists, helping them by automatically write the tree and the annotation file for **GraPhlAn**. + +---- + +.. contents:: + +---- + +Overview +======== + +A graphical representation of how **export2graphlan** can be used in the analysis pipeline: + +.. image:: https://bitbucket.org/repo/oL6bEG/images/3364692296-graphlan_integration.png + :height: 500 + :width: 600 + + +---- + +HMP aerobiosis +============== + +A taxonomic tree that shows three different classes of oxygen (low, medium, and high), highlighting biomarker clades for each class. Data are taken from the HMP project. + +.. image:: https://bitbucket.org/repo/oL6bEG/images/2487320460-hmp_aerobiosis.png + :height: 500 + :width: 600 + + +Pipeline +-------- + + + # download the data + $ wget http://huttenhower.sph.harvard.edu/webfm_send/129 -O hmp_aerobiosis_small.txt + + # convert the file in LEfSe format, and run LEfSe + $ format_input.py hmp_aerobiosis_small.txt hmp_aerobiosis_small.in -c 1 -s 2 -u 3 -o 1000000 + $ run_lefse.py hmp_aerobiosis_small.in hmp_aerobiosis_small.res + + # convert it! + $ export2graphlan.py -i hmp_aerobiosis_small.txt -o hmp_aerobiosis_small.res -t tree.txt -a annot.txt --title "HMP aerobiosis" --annotations 2,3 --external_annotations 4,5,6 --fname_row 0 --skip_rows 1,2 --ftop 200 + + # attach annotation to the tree + $ graphlan_annotate.py --annot annot.txt tree.txt outtree.txt + + # generate the beautiful image + $ graphlan.py --dpi 300 --size 7.0 outtree.txt outimg.png --external_legends + +The input file is downloaded from The Huttenhower Lab and given to **LEfSe** for biomarkers discovery. The two file (the *LEfSe input* and the *LEfSe output* files) are then passed to **export2graphlan**. In this case the levels 2 (*phylum*) and 3 (*class*) are annotated on the circular tree, while levels 4 (*order*), 5 (*family*), and 6 (*genus*) are put on the external legend. + + + + </help> +</tool>