annotate export2graphlan.xml @ 15:e743b0890ce2 draft

Uploaded
author george-weingart
date Thu, 04 Sep 2014 14:47:30 -0400
parents b084b394910e
children 8c34d0d94c44
rev   line source
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
1 <tool id="export2graphlan" name="export2graphlan" version="1.0.0">
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
2 <description>Export to Graphlan</description>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
3 <command interpreter="python">
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
4 export2graphlan.py
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
5 -i $inp_data
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
6 -o $out_data
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
7 -t $output_tree_file
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
8 -a $output_annot_file
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
9 --title $export_title
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
10 --annotations $export_annotations
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
11 --external_annotations $export_external_annotations
14
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
12 --background_levels $background_levels
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
13 --background_clades $background_clades
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
14 --skip_rows 1,2
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
15 </command>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
16
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
17 <inputs>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
18 <param format="tabular" name="inp_data" type="data" label="Input used to run Lefse - See samples below - Please use Galaxy Get-Data/Upload-File. Use File-Type = Tabular" help="This is the file that was used as input for Lefse"/>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
19 <param format="lefse_res" name="out_data" type="data" label="Output of Lefse" help="This is the Lefse output file"/>
14
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
20 <param name="export_title" type="text" format="text" label="Title" value="Title"/>
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
21 <param name="export_annotations" type="text" format="text" label="Annotations" value="2,3"/>
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
22 <param name="export_external_annotations" type="text" format="text" label="External Annotations" value="4,5,6"/>
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
23 <param name="background_levels" type="text" format="text" label="Background Levels" value="1,2,3"/>
15
e743b0890ce2 Uploaded
george-weingart
parents: 14
diff changeset
24 <param name="background_clades" type="text" format="text" label="Background Clades" value=" " />
14
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
25
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
26
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
27 </inputs>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
28 <outputs>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
29 <data name="output_annot_file" format="circl" />
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
30 <data name="output_tree_file" format="circl" />
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
31 </outputs>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
32
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
33 <help>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
34 Overview
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
35 ========
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
36 **export2graphlan** is an *OPTIONAL* tool that automatically converts **LEfSe**, **MetaPhlAn2**, and **HUMAnN** input and/or output files into the files needed by **GraPhlAn**. The input file can also be given in BIOM format (both version 1 and 2).
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
37
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
38 The aim of this tool is to support biologists by automatically providing the tree and the annotation file for GraPhlAn.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
39
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
40 Input files
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
41 -----------
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
42
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
43 As shown in the image below, export2graphlan can work with just one of the following files or with both of them.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
44
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
45 * **Result of MetaPhlAn or HUMAnN analysis**: As depicted in the image below, this file can be the result of a MetaPhlAn analysis or a HUMAnN analysis. Generally, it is a tab-separated file in which each row has a taxonomy and an abundance value.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
46
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
47 * **Output of LEfSe**: This file is the result of running LEfSe on the *Result of MetaPhlAn or HUMAnN analysis* file. It allows GraPhlAn to highlight the biomarkers that were found.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
48
14
b084b394910e Uploaded
george-weingart
parents: 13
diff changeset
49 Input parameters
4
c0c7f369e331 Uploaded
george-weingart
parents: 3
diff changeset
50 --------------------
3
ebe3cb467f8c Uploaded
george-weingart
parents: 2
diff changeset
51
4
c0c7f369e331 Uploaded
george-weingart
parents: 3
diff changeset
52 --annotations ANNOTATIONS
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
53 List which levels should be annotated in the tree. Use
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
54 a comma-separated list, e.g.,
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
55 --annotations 1,2,3. Default is None
4
c0c7f369e331 Uploaded
george-weingart
parents: 3
diff changeset
56 --external_annotations EXTERNAL_ANNOTATIONS
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
57 List which levels should use the external legend for
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
58 the annotation. Use a comma-separated list,
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
59 e.g., --external_annotations 1,2,3. Default is None
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
60 --background_levels BACKGROUND_LEVELS
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
61 List which levels should be highlighted with a shaded
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
62 background. Use a comma-separated list, e.g.,
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
63 --background_levels 1,2,3
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
64 --background_clades BACKGROUND_CLADES
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
65 Specify the clades that should be highlighted with a
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
66 shaded background. Use a comma-separated list
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
67 and surround the string with " if it contains spaces.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
68 Example: --background_clades "Bacteria.Actinobacteria,
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
69 Bacteria.Bacteroidetes.Bacteroidia,
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
70 Bacteria.Firmicutes.Clostridia.Clostridiales"
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
71 --background_colors BACKGROUND_COLORS
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
72 Set the color to use for the shaded background. Colors
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
73 can be either in RGB or HSV (using a semi-colon to
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
74 separate values, surrounded with ()) format. Use a
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
75 comma-separated list and surround the string
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
76 with " if it contains spaces. Example:
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
77 --background_colors "#29cc36, (150; 100; 100), (280;
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
78 80; 88)"
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
79 --title TITLE If specified, set the title of the GraPhlAn plot.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
80 Surround the string with " if it contains spaces,
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
81 e.g., --title "Title example"
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
82 --title_font_size TITLE_FONT_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
83 Set the title font size. Default is 15
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
84 --def_clade_size DEF_CLADE_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
85 Set a default size for clades that are not found as
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
86 biomarkers by LEfSe. Default is 10
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
87 --min_clade_size MIN_CLADE_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
88 Set the minimum value of clades that are biomarkers.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
89 Default is 20
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
90 --max_clade_size MAX_CLADE_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
91 Set the maximum value of clades that are biomarkers.
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
92 Default is 200
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
93 --def_font_size DEF_FONT_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
94 Set a default font size. Default is 10
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
95 --min_font_size MIN_FONT_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
96 Set the minimum font size to use. Default is 8
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
97 --max_font_size MAX_FONT_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
98 Set the maximum font size. Default is 12
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
99 --annotation_legend_font_size ANNOTATION_LEGEND_FONT_SIZE
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
100 Set the font size for the annotation legend. Default
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
101 is 10
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
102 --abundance_threshold ABUNDANCE_THRESHOLD
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
103 Set the minimum abundance value for a clade to be
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
104 annotated. Default is 20.0
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
105 --most_abundant MOST_ABUNDANT
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
106 When only lefse_input is provided, you can specify how
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
107 many clades to highlight. Since the biomarkers are
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
108 missing, they will be chosen from the most abundant
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
109 --least_biomarkers LEAST_BIOMARKERS
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
110 When only lefse_input is provided, you can specify the
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
111 minimum number of biomarkers to extract. The taxonomy
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
112 is parsed, and the level is chosen in order to have
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
113 at least the specified number of biomarkers
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
114 --discard_otus If specified, the OTU ids will be discarded from the
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
115 taxonomy. Default behavior keeps OTU ids in the taxonomy
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
116 --internal_levels If specified, sum up from leaf to root the abundance
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
117 values. Default behavior does not sum up abundances on
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
118 the internal nodes
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
119
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
120 input parameters:
6
14edb544cdac Uploaded
george-weingart
parents: 5
diff changeset
121 You need to provide at least LEfSe input data
3
ebe3cb467f8c Uploaded
george-weingart
parents: 2
diff changeset
122 -i LEFSE_INPUT, --lefse_input LEFSE_INPUT
6
14edb544cdac Uploaded
george-weingart
parents: 5
diff changeset
123
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
124 -o LEFSE_OUTPUT, --lefse_output LEFSE_OUTPUT
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
125
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
126 output parameters:
13
66c50eadf709 Uploaded
george-weingart
parents: 12
diff changeset
127
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
128 -t TREE, --tree TREE Output filename where to save the input tree for GraPhlAn
13
66c50eadf709 Uploaded
george-weingart
parents: 12
diff changeset
129
9
cc25a2f6b1b9 Uploaded
george-weingart
parents: 8
diff changeset
130 -a ANNOTATION, --annotation ANNOTATION : Output filename where to save the GraPhlAn annotation
7
09eff46a46e7 Uploaded
george-weingart
parents: 6
diff changeset
131
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
132 Input data matrix parameters:
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
133 --sep SEP
9
cc25a2f6b1b9 Uploaded
george-weingart
parents: 8
diff changeset
134 --out_table OUT_TABLE : File to which the processed data matrix is written
12
389074508060 Uploaded
george-weingart
parents: 11
diff changeset
135
11
8cfabe8759ab Uploaded
george-weingart
parents: 10
diff changeset
136 --fname_row FNAME_ROW : Row number containing the names of the features (default 0, specify -1 if no names are present in the matrix)
12
389074508060 Uploaded
george-weingart
parents: 11
diff changeset
137
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
138 --sname_row SNAME_ROW
11
8cfabe8759ab Uploaded
george-weingart
parents: 10
diff changeset
139 Column number containing the names of the samples (default 0, specify -1 if no names are present in the matrix)
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
140 --metadata_rows METADATA_ROWS
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
141 Row numbers to use as metadata [default None, meaning
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
142 no metadata]
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
143 --skip_rows SKIP_ROWS
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
144 Row numbers to skip (0-indexed, comma separated) from
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
145 the input file [default None, meaning no rows skipped]
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
146 --sperc SPERC Percentile of sample value distribution for sample
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
147 selection
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
148 --fperc FPERC Percentile of feature value distribution for feature
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
149 selection
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
150 --stop STOP Number of top samples to select (ordering based on
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
151 percentile specified by --sperc)
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
152 --ftop FTOP Number of top features to select (ordering based on
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
153 percentile specified by --fperc)
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
154 --def_na DEF_NA Set the default value for missing values [default None
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
155 which means no replacement]
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
156
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
157 Integration
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
158 ===========
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
159
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
160 A graphical representation of how **export2graphlan** can be integrated in the analysis pipeline:
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
161
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
162 .. image:: https://bitbucket.org/repo/oL6bEG/images/3364692296-graphlan_integration.png
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
163 :height: 672
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
164 :width: 800
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
165
1
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
166 Want to know more?
2c0d791fc950 Updated the help
george-weingart
parents: 0
diff changeset
167 ==================
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
168
2
dba11280df2c Updated help
george-weingart
parents: 1
diff changeset
169 If you want to know more about **export2graphlan** please have a look at the tutorial
0
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
170 </help>
cac6247cb1d3 graphlan_import
george-weingart
parents:
diff changeset
171 </tool>
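
As a rough illustration of how the command template near the top of the wrapper turns the form parameters into a call to export2graphlan.py, the following Python sketch assembles an equivalent argument list. It is not part of the tool: the function name build_command, the params dictionary, and the example file names are hypothetical. It also assumes that a blank Background Clades value should cause the flag to be omitted, whereas the template above always passes it.

```python
import shlex


def build_command(params):
    """Return an argv list mirroring the wrapper's command template.

    `params` maps the parameter names used in the wrapper to string values.
    """
    argv = [
        "python", "export2graphlan.py",
        "-i", params["inp_data"],
        "-o", params["out_data"],
        "-t", params["output_tree_file"],
        "-a", params["output_annot_file"],
        "--title", params["export_title"],
        "--annotations", params["export_annotations"],
        "--external_annotations", params["export_external_annotations"],
        "--background_levels", params["background_levels"],
    ]
    # Assumption of this sketch: drop the flag when the field is blank.
    clades = params.get("background_clades", "").strip()
    if clades:
        argv += ["--background_clades", clades]
    # The wrapper hard-codes the rows to skip.
    argv += ["--skip_rows", "1,2"]
    return argv


# Hypothetical file names; the other values are the wrapper's form defaults.
example = {
    "inp_data": "lefse_input.tsv",
    "out_data": "lefse_output.res",
    "output_tree_file": "tree.txt",
    "output_annot_file": "annot.txt",
    "export_title": "Title",
    "export_annotations": "2,3",
    "export_external_annotations": "4,5,6",
    "background_levels": "1,2,3",
    "background_clades": " ",
}

# shlex.join (Python 3.8+) quotes any argument that contains spaces.
print(shlex.join(build_command(example)))
```

Building the call as a list instead of a single string means a title or clade list containing spaces stays one argument, which is what the help text asks for when it says to surround such strings with quotes.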