comparison assembly_post_processor.xml @ 29:443c8e1fd8aa draft

Uploaded
author greg
date Thu, 16 Mar 2017 14:04:20 -0400
parents eda72de47171
children 0d680d17278c
comparison
equal deleted inserted replaced
28:eda72de47171 29:443c8e1fd8aa
134 134
135 **Required options** 135 **Required options**
136 136
137 * **Transcriptome assembly fasta file** - either de novo or reference-guided transcriptome assembly fasta file selected from your history. 137 * **Transcriptome assembly fasta file** - either de novo or reference-guided transcriptome assembly fasta file selected from your history.
138 * **Coding regions prediction method** - method for finding coding recions within transcripts. Available methods are TransDecoder[3] and ESTScan[4]. 138 * **Coding regions prediction method** - method for finding coding recions within transcripts. Available methods are TransDecoder[3] and ESTScan[4].
139 * **Scores matrices** - Scores matrices, based on a related species, are required when ESTScan is ued to find coding regions. Details of how to create species-specific scores matrices can be found on the ESTScan website (http://estscan.sourceforge.net). Matrices of some organisms are also available to download. 139 * **Scores matrices** - scores matrices, based on a related species, are required when ESTScan is ued to find coding regions. Details of how to create species-specific scores matrices can be found on the ESTScan website (http://estscan.sourceforge.net). Matrices of some organisms are also available to download.
140 140
141 **Other options** 141 **Other options**
142 142
143 * **Perform targeted gene assembly?** - Selecting 'Yes' enables local assembly of one or more targeted gene families in a specific scaffold. Scaffolds are defined in PlantTribes as clusters of paralogous/orthologous sequences from a specified set of proteomes[5-7]. 143 * **Perform targeted gene assembly?** - selecting 'Yes' enables local assembly of one or more targeted gene families in a specific scaffold. Scaffolds are defined in PlantTribes as clusters of paralogous/orthologous sequences from a specified set of proteomes[5-7].
144 * **Targeted gene families** - A selected history item containing a list of targeted orthogroup identifiers corresponding to the gene family classification from a specified scaffold. Gene identifiers can be obtained from the function annotation table ("Orthogroup ID" field of *.summary file) of scaffold data installed into Galaxy via the PlantTribes Scaffolds Download Data Manager tool. 144 * **Targeted gene families** - select a history item containing a list of targeted orthogroup identifiers corresponding to the gene family classification from a specified scaffold. Gene identifiers can be obtained from the function annotation table ("Orthogroup ID" field of .summary file) of scaffold data installed into Galaxy via the PlantTribes Scaffolds Download Data Manager tool.
145 * **Gene family scaffold** - one of the PlantTribes gene family scaffolds (installed into Galaxy by the PlantTribes Scaffolds Download Data Manager tool) whose orthogroups(s) are targeted for the localized assembly. 145 * **Gene family scaffold** - one of the PlantTribes gene family scaffolds (installed into Galaxy by the PlantTribes Scaffolds Download Data Manager tool) whose orthogroups(s) are targeted for the localized assembly.
146 * **Protein clustering method** -gene family scaffold protein clustering method. Each PlantTribes scaffold data has up to three sets of clusters - GFam[8] (clusters of consensus domain architecture), OrthoFinder[9] (broadly defined clusters) or OrthoMCL[10] (narrowly defined clusters). You can also install your own data scaffold created using a different clustering method as long as it conforms to the PlantTribes scaffold data format. 146 * **Protein clustering method** - gene family scaffold protein clustering method. Each PlantTribes scaffold data has up to three sets of clusters - GFam[8] (clusters of consensus domain architecture), OrthoFinder[9] (broadly defined clusters) or OrthoMCL[10] (narrowly defined clusters). You can also install your own data scaffold created using a different clustering method as long as it conforms to the PlantTribes scaffold data format.
147 * **Trim alignments** - trim gene family multiple sequence alignments that include scaffold backbone genes and locally assembled transcripts to remove non-conserved regions (gappy sites)[11]. The trimmed alignments are used in assigning scores to locally assembled transcripts to determine how well they compare to the backbones gene models. The default setting of 0.1 removes sites tha thave gaps in 90% of the sequences in the multiple sequence alignment. This option is restricted to the range 0.0 - 1.0. 147 * **Trim alignments** - trim gene family multiple sequence alignments that include scaffold backbone genes and locally assembled transcripts to remove non-conserved regions (gappy sites)[11]. The trimmed alignments are used in assigning scores to locally assembled transcripts to determine how well they compare to the backbones gene models. The default setting of 0.1 removes sites tha thave gaps in 90% of the sequences in the multiple sequence alignment. This option is restricted to the range 0.0 - 1.0.
148 * **Strand-specific assembly?** - select 'Yes' if transcriptome library sequences were strand-specific. If 'Yes" is selected, transcripts from the minority strand (antisense) are removed. 148 * **Strand-specific assembly?** - select 'Yes' if transcriptome library sequences were strand-specific. If 'Yes" is selected, transcripts from the minority strand (antisense) are removed.
149 * **Remove duplicate sequences?** - select 'Yes' to remove duplicated and exact subsequences[12]. 149 * **Remove duplicate sequences?** - select 'Yes' to remove duplicated and exact subsequences[12].
150 * **Minimum sequence length** - set the minimum sequence length of predicted coding regions. The default is 200 bp. 150 * **Minimum sequence length** - set the minimum sequence length of predicted coding regions. The default is 200 bp.
151 151
152 </help> 152 </help>
153 <citations> 153 <citations>
154 <citation type="bibtex"> 154 <citation type="bibtex">
155 @unpublished{None, 155 @article{None,
156 key = {1}, 156 journal = {None},
157 author = {Eric Wafula}, 157 author = {1. Wafula EK},
158 title = {Manuscript in preparation}, 158 title = {Manuscript in preparation},
159 year = {None}, 159 year = {None},
160 url = {https://github.com/dePamphilis/PlantTribes} 160 url = {https://github.com/dePamphilis/PlantTribes},}
161 } 161 </citation>
162 </citation> 162 <citation type="bibtex">
163 <citation type="bibtex"> 163 @article{Honaas2016,
164 @published{PloS one, 164 journal = {PloS one},
165 key = {2}, 165 author = {2. Honaas LA, Wafula EK, Wickett NJ, Der JP, Zhang Y, Edger PP, Altman NS, Pires JC, Leebens-Mack JH},
166 author = {Honaas, L.A., Wafula, E.K., Wickett, N.J., Der, J.P., Zhang, Y., Edger, P.P., Altman, N.S., Pires, J.C. and Leebens-Mack, J.H.},
167 title = {Selecting superior de novo transcriptome assemblies: lessons learned by leveraging the best plant genome}, 166 title = {Selecting superior de novo transcriptome assemblies: lessons learned by leveraging the best plant genome},
168 year = {2016}, 167 year = {2016},
169 volume = {11}, 168 volume = {11},
170 number = {1}, 169 number = {1},
171 pages = {e0146062} 170 pages = {e0146062},}
172 } 171 </citation>
173 </citation> 172 <citation type="bibtex">
174 <citation type="bibtex"> 173 @article{Haas2013,
175 @published{Nature protocols, 174 journal = {Nature Protocols},
176 key = {3}, 175 author = {3. Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M, MacManes MD},
177 author = {Haas, B.J., Papanicolaou, A., Yassour, M., Grabherr, M., Blood, P.D., Bowden, J., Couger, M.B., Eccles, D., Li, B., Lieber, M. and MacManes, M.D.},
178 title = {De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis}, 176 title = {De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis},
179 year = {2013}, 177 year = {2013},
180 volume = {8}, 178 volume = {8},
181 number = {8}, 179 number = {8},
182 pages = {1494-1512} 180 pages = {1494-1512},}
183 } 181 </citation>
184 </citation> 182 <citation type="bibtex">
185 <citation type="bibtex"> 183 @article{Iseli1999,
186 @published{ISMB, 184 journal = {ISMB},
187 key = {4}, 185 author = {4. Iseli C, Jongeneel CV, Bucher P},
188 author = {Iseli C, Jongeneel CV, Bucher P.}, 186 title = {ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences},
189 title = {ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences.},
190 year = {1999}, 187 year = {1999},
191 volume = {99}, 188 volume = {99},
192 pages = {138-148} 189 pages = {138-148},
193 url = {http://estscan.sourceforge.net} 190 url = {http://estscan.sourceforge.net},}
194 } 191 </citation>
195 </citation> 192 <citation type="bibtex">
196 <citation type="bibtex"> 193 @article{Huang1999,
197 @published{Genome research, 194 journal = {Genome Research},
198 key = {5}, 195 author = {5. Huang X, Madan A},
199 author = {Huang X, Madan A},
200 title = {CAP3: A DNA sequence assembly program}, 196 title = {CAP3: A DNA sequence assembly program},
201 year = {1999}, 197 year = {1999},
202 volume = {9}, 198 volume = {9},
203 number = {9}, 199 number = {9},
204 pages = {868-877} 200 pages = {868-877},
205 url = {http://seq.cs.iastate.edu/cap3.html} 201 url = {http://seq.cs.iastate.edu/cap3.html},}
206 } 202 </citation>
207 </citation> 203 <citation type="bibtex">
208 <citation type="bibtex"> 204 @article{Eddy2009,
209 @published{Genome Inform, 205 journal = {Genome Inform},
210 key = {6}, 206 author = {6. Eddy SR},
211 author = {Eddy, S.R.},
212 title = {A new generation of homology search tools based on probabilistic inference}, 207 title = {A new generation of homology search tools based on probabilistic inference},
213 year = {2009}, 208 year = {2009},
214 volume = {23}, 209 volume = {23},
215 number = {1}, 210 number = {1},
216 pages = {205-211} 211 pages = {205-211},}
217 } 212 </citation>
218 </citation> 213 <citation type="bibtex">
219 <citation type="bibtex"> 214 @article{Sasidharan2012,
220 @published{Nucleic acids research, 215 journal = {Nucleic Acids Research},
221 key = {7}, 216 author = {7. Sasidharan R, Nepusz T, Swarbreck D, Huala E, Paccanaro A},
222 author = {Sasidharan, R., Nepusz, T., Swarbreck, D., Huala, E. and Paccanaro, A.},
223 title = {GFam: a platform for automatic annotation of gene families}, 217 title = {GFam: a platform for automatic annotation of gene families},
224 year = {2012}, 218 year = {2012},
225 pages = {gks631} 219 pages = {gks631},}
226 } 220 </citation>
227 </citation> 221 <citation type="bibtex">
228 <citation type="bibtex"> 222 @article{Li2003,
229 @published{Genome research, 223 journal = {Genome Research}
230 key = {8}, 224 author = {8. Li L, Stoeckert CJ, Roos DS},
231 author = {Li, L., Stoeckert, C.J. and Roos, D.S.},
232 title = {OrthoMCL: identification of ortholog groups for eukaryotic genomes}, 225 title = {OrthoMCL: identification of ortholog groups for eukaryotic genomes},
233 year = {2003}, 226 year = {2003},
234 volume = {13}, 227 volume = {13},
235 number = {9}, 228 number = {9},
236 pages = {2178-2189} 229 pages = {2178-2189},}
237 } 230 </citation>
238 </citation> 231 <citation type="bibtex">
239 <citation type="bibtex"> 232 @article{Emms2015,
240 @published{Genome biology, 233 journal = {Genome Biology}
241 key = {9}, 234 author = {9. Emms DM, Kelly S},
242 author = {Emms, D.M. and Kelly, S.},
243 title = {OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy}, 235 title = {OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy},
244 year = {2015}, 236 year = {2015},
245 volume = {16}, 237 volume = {16},
246 number = {1}, 238 number = {1},
247 pages = {157} 239 pages = {157},}
248 } 240 </citation>
249 </citation> 241 <citation type="bibtex">
250 <citation type="bibtex"> 242 @article{Capella-Gutierrez2009,
251 @published{Bioinformatics, 243 journal = {Bioinformatics,},
252 key = {10}, 244 author = {10. Capella-Gutierrez S, Silla-Martínez JM, Gabaldón T},
253 author = {Capella-Gutiérrez, S., Silla-Martínez, J.M. and Gabaldón, T.},
254 title = {trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses}, 245 title = {trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses},
255 year = {2009}, 246 year = {2009},
256 volume = {25}, 247 volume = {25},
257 number = {15}, 248 number = {15},
258 pages = {1972-1973} 249 pages = {1972-1973},}
259 } 250 </citation>
260 </citation> 251 <citation type="bibtex">
261 <citation type="bibtex"> 252 @article{Gremme2013,
262 @published{IEEE/ACM Transactions on Computational Biology and Bioinformatics, 253 journal = {IEEE/ACM Transactions on Computational Biology and Bioinformatics},
263 key = {11}, 254 author = {11. Gremme G, Steinbiss S, Kurtz S},
264 author = {Gremme, G., Steinbiss, S. and Kurtz, S.},
265 title = {GenomeTools: a comprehensive software library for efficient processing of structured genome annotations}, 255 title = {GenomeTools: a comprehensive software library for efficient processing of structured genome annotations},
266 year = {2013}, 256 year = {2013},
267 volume = {10}, 257 volume = {10},
268 number = {3}, 258 number = {3},
269 pages = {645-656} 259 pages = {645-656},}
270 } 260 </citation>
271 </citation> 261 <citation type="bibtex">
272 <citation type="bibtex"> 262 @article{None,
273 @unpublished{None, 263 journal = {GitHub repository},
274 key = {12}, 264 author = {12. None},
275 author = {None},
276 title = {HMMER 3.1+ hmmscan search sequence(s) against a profile database}, 265 title = {HMMER 3.1+ hmmscan search sequence(s) against a profile database},
277 year = {2013}, 266 year = {2013},
278 url = {http://hmmer.org/} 267 url = {http://hmmer.org},}
279 }
280 </citation> 268 </citation>
281 </citations> 269 </citations>
282 </tool> 270 </tool>
271