comparison assembly_post_processor.xml @ 31:0fad708a9693 draft

Uploaded
author greg
date Thu, 16 Mar 2017 14:17:22 -0400
parents 0d680d17278c
children 9987e210180a
comparison
equal deleted inserted replaced
30:0d680d17278c 31:0fad708a9693
146 * **Protein clustering method** - gene family scaffold protein clustering method. Each PlantTribes scaffold data has up to three sets of clusters - GFam[8] (clusters of consensus domain architecture), OrthoFinder[9] (broadly defined clusters) or OrthoMCL[10] (narrowly defined clusters). You can also install your own data scaffold created using a different clustering method as long as it conforms to the PlantTribes scaffold data format. 146 * **Protein clustering method** - gene family scaffold protein clustering method. Each PlantTribes scaffold data has up to three sets of clusters - GFam[8] (clusters of consensus domain architecture), OrthoFinder[9] (broadly defined clusters) or OrthoMCL[10] (narrowly defined clusters). You can also install your own data scaffold created using a different clustering method as long as it conforms to the PlantTribes scaffold data format.
147 * **Trim alignments** - trim gene family multiple sequence alignments that include scaffold backbone genes and locally assembled transcripts to remove non-conserved regions (gappy sites)[11]. The trimmed alignments are used in assigning scores to locally assembled transcripts to determine how well they compare to the backbones gene models. The default setting of 0.1 removes sites tha thave gaps in 90% of the sequences in the multiple sequence alignment. This option is restricted to the range 0.0 - 1.0. 147 * **Trim alignments** - trim gene family multiple sequence alignments that include scaffold backbone genes and locally assembled transcripts to remove non-conserved regions (gappy sites)[11]. The trimmed alignments are used in assigning scores to locally assembled transcripts to determine how well they compare to the backbones gene models. The default setting of 0.1 removes sites tha thave gaps in 90% of the sequences in the multiple sequence alignment. This option is restricted to the range 0.0 - 1.0.
148 * **Strand-specific assembly?** - select 'Yes' if transcriptome library sequences were strand-specific. If 'Yes" is selected, transcripts from the minority strand (antisense) are removed. 148 * **Strand-specific assembly?** - select 'Yes' if transcriptome library sequences were strand-specific. If 'Yes" is selected, transcripts from the minority strand (antisense) are removed.
149 * **Remove duplicate sequences?** - select 'Yes' to remove duplicated and exact subsequences[12]. 149 * **Remove duplicate sequences?** - select 'Yes' to remove duplicated and exact subsequences[12].
150 * **Minimum sequence length** - set the minimum sequence length of predicted coding regions. The default is 200 bp. 150 * **Minimum sequence length** - set the minimum sequence length of predicted coding regions. The default is 200 bp.
151 151
152 </help> 152 </help>
153 <citations> 153 <citations>
154 <citation type="bibtex"> 154 <citation type="bibtex">
155 @article{None, 155 @article{None,
156 journal = {None}, 156 journal = {None},
276 year = {2013}, 276 year = {2013},
277 url = {http://hmmer.org},} 277 url = {http://hmmer.org},}
278 </citation> 278 </citation>
279 </citations> 279 </citations>
280 </tool> 280 </tool>
281