# HG changeset patch # User greg # Date 1480444712 18000 # Node ID 6655e5f6736b5194a741a9dbe083cead2a61c385 # Parent 32ade191a090ab29859026525fa1c643d019fa2f Uploaded diff -r 32ade191a090 -r 6655e5f6736b plant_tribes_assembly_post_processor.xml --- a/plant_tribes_assembly_post_processor.xml Tue Nov 29 09:09:12 2016 -0500 +++ b/plant_tribes_assembly_post_processor.xml Tue Nov 29 13:38:32 2016 -0500 @@ -1,5 +1,5 @@ - - + + pipeline plant_tribes_assembly_post_processor @@ -13,7 +13,7 @@ - + @@ -45,6 +45,7 @@ + @@ -62,10 +63,9 @@ - - + @@ -90,13 +90,46 @@ - + + + + + + + + + + + + + + + + + -Assemblyr Post Processor pipeline +This tool is one of the PlantTribes' collection of automated modular analysis pipelines that utilize objective classifications of +complete protein sequences from sequenced plant genomes to perform comparative evolutionary studies. It post-processes de novo assembly +transcripts into putative coding sequences and their corresponding amino acid translations. + +----- + +**Options** + + * **Prediction method for coding regions** - The prediction method for coding regions; one of ESTScan or TransDecoder. + * **Target gene family assembly?** - Select yes to target gene families to assemble. + * **List of orthogroup identifiers for target gene families to assemble** - History item with a list of orthogroup identifiers for target gene families to assemble. The next version of this tool will use a dynamic select list for this feature. + * **Orthogroups or gene families proteins scaffold** - PlantTribes scaffolds data. + * **Protein clustering method** - One of GFam, OrthoFinder or OrthoMCL. + * **Remove gappy sites in alignments** - Removes gappy sites in alignments (i.e. 0.1 removes sites with 90% gaps): [0.0 to 1.0] + * **De novo transcriptome assembly was performed with strand-specific library?** - Select yes if de novo transcriptome assembly was performed with strand-specific library. + * **Remove repeated sequences in predicted coding regions?** - Select yes to remove repeated sequences in predicted coding regions. + * **Minimum sequence length of predicted coding regions** - Set the minimum sequence length of predicted coding regions. +