Mercurial > repos > greg > ideas
changeset 88:462ce06410c6 draft
Uploaded
| author | greg | 
|---|---|
| date | Thu, 24 Aug 2017 10:12:45 -0400 | 
| parents | 665b0feae3de | 
| children | d6ab97fb8aca | 
| files | ideas.xml | 
| diffstat | 1 files changed, 18 insertions(+), 18 deletions(-) [+] | 
line wrap: on
 line diff
--- a/ideas.xml Thu Aug 24 09:57:20 2017 -0400 +++ b/ideas.xml Thu Aug 24 10:12:45 2017 -0400 @@ -148,7 +148,7 @@ <option value="manual">manually setting them for each selected input</option> </param> <when value="extract"> - <param name="input" type="data" format="bigwig,bam" multiple="True" label="BAM or BigWig file"> + <param name="input" type="data" format="bigwig,bam" multiple="True" label="BAM or BigWig files"> <validator type="empty_field"/> <validator type="unspecified_build"/> </param> @@ -239,26 +239,26 @@ ----- -**Required options** +**Options** - * **Cell type, Epigenetic factor and Input** - specify any number of inputs with currently supported formats, either bam or bigwig. The cell name + factor name must be unique for each input. For example, if you have replicate data you may want to specify the cell name as "cell_rep1", "cell_rep2", etc and the factor name as "factor_rep1", "factor_rep2", etc. - - * **Cell type name** - cell type name - * **Epigenetic factor name** - epigenetic factor name - * **BAM or BigWig file** - BAM or BigWig file - - * **Set genomic windows on which to process the data** - if "No" is selected, IDEAS will run whole genome segmentation. If "Yes" is selected, IDEAS will segment genomes in the unit of the windows defined by the bed file. This file can be in BED3, BED4 or BED5 format, but only the first three columns (chr posst posed) will be used. +* **Select input type** - select a data matrix file produced in a previous run or one or more Bam or Bigwig datasets. +* **Set cell type and epigenetic factor names by** - cell type and epigenetic factor names can be set manually or by extracting them from the names of the selected input datasets. The latter case requires all selected datasets to have names that contain a "-" character. - * **Window size in base pairs** - Window size in base pairs (if "No" is selected) - * **Restrict processing to specified chromosomes** - If "Yes" is selected, processing will be restricted to specified chromosomes - - * **Chromosomes** - processing will be restricted to specified chromosomes (if "Yes" is selected) + * **BAM or BigWig files** - select one or more Bam or Bigwig files from yhour history, making sure that the name of every selected input include a "-" character. + * **Cell type, Epigenetic factor and Input** - manually select any number of inputs, setting the cell type and epigenetic factor name for each. The combination of "cell type name" and "epigenetic factor name" must be unique for each input. For example, if you have replicate data you may want to specify the cell name as "cell_rep1", "cell_rep2", etc and the factor name as "factor_rep1", "factor_rep2", etc. + + * **Cell type name** - cell type name + * **Epigenetic factor name** - epigenetic factor name + * **BAM or BigWig file** - BAM or BigWig file - * **Chromosome** - specified chromosome +* **Select Bed file that defines genomic windows on which to process the data** - if "No" is selected, IDEAS will run whole genome segmentation. If "Yes" is selected, IDEAS will segment genomes in the unit of the windows defined by the bed file. This file can be in BED3, BED4 or BED5 format, but only the first three columns (chr posst posed) will be used. - * **Bed file specifying the genomic windows** - bed file specifying the genomic windows (if "Yes" is selected) + * **Window size in base pairs** - Window size in base pairs (if "No" is selected) + * **Restrict processing to specified chromosomes** - If "Yes" is selected, processing will be restricted to specified chromosomes -**Other options** + * **Chromosomes** - processing will be restricted to specified chromosomes (if "Yes" is selected) + + * **Bed file specifying the genomic windows** - bed file specifying the genomic windows (if "Yes" is selected) * **Output chromosomes in separate files** - select "Yes" to produce separate files for each chromosome, allowing you to run IDEAS on different chromosomes separately. * **Calculate the average signal in each genomic window using** - use the bigWigAverageOverBed utility from the UCSC genome browser to calculate average signal (number of reads per bp) in each genomic window. @@ -271,8 +271,8 @@ * **Initial number of states** - while IDEAS may infer 30 states or more by starting from just 20 states, it may not do so if it is trapped in a local mode. We recommend setting the initial number of states slightly larger than the number of states you expect. * **Maximum number of position classes to be inferred** - Set this value only if: - * you do not want position classes (e.g., for testing purposes), in this case set the value to 1 - * IDEAS runs slow because there are too many position classes, generally less than 100 position classes will run fine + * you do not want position classes (e.g., for testing purposes), in this case set the value to 1 + * IDEAS runs slow because there are too many position classes, generally less than 100 position classes will run fine * **Maximum number of cell type clusters allowed** - Set this value only for testing. If you set the value to 1, then all cell types will be clustered in one group. * **Prior concentration** - specify the prior concentration parameter; default is A=sqrt(number of cell types). A smaller concentration parameter (e.g., 1 or less) will emphasize more on position specificity and a larger concentration parameter (e.g., 10 * number of cell types) will emphasize more on global homogeneity.
