view boxplot.xml @ 0:1e6d7de8bbad draft

Uploaded
author elixir-it
date Tue, 09 Jun 2020 15:57:22 +0000
parents
children
line wrap: on
line source

<tool id="boxplot" name="boxplot" version="1">
  <description> Boxplots are used to compare VINYL score distribution for single genes. Useful for the identification of genes that accumulate "pathogenic" mutations</description>
  <requirements>
    <requirement type="package" >r-base</requirement>
  </requirements>
  <command> <![CDATA[
	Rscript --vanilla $__tool_directory__/GeneP_pdf.R 
	
	####INPUT
	$fileR $fileC $cutoff

	###OUTPUT
	$ofile
 	
    ]]>
  </command>
  <inputs>
	<param format="tabular,csv" name="fileR" type="data" label="affected" help="VINYL Tabular output format file. Affected individuals "/>
	<param format="tabular,csv" name="fileC" type="data" label="healthy" help="VINYL Tabular output format file. Unaffected individuals"/>
        <param name="cutoff" value="8" min="1" type="float" label="score_cutoff" help="Cut-Off score used for the identification of potentially pathogenic variants"/>

  </inputs>
  <outputs>
    <data format="pdf" name="ofile" label="${tool.name} on ${on_string}: pdf "/>
  </outputs>
  <stdio>
     <regex match="error"
            source="stdout"
            level="fatal"
            description="Unknown error encountered" />
  </stdio>
  <tests>
    <test>
      <param name="fileR" value="R.csv" ftype="csv"/>
      <param name="fileC" value="T.csv" ftype="csv"/>
      <param name="cutoff" value="8" type="float" />
      <output name="ofile" file="boxplot_output_test.pdf" ftype="pdf" />
    </test>
  </tests>
  <help>
**What it does**
VINYL is a software designed to assist in variant prioritization in medium-large cohort of patients. The program computes an aggregate score, which is based on an extensive collection of publicly available annotations, in order to identify/prioritize variants that are likely to be pathogenic or have a clinical significance. In order to derive an optimal cut off score for the variants, VINYL uses a strategy based on "survival analysis", where the pathogenicity score distribution of the affected individuals is compared with a matched cohort of unaffected individuals.
To facilitate the usage of the software, VINYL is provided in the form of a public Galaxy instance, based on the Laniakea suite. To ensure the maximum level of security, VINYL uses Encrypted data volumes for the storage of the data.



**Important Usage Note**
This wrapper provides the module of VINYL that compares score distributions of single genes. Two input files need to be provided, both in tabular format. The first file must contain VINYL scores for affected individuals, the second, equivalent score for unaffected controls. Both files can be obtained by running VINYL on annovar-annotated vcf-files (see the manual and the VINYL wrapper for further explanations). 
See the "survival" and the "VINYL" utilities in the for the delineation of the  score cut-off value and the calculation of individual scores from a single vcf file.
A complete workflow that automates the exectuion of VINYL is avaiable at XXX


  </help>
  <citations>
  </citations>
</tool>