Mercurial > repos > david-hoover > garbage1
comparison variant_select.xml @ 33:642c7f54431a draft default tip
Uploaded
author | david-hoover |
---|---|
date | Thu, 20 Sep 2012 13:23:44 -0400 |
parents | 29507d31c174 |
children |
comparison
equal
deleted
inserted
replaced
32:29507d31c174 | 33:642c7f54431a |
---|---|
1 <tool id="gatk2_variant_select" name="Select Variants" version="0.0.3"> | 1 <tool id="gatk2_variant_select" name="Select Variants" version="0.0.4"> |
2 <description>from VCF files</description> | 2 <description>from VCF files</description> |
3 <requirements> | 3 <requirements> |
4 <requirement type="package" version="2.1">gatk</requirement> | 4 <requirement type="package" version="2.1">gatk</requirement> |
5 </requirements> | 5 </requirements> |
6 <command interpreter="python">gatk2_wrapper.py | 6 <command interpreter="python">gatk2_wrapper.py |
511 </test> | 511 </test> |
512 </tests> | 512 </tests> |
513 <help> | 513 <help> |
514 **What it does** | 514 **What it does** |
515 | 515 |
516 Often, a VCF containing many samples and/or variants will need to be subset in order to facilitate certain analyses (e.g. comparing and contrasting cases vs. controls; extracting variant or non-variant loci that meet certain requirements, displaying just a few samples in a browser like IGV, etc.). SelectVariants can be used for this purpose. Given a single VCF file, one or more samples can be extracted from the file (based on a complete sample name or a pattern match). Variants can be further selected by specifying criteria for inclusion, i.e. "DP > 1000" (depth of coverage greater than 1000x), "AF < 0.25" (sites with allele frequency less than 0.25). These JEXL expressions are documented in the Using JEXL expressions section (http://www.broadinstitute.org/gsa/wiki/index.php/Using_JEXL_expressions). One can optionally include concordance or discordance tracks for use in selecting overlapping variants. | 516 Often, a VCF containing many samples and/or variants will need to be subset in order to facilitate certain analyses (e.g. comparing and contrasting cases vs. controls; extracting variant or non-variant loci that meet certain requirements, displaying just a few samples in a browser like IGV, etc.). SelectVariants can be used for this purpose. Given a single VCF file, one or more samples can be extracted from the file (based on a complete sample name or a pattern match). Variants can be further selected by specifying criteria for inclusion, i.e. "DP > 1000" (depth of coverage greater than 1000x), "AF < 0.25" (sites with allele frequency less than 0.25). These JEXL expressions are documented in the `Using JEXL expressions section <http://gatkforums.broadinstitute.org/discussion/1255/what-are-jexl-expressions-and-how-can-i-use-them-with-the-gatk>`_. One can optionally include concordance or discordance tracks for use in selecting overlapping variants. |
517 | 517 |
518 For more information on using the SelectVariants module, see this `tool specific page <http://www.broadinstitute.org/gatk/gatkdocs/org_broadinstitute_sting_gatk_walkers_variantutils_SelectVariants.html>`_. | 518 For more information on using the SelectVariants module, see this `tool specific page <http://www.broadinstitute.org/gatk/gatkdocs/org_broadinstitute_sting_gatk_walkers_variantutils_SelectVariants.html>`_. |
519 | 519 |
520 To learn about best practices for variant detection using GATK, see this `overview <http://www.broadinstitute.org/gatk/guide/topic?name=best-practices>`_. | 520 To learn about best practices for variant detection using GATK, see this `overview <http://www.broadinstitute.org/gatk/guide/topic?name=best-practices>`_. |
521 | 521 |