Mercurial > repos > lparsons > htseq_count
comparison htseq-count.xml @ 6:08a11d1eaec6
Updated HTSEQ package to version 0.5.4p1, attempted to fix galaxy install where lib64 directory does not exist
| author | lparsons |
|---|---|
| date | Mon, 11 Mar 2013 12:42:42 -0400 |
| parents | 0a835934d792 |
| children | 1615d0b5b809 |
comparison
equal
deleted
inserted
replaced
| 5:0a835934d792 | 6:08a11d1eaec6 |
|---|---|
| 1 <tool id="htseq_count" name="htseq-count" version="0.3"> | 1 <tool id="htseq_count" name="htseq-count" version="0.3.1"> |
| 2 <description> - Count aligned reads in a BAM file that overlap features in a GFF file</description> | 2 <description> - Count aligned reads in a BAM file that overlap features in a GFF file</description> |
| 3 <version_command>htseq-count -h | grep version | sed 's/^\(.*\)*\(version .*\)\./\2/'</version_command> | 3 <version_command>htseq-count -h | grep version | sed 's/^\(.*\)*\(version .*\)\./\2/'</version_command> |
| 4 <requirements> | 4 <requirements> |
| 5 <requirement type="package" version="1.6.2">numpy</requirement> | 5 <requirement type="package" version="1.6.2">numpy</requirement> |
| 6 <requirement type="package" version="0.5.3p9">htseq</requirement> | 6 <requirement type="package" version="0.5.4p1">htseq</requirement> |
| 7 <requirement type="package" version="0.1.18">samtools</requirement> | 7 <requirement type="package" version="0.1.18">samtools</requirement> |
| 8 <requirement type="package" version="1.56.0">picard</requirement> | 8 <requirement type="package" version="1.56.0">picard</requirement> |
| 9 </requirements> | 9 </requirements> |
| 10 <command> | 10 <command> |
| 11 ##set up input files | 11 ##set up input files |
| 176 each gene is considered here as the union of all its exons. One may also consider | 176 each gene is considered here as the union of all its exons. One may also consider |
| 177 each exon as a feature, e.g., in order to check for alternative splicing. For | 177 each exon as a feature, e.g., in order to check for alternative splicing. For |
| 178 comparative ChIP-Seq, the features might be binding regions from a pre-determined | 178 comparative ChIP-Seq, the features might be binding regions from a pre-determined |
| 179 list. | 179 list. |
| 180 | 180 |
| 181 **Paired-end Data MUST be sorted by QUERY NAME first** | |
| 182 | |
| 183 This tool requires that paired-end data be sorted by query name, which is NOT the default for Galaxy. Using the Picard Paired Read Mate Fixer with Query name sort FIRST is required for paired end data. | |
| 184 | |
| 185 | 181 |
| 186 Overlap Modes | 182 Overlap Modes |
| 187 ------------- | 183 ------------- |
| 188 | 184 |
| 189 Special care must be taken to decide how to deal with reads that overlap more than one feature. | 185 Special care must be taken to decide how to deal with reads that overlap more than one feature. |
| 193 The following figure illustrates the effect of these three modes: | 189 The following figure illustrates the effect of these three modes: |
| 194 | 190 |
| 195 .. image:: /static/images/count_modes.png | 191 .. image:: /static/images/count_modes.png |
| 196 :width: 500 | 192 :width: 500 |
| 197 | 193 |
| 194 | |
| 198 Strandedness | 195 Strandedness |
| 199 ------------ | 196 ------------ |
| 200 | 197 |
| 201 **Important**: The default for strandedness is yes. If your RNA-Seq data has not been made with a strand-specific protocol, this causes half of the reads to be lost. Hence, make sure to set the option Stranded to 'No' unless you have strand-specific data! | 198 **Important**: The default for strandedness is yes. If your RNA-Seq data has not been made with a strand-specific protocol, this causes half of the reads to be lost. Hence, make sure to set the option Stranded to 'No' unless you have strand-specific data! |
| 199 | |
| 202 | 200 |
| 203 Output | 201 Output |
| 204 ------ | 202 ------ |
| 205 | 203 |
| 206 The script outputs a table with counts for each feature, followed by the special counters, which count reads that were not counted for any feature for various reasons, namely | 204 The script outputs a table with counts for each feature, followed by the special counters, which count reads that were not counted for any feature for various reasons, namely |
