comparison news.txt @ 0:061c9e133d1e draft

planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/stacks2 commit 98327d2948ae1ccb5aef5db9ab88605fd74a0de7-dirty
author matthias
date Thu, 29 Nov 2018 11:43:50 -0500
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:061c9e133d1e
1 Stacks 2.2 - Aug 22, 2018
2 --------------------------
3 Feature: Added the --bestrad flag to process_radtags. When used it will look for reads that need transposed
4 before they are processed.
5 Feature: gstacks: New option --max-debrujin-reads to control the construction of the de Bruijn graph; replaces
6 --min-kmer-freq which is now deprecated
7 Bugfix: Added run number to output FASTQ headers in process_radtags to make sure read IDs are always unique.
8
9 Stacks 2.1 - June 25, 2018
10 --------------------------
11 Bugfix: Fixed a performance regression in sstacks. Recent changes in sstacks made it more likely to
12 invoke the gapped algorithm to match to the catalog. In some cases, matches to the catalog
13 would be marked as ambiguous alignments and dropped from the next stages of analysis due to
14 differences in CIGAR strings from the gapped alignment.
15 Feature: ustacks Changed --high_cov_thres default value from 2.0 to 3.0.
16 Feature: gstacks: Changed --min-kmer-freq default value from 0.05 to 0.01.
17 Feature: Added further checks on zlib calls.
18
19 Stacks 2.0 - Apr 23, 2018
20 -------------------------
21 Feature: modified cstacks and ustacks gapped alignment algorithms to always align the two stacks/alleles
22 with the most k-mers in common, removed the previous minimum k-mer limit.
23 Feature: modified cstacks so that when a sample locus matches two or more catalog loci, those catalog
24 loci are combined, or rolled-up, reducing undermerged loci that generate excess homozygote calls.
25 Feature: modified tsv2bam so that when two loci from the same sample match the same catalog locus, those
26 loci are combined.
27 Bugfix: corrected BbvCI restriction enzyme to add the missing negative strand sequence.
28 Bugfix: corrected the catalog writing routines for unzipped output files to include missing column.
29 Bugfix: populations: Fixed the filtering of monomorphic loci.
30 Bugfix: populations: Now preserving sample ordering in all outputs.
31 Bugfix: Fixed ICPC compilation.
32
33 2.0b - May 1, 2018
34 Feature: gstacks: Removed the assertion that the first basepair of each locus should
35 be part of a cutsite (now a warning).
36 Feature: gstacks: The reported effective coverage is now a more realistic weighted mean.
37 Bugfix: populations: Fixed STRUCTURE output being corrupted for some unordereds population maps.
38
39 Stacks 2.0 Beta 10 - Apr 10, 2018
40 ---------------------------------
41 Feature: Improved gapped alignment for secondary reads in ustacks.
42 Feature: Improved populations performance.
43 Feature: Added enzymes Cac8I, MslI.
44 Feature: Made population maps more tolerant to spurious extra spaces and lines.
45 Feature: populations: VCF output: changed the format of the catalog locus field and made the column 1-based.
46 Feature: gstacks: increased haplotyping rates by adding a filtering of spurious SNPs step.
47 Bugfix: Fixed populations dramatric slow-down on datasets with more than several hundred samples.
48 Bugfix: Restored the NS/locus distribution in populations's distributions log.
49 Bugfix: Fixed the populations --radpainter export.
50 Bugfix: stacks-dist-extract: Fixed OSX compatibility.
51 Bugfix: Fixed breaking bug in populations --in-vcf mode filtering statistics.
52
53 Stacks 2.0 Beta 9 - Mar 12, 2018
54 --------------------------------
55 Feature: Cleaned up tags/snps/alleles/matches files. We removed the batch ID from ustacks and cstacks
56 output, and the deprecated log likelihood fields from ustacks and cstacks. We also removed
57 the chromosome/bp/strand fields as they are no longer used in these files.
58 Feature: Renamed gstacks output files that represent the new components of the catalog:
59 gstacks.fa.gz => catalog.fa.gz; gstacks.vcf.gz => catalog.calls
60 Feature: Removed read length restrictions from ustacks/cstacks/sstacks core, reads/loci can vary in
61 length throughout the pipeline.
62 Feature: Reimplemented PLINK export format for the populations program.
63 Bugfix: Updated to HTSLib 1.7; changed to a custom build system that will work with the Stacks build
64 system.
65 Bugfix: Made gapped alignments mandatory in ustacks, cstacks, and sstacks. Added check for frameshift
66 at 3' end of the read -- if found, a match is deferred to the gapped aligner.
67
68 Stacks 2.0 Beta 8 - Feb 03, 2018
69 --------------------------------
70 Feature: populations: Now calculated deviation from Hardy-Weinberg equilibrium at the SNP level
71 (using an exact test), and at the haplotype level (using Guo+Thompson's MCMC algorithm).
72 Feature: populations: Added an export type for FineRADStructure.
73 Feature: populations: Added the GQ/GL fields in the VCF SNPs output.
74 Feature: gstacks: Made the default behavior regarding paired-end reads more logical (in
75 reference-based mode --paired has been replaced with --unpaired).
76 Feature: gstacks: Added details about samples and coverages to the log outputs.
77 Feautre: Added enzymes NspI, BbvCI, fixed BfuCI.
78 Bugfix: corrected a major performance bottleneck in populations when smoothing population statistics
79 across the genome.
80 Bugfix: populations: The VCF output now preserves the input sample order.
81 Bugfix: gstacks: Fixed the handling of a rare special case in the PCR duplicates code.
82 Bugfix: gstacks: Fixed 100% being added to all per-thread timings.
83
84 Stacks 2.0 Beta 7 - Dec 29, 2017
85 --------------------------------
86 Feature: gstacks: Added an option to remove PCR duplicates based on insert
87 size (--rm-pcr-duplicates, plus the related --rm-unpaired-reads).
88 Feature: populations: Added a haplotype Genepop export.
89 Feature: populations: improved the help; changed the output names for SNP
90 files to 'populations.snps.EXT'; added option --no_hap_exports.
91 Feature: gstacks and populations: Clarified the logs; moved distributions
92 to a separate '.xlog' file and added script stacks-xlog-extract.
93 Feature: gstacks: Tweaked the help/interface; especially, replaced --spacer
94 with --suffix (for BAM directory input).
95 Feature: Added enzymes BfuI and HinP1.
96 Feature: Added option --inline_null to clone_filter.
97 Bugfix: gstacks: Fixed a typo preventing the paired reads from being merged.
98 Bugfix: populations: Fixed a segfault that occurred with some large datasets.
99 Bugfix: Made VCF outputs more standard compliant.
100 Bugfix: populations: Repaired --fasta_samples and --fasta_samples_raw.
101 Bugfix: populations: Fixed population aborting at the end of the run
102 when an export option was specified multiple times.
103 Bugfix: gstacks: Adjusted progression report for catalog asymmetry.
104 Bugfix: Fixed installation of stacks-integrate-alignments on MacOS.
105
106 Stacks 2.0 Beta 6 - Dec 02, 2017
107 --------------------------------
108 Feature: Implmented the VCF haplotypes output.
109 Bugfix: Corrected asset failure in populations when exporting data for genepop or structure output.
110
111 Stacks 2.0 Beta 5 - Nov 27, 2017
112 --------------------------------
113 Feature: Reimplemented structure, phylip, and phylip_var exports.
114 Bugfix: Tightened up the overlap algorithm to require 80% of overlapping sequence to be
115 aligned and of the aligned sequence, 80% must be identities.
116 Bugfix: Fixed segfault in gstacks when compiled with CLANG on OS X.
117 Bugfix: gstacks: Fixed how misphasings are reported.
118
119 Stacks 2.0 Beta 4 - Nov 07, 2017
120 --------------------------------
121 Bugfix: Continued improving overlap algorithm to join SE and PE contigs.
122 Bugfix: Improved build system to handle new timing functions in gstacks.
123
124 Stacks 2.0 Beta 3 - Nov 01, 2017
125 --------------------------------
126 Feature: Added output to populations describing mean PE contig size and mean number of
127 genotyped sites per locus, which reflects the current filtering paramters.
128 Feature: Improved the output of gstacks and populations.
129 Feature: Added script `stacks-integrate-alignments`.
130 Bugfix: made further improvements to the single-end/paired-end locus overlapping algorithm.
131 Bugfix: fixed all depths being null in populations' VCF output.
132 Bugfix: Numerically tweaked the marukilow model to remove a limit case.
133
134 Stacks 2.0 Beta 2 - Oct 19, 2017
135 --------------------------------
136 Feature: gstacks: Made it possible to read from multiple BAM files at the same time; modified the
137 interface accordingly.
138 Feature: gstacks: Parallelized the reference-based mode.
139 Feature: gstacks: Added various statistics & improvements to the log output.
140 Feature: gstacks: Improved how the forward & paired-end reads are merged (in denovo mode; no more trimming).
141 Feature: populations: Added code to calculate the overlap between RAD loci when a reference is available.
142 Feature: populations: Added VCF ouput (--vcf).
143 Feature: Updated the denovo_map.pl and ref_map.pl wrappers, samples must now be specified using --samples and --popmap.
144 Bugfix: Fixed three memory leaks in populations; improved reference-aligned batch logic.
145 Bugfix: Improved overlapping code in gstacks to merge more single and paired-end contigs together.
146 Bugfix: Now compiles on Apple OS X.
147 Bugfix: Fixed a bug that skewed the fixed-site (no-SNP) likelihood in the marukilow model.
148
149 Stacks 2.0 Beta 1 - Oct 09, 2017
150 --------------------------------
151 Feature: Paired-end sequencing data can be utilized fully. In particular, when the shearing-based
152 protocol is used, the software will assemble a local contig from the paired reads across
153 the population, possibly overlap it with the forward-reads region, then align all reads to the
154 assembled contig. This new approach also fully supports double-digest protocols.
155 Feature: Haplotype calling and diploidy-violation dectection now rely on a novel, more powerful algorithm.
156 Feature: SNP and genotype-calling now uses the diploid models of Maruki and Lynch (2017).
157 Feature: The rxstacks program has been replaced with the gstacks program, and there is no need to re-run
158 some of the earlier steps of the pipeline anymore.
159 Feature: The memory footprint of the populations program has been considerably reduced and can be scaled
160 for any size data set.
161 Feature: The reference-based pipeline has been simplified, and now only comprises two steps: gstacks and populations.
162 Feature: Added --null_inline mode to clone_filter (and process_radtags) for previously unseen type
163 of oligo combination.
164
165 Stacks 1.48 - Nov 20, 2017
166 ---------------------------
167 Feature: Added HinP1I restriction enzyme.
168 Feature: Added --null_inline mode to clone_filter (and process_radtags) for previously unseen type
169 of oligo combination.
170
171 Stacks 1.47 - Sept 06, 2017
172 ---------------------------
173 Feature: Improved populations's fasta output options (especially,
174 added a option to export locus consensus sequences).
175 Feature: denovo_map.pl and red_map.pl now stop if a component
176 of the pipeline fails.
177 Feature: Improved the output of denovo_map.pl and ref_map.pl.
178 Bugfix: Added a format check in Fasta/GzFasta to avoid a potential
179 segfault when working on FastQ files.
180 Bugfix: Fixed a bug in count_fixed_catalog_snps.py that could cause
181 overwrites when working with uncompressed files.