Mercurial > repos > erinija > dnp_smooth
view dnp_smooth.xml @ 0:7e69f62cb516 draft default tip
"planemo upload commit 1a32efb8343938e8d49190003f251c78b5a58225-dirty"
author | erinija |
---|---|
date | Fri, 01 May 2020 12:10:21 +0000 |
parents | |
children |
line wrap: on
line source
<tool id="dnp_smooth" name="Smooth" version="0.1.0"> <requirements> <requirement type="package" version="1.0">dnp-fourier</requirement> </requirements> <command detect_errors="exit_code" interpreter="sh"><![CDATA[ dnp-smooth.sh "$input1" "$input2" "$input3" "$output1" ]]></command> <inputs> <param type="data" name="input1" format="tabular" label="Dinucleotide frequency profiles"/> <param name="input2" type="integer" value="3" label="Smoothing window size" /> <param name="input3" type="integer" value="4" label="Number of points to trim from both sides" /> </inputs> <outputs> <data name="output1" format="tabular" /> </outputs> <tests> <test> <param name="input1" value="smoothing-input.tabular"/> <param name="input2" value="3"/> <param name="input3" value="4"/> <output name="output1" file="smoothed-output.tabular"/> </test> </tests> <help><![CDATA[ Description:: Applies smoothing on dinucleotide profiles. Smoothing reduces noise and enhances representation of the dinucleotide frequency profiles. Smoothing is performed by moving average with chosen window size (optimal winsize=3). Smoothing script is based on a shell wrapper of the call to dnp-fourier tool which computes a periodogram of a series given as a numerical column parameters as follows: dnp-fourier -f input -o output -n {normalization 0|1|2} -l length_of_smoothing_window -t {type_of_output 1|2|3} The parameters control a type of normalization and output: Normalization 0 base normalization subtracts mean 1 linear normalization removes linear trand 2 quadratic normalization removes quadratic trend Output type 1 normalization outputs normalized original series 2 smoothing outputs smoothed original series 3 Fourier transform outputs periofogram Example:: Input tabular: pos AA AC AG AT CA CC CG CT GA GC GG GT TA TC TG TT WW SS RR YY -73 0.05664 0.0657 0.06966 0.03644 0.08026 0.09484 0.0362 0.09086 0.07084 0.04032 0.07318 0.06466 0.03862 0.06838 0.05722 0.05602 0.18772 0.24454 0.27032 0.3101 -72 0.0668 0.06476 0.0753 0.04282 0.07022 0.08034 0.03534 0.081 0.07222 0.03512 0.06774 0.0598 0.03934 0.07496 0.06628 0.06784 0.2168 0.21854 0.28206 0.30414 -71 0.063 0.0621 0.07668 0.04316 0.06926 0.07264 0.03316 0.07992 0.07546 0.03498 0.07306 0.06406 0.04182 0.07374 0.06874 0.06812 0.2161 0.21384 0.2882 0.29442 -70 0.0624 0.0643 0.07214 0.04424 0.0642 0.06998 0.03472 0.07718 0.0723 0.03982 0.07472 0.06818 0.04282 0.07674 0.06864 0.06754 0.217 0.21924 0.28156 0.29144 -69 0.0622 0.06456 0.074 0.0426 0.0661 0.07114 0.03414 0.08016 0.0703 0.03786 0.07118 0.06754 0.0421 0.07712 0.06988 0.06904 0.21594 0.21432 0.27768 0.29746 ... Output is an original input table but smoothed by moving average with given window size, tabular. ]]></help> <citations> <citation type="bibtex"> @article{pranckeviciene2020nucleosome, title={Nucleosome positioning sequence patterns as packing or regulatory. S1 Appendix}, author={Pranckeviciene, Erinija and Hosid, Sergey and Liang, Nathan and Ioshikhes, Ilya}, journal={PLoS computational biology}, volume={16}, number={1}, pages={e1007365}, year={2020}, publisher={Public Library of Science}, url = {https://doi.org/10.1371/journal.pcbi.1007365} }</citation> </citations> </tool>