# HG changeset patch # User galaxy-australia # Date 1649052530 0 # Node ID 3b8d4d04e4378aff06216992affe23686611a998 # Parent 867447554d1d4e57ca39d92a543532ff7d22403b Deleted selected files diff -r 867447554d1d -r 3b8d4d04e437 README.md --- a/README.md Mon Apr 04 05:54:33 2022 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,12 +0,0 @@ -# Cactus - -This suite contains a wrapper for the [Cactus](github.com/comparativeGenomicsToolkit/cactus) whole-genome aligner and another wrapper for exporting the data it generates. - -## Requirements - -The developers provide an official Docker container. -However, the Docker container won't run unless `/etc/passwd` is mounted ([see this issue](https://github.com/ComparativeGenomicsToolkit/cactus/issues/677)). -Because of this, it is recommended to run Cactus using Singularity. - -Cactus uses a lot of RAM. We have tested it on Galaxy using 24 GB of RAM for a -progressive alignment of three chromosome-level, 150 megabase genomes. \ No newline at end of file diff -r 867447554d1d -r 3b8d4d04e437 help.md --- a/help.md Mon Apr 04 05:54:33 2022 +0000 +++ /dev/null Thu Jan 01 00:00:00 1970 +0000 @@ -1,58 +0,0 @@ - -**What it does** - -[Cactus](https://github.com/ComparativeGenomicsToolkit/cactus) is a reference-free whole-genome multiple alignment program. -It can be used to progressively align a large number of genomes. - -**Usage** - -***Between-species mode*** - -If you are aligning genomes from **multiple species**, you need to provide a guide tree in Newick format. Cactus uses the guide tree to progressively align genomes, meaning that it doesn't need to align all possible pairs of genomes. - -A Newick-formatted tree for human, chimp and gorilla genomes looks like this: - -``` -(((human:0.006,chimp:0.006667):0.0022,gorilla:0.008825):0.0096,orang:0.01831); -``` - -The numbers are the branch lengths. - - -***Beta: Within-species mode*** - -You can also run Cactus in [pangenome mode](https://github.com/ComparativeGenomicsToolkit/cactus/blob/master/doc/pangenome.md) to align genomes of multiple individuals from the **same species**. -In this mode you will not be able to provide a guide tree. -Cactus will use [minigraph](https://github.com/lh3/minigraph) to generate a graph of the input genomes and then use the graph to order the alignments. -To use pangenome mode, select 'Within-species' in the 'Alignment mode' dropdown. - -**Input** - -The developers recommend soft-masking your genomes with RepeatMasker before running Cactus. -RepeatMasker is available on Galaxy. - -If you're using Between-species mode, you need to provide labels for the fasta files that match the leaves on the guide tree. -In the example above, you would use the label 'human' for the human fasta file. - -**Output** - -The main output of Cactus is in [HAL format](https://github.com/ComparativeGenomicsToolkit/cactus#using-the-output). -You can use the [Cactus: export](root?tool_id=cactus_export) tool to convert the Cactus output to a VG or Multiple Alignment Format (MAF) file. - - ---- - -**What it does** - -Converts the HAL output from [Cactus whole-genome multiple sequence aligner](root?tool_id=cactus_cactus) into other formats. - -**Input** - -A [HAL file](https://github.com/ComparativeGenomicsToolkit/cactus#using-the-output) from [Cactus](root?tool_id=cactus_cactus). - -**Output** - -Currently, you can export a pangenome in VG format, for use with [vg](https://github.com/vgteam/vg), and/or Multiple Alignment Format (MAF). -Note that MAF is **reference-based**, so it will contain alignments to a single reference from your list of inputs (i.e. not a graph). -By default, the first input genome will be used as the reference. -You can override this with the 'reference genome' option.