Installation
The pipelines only requires a working installation of Conda for Python 3, git and Snakemake. This guide has been tested on a clean, minimal installation of Ubuntu 19.10 and Debian Buster and should work on most Linux distributions.
MacOS and the Windows 10 Linux subsystem haven’t been tested.
Check your current setup with the following commands.
conda --version # => 5.10.0
snakemake --version # => 4.8.2
Table of contents
System Requirements
They strongly depend on the executed steps of the pipeline. You should be fine with around 200 GB of memory, 32 vCPUs and 1.5 TB of disk space for a few medium sized metagenomes.
Certain steps also can be run on rented AWS EC2 instances for example.
Miniconda
To install the Conda distribution Miniconda follow the instructions on this site.
Commands:
wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
sh Miniconda3-latest-Linux-x86_64.sh
You will need to accept the licencse and select an installation path.
If you know you accepted the licencse, you can run a non-interactive installation like this:
wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
sh Miniconda3-latest-Linux-x86_64.sh -b -p $HOME/miniconda3
# Activate base env
eval "$($HOME/miniconda3/bin/conda shell.bash hook)"
# Init for Shell functions & automatic base activation
conda init
Conda environment activation can take a while, slowing the start of a new shell/login. You can disable automatic activation with:
# From the installer, just as a reminder
conda config --set auto_activate_base false
# Afterwards you can still just run
conda activate base
Snakemake
Snakemake can be installed with the required conda distribution. Install it from the Bioconda and conda-forge channel.
# From the installer, just as a reminder
conda install -c bioconda -c conda-forge snakemake
You could also create an environment like this
conda create -n snake -c bioconda -c conda-forge snakemake
# will be placed in `~/miniconda3/envs/snake`
conda activate snake
or like this
conda create -p ./snake -c bioconda -c conda-forge snakemake
# will be placed in `snake`
conda activate ./snake
Knutt
Clone the git repository for the pipeline you want to use:
git clone https://github.com/KnuttPipeline/KnuttReads2Bins.git
git clone https://github.com/KnuttPipeline/KnuttBinAnno.git
git clone https://github.com/KnuttPipeline/KnuttBinPhylo.git
Now you can edit the configuration files (config.yml
).