Biology

Alphabetical list of available ULHPC software belonging to the 'bio' category. To load a software of this category, use: module load bio/<software>[/<version>]

Software Versions Swsets Architectures Clusters Description
ABySS 2.2.5 2020b broadwell, epyc, skylake aion, iris Assembly By Short Sequences - a de novo, parallel, paired-end sequence assembler
BEDTools 2.29.2, 2.30.0 2019b, 2020b broadwell, skylake, epyc iris, aion BEDTools: a powerful toolset for genome arithmetic. The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. The utilities are largely based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM.
BLAST+ 2.11.0, 2.9.0 2020b, 2019b broadwell, epyc, skylake aion, iris Basic Local Alignment Search Tool, or BLAST, is an algorithm for comparing primary biological sequence information, such as the amino-acid sequences of different proteins or the nucleotides of DNA sequences.
BWA 0.7.17 2019b, 2020b broadwell, skylake, epyc iris, aion Burrows-Wheeler Aligner (BWA) is an efficient program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome.
BamTools 2.5.1 2019b, 2020b broadwell, skylake, epyc iris, aion BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files.
BioPerl 1.7.2, 1.7.8 2019b, 2020b broadwell, skylake, epyc iris, aion Bioperl is the product of a community effort to produce Perl code which is useful in biology. Examples include Sequence objects, Alignment objects and database searching objects.
Bowtie2 2.3.5.1, 2.4.2 2019b, 2020b broadwell, skylake, epyc iris, aion Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes. Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes.
FastQC 0.11.9 2019b, 2020b broadwell, skylake, epyc iris, aion FastQC is a quality control application for high throughput sequence data. It reads in sequence data in a variety of formats and can either provide an interactive application to review the results of several different QC checks, or create an HTML based report which can be integrated into a pipeline.
GROMACS 2019.4, 2019.6, 2020, 2021, 2021.2 2019b, 2020b broadwell, skylake, gpu, epyc iris, aion GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles. This is a CPU only build, containing both MPI and threadMPI builds for both single and double precision. It also contains the gmxapi extension for the single precision MPI build.
HTSlib 1.10.2, 1.12 2019b, 2020b broadwell, skylake, epyc iris, aion A C library for reading/writing high-throughput sequencing data. This package includes the utilities bgzip and tabix
Jellyfish 2.3.0 2019b broadwell, skylake iris Jellyfish is a tool for fast, memory-efficient counting of k-mers in DNA.
SAMtools 1.10, 1.12 2019b, 2020b broadwell, skylake, epyc iris, aion SAM Tools provide various utilities for manipulating alignments in the SAM format, including sorting, merging, indexing and generating alignments in a per-position format.
Salmon 1.1.0 2019b broadwell, skylake iris Salmon is a wicked-fast program to produce a highly-accurate, transcript-level quantification estimates from RNA-seq data.
TopHat 2.1.2 2019b, 2020b broadwell, skylake, epyc iris, aion TopHat is a fast splice junction mapper for RNA-Seq reads.
Trinity 2.10.0 2019b broadwell, skylake iris Trinity represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-Seq data. Trinity combines three independent software modules: Inchworm, Chrysalis, and Butterfly, applied sequentially to process large volumes of RNA-Seq reads.
kallisto 0.46.1 2019b broadwell, skylake iris kallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads.

Last update: March 4, 2024