Skip to content

eRN Software

The following is a list of software the HPC and eRN team maintains. Note that not all approved/maintained software may be listed here. This list will be updated occasionally, and we ask for your patience in this regard.

Software Name Description Version
ABySS ABySS is a de novo paired-end sequence assembler that is designed for short reads. The abyss bundle packages version 2.3.6. 2.3.6
antiSMASH antiSMASH is a docker container preloaded with Fermitools, the HEASARC FTOOLS, and python libraries. The module makes antiSMASH release 23.05.19.0 available. The list of preloaded software is as follows: 23.05.19.0
AquaCrop The aquacrop bundle is a python implementation of the crop-water model AquaCrop. The bundle packages version 2.2.3. 2.2.3
ARAGORN ARAGORN is used to detect tRNA genes and tmRNA genes in nucleotide sequences. The aragorn bundle packages version 1.2.41. 1.2.41
Artemis Artemis is a DNA sequence viewer and annotation tool that allows visualisation of sequence features and the results of analyses within the context of the sequence, and its six-frame translation. The artemis bundle packages version 18.2.0. 18.2.0
AUGUSTUS AUGUSTUS predicts genes in eukaryotic genomic sequences. The augustus bundle packages version 3.5.0. 3.5.0
Authoring The authoring bundle is a collection of Latex, Tex, PDF, DOCX, Markdown and text files editing and viewing software. The bundle includes the following software: unknown
BamTools BamTools provides a toolkit for handling BAM files. The bamtools bundle packages version 2.5.2. 2.5.2
Bandage Bandage is used to visualise de novo assembly graphs. The bandage bundle packages version 0.8.1. 0.8.1
Barrnap Barrnap predicts the location of ribosomal RNA genes in genomes. The barrnap bundle packages version 0.9. 0.9
BBTools BBTools is a suite of tools designed for the analysis of DNA and RNA sequence data. The bbtools bundle packages version 39.01. 39.01
BCALM BCALM constructs the compacted de Bruijn graph from sequencing data. The bcalm bundle packages version 2.2.3. 2.2.3
BCFtools BCFtools is a set of utilities that manipulate variant calls in the Variant Call Format (VCF) and its binary counterpart BCF. The bcftools bundle packages version 1.17. 1.17
BEAST and BEAST2 BEAST and BEAST 2 are cross-platform programs for Bayesian phylogenetic analysis of molecular sequences. The beast bundle packages version 1.10.4 of BEAST and version 2.6.3 of BEAST 2. 2.6.3
Bedtools The bedtools utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks. The bedtools bundle packages version 2.31.0. 2.31.0
BiG-SCAPE: Biosynthetic Gene Similarity Clustering and Prospecting Engine BiG-SCAPE constructs sequence similarity networks of Biosynthetic Gene Clusters and groups them into Gene Cluster Families. The bigscape bundle packages version 1.1.8. The PFAM database was downloaded, extracted, and processed with the following commands: bash wget http://mirror.ufs.ac.za/datasets/pfam/current_release/Pfam-A.hmm.gz gunzip Pfam-A.hmm.gz hmmpress Pfam-A.hmm 1.1.8
BLAST BLAST, Basic Local Alignment Search Tool, can be used for sequence similarity searching. The blast bundle packages NCBI Blast version 2.14.0. 2.14.0
Bowtie Bowtie and Bowtie 2 are tools for aligning sequencing reads to long reference sequences. The bowtie bundle has two variants one for version 1.3.1 and one for version 2.5.1. 2.5.1
BUSCO BUSCO can be used to assess genome assembly and annotation completeness. The busco bundle packages version 5.4.7. 5.4.7
BWA BWA, Burrow-Wheeler Aligner, is used for short-read alignment. The bwa bundle packages version 0.7.17. 0.7.17
Canu Canu is a fork of the Celera Assembler designed for high-noise single-molecule sequencing. The canu bundle packages version 2.2. 2.2
CASA CASA, the Common Astronomy Software Applications package, is used for data processing of radio telescopes. The version packaged is 6.5.5. 6.5.5
CD-HIT CD-HIT is used for clustering biological sequences to reduce sequence redundancy and improve the performance of other sequence analyses. The cdhit bundle packages version 4.8.1. 4.8.1
CheckM CheckM is a set of tools for assessing the quality of genomes recovered from isolates, single cells, or metagenomes. The checkmgenome bundle packages version 1.2.2. 1.2.2
CIAO - Chandra Interactive Analysis of Observations The ciao bundle contains the following versions of the listed software. 4.15.1
CLC Genomics Workbench CLC Genomics Workbench is a complete toolkit for genomics, transcriptomics, epigenomics, metagenomic. The bundle packages version 24.0. 23.0.5
24.0
Clustal Omega Clustal Omega is used to align multiple amino acid or nucleotide sequences. The clustalo bundle packages version 1.2.4. 1.2.4
Clustal W Clustal W is used to align multiple amino acid or nucleotide sequences. The clustalw bundle packages version 2.1. 2.1
CONCOCT CONCOCT is used for unsupervised binning of metagenomic contigs by using nucleotide composition. The concoct bundle packages version 1.1.0. 1.1.0
Anaconda The conda bundle is used to quickly create bundles that package software avaialable in the Anaconda repositories. The module makes variants using Rocky Linux and Fedora as operating systems available. Python 3.9 and 3.10 are supported. unknown
CoolProp CoolProp is a thermophysical property database and can be used for fluid dynamic computations.The bundle packages the following software: 6.4.3
Cufflinks Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples. The cufflinks bundle packages version 2.2.1. 2.2.1
NCBI Datasets NCBI Datasets cli allows you to find and download sequence, annotation, and metadata for genes and genomes. The datasetscli bundle packages NCBI Datasets cli version 15.13.1. 15.13.1
DIAMOND DIAMOND is a sequence aligner for protein and translated DNA searches. The diamond bundle packages version 2.1.6. 2.1.6
DiTing DiTing recognizes biogeochemical cycles from environmental omic data accurately and efficiently. The diting bundle packages version 0.9. 0.9
ea-utils Command-line tools for processing biological sequencing data, for example: Barcode demultiplexing, adapter trimming, etc. The eautils bundle packages version 1.1.2.537. 1.1.2.537
EukRep EukRep is used to classify of Eukaryotic and Prokaryotic sequences from metagenomic datasets. The eukrep bundle packages version 0.6.7. 0.6.7
fatsp fastp provides all-in-one preprocessing for FastQ files. The fastp bundle packages version 0.23.2. 0.23.2
FastQC FastQC is a quality control tool for high throughput sequence data. The fastqc bundle packages version 0.12.1. 0.12.1
FastTree FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. The fasttree bundle packages version 2.1.11. 2.1.11
Fedora Fedora is a distribution of Linux. The module makes variants of Fedora 37 with developer tools preinstalled and without available. 37
Fermibottle Fermibottle is a docker container preloaded with Fermitools, the HEASARC FTOOLS, and python libraries. The module makes Fermibottle release 23.03.10.0 available. The list of preloaded software is as follows: 2.2.0
FigTree FigTree is a graphical viewer of phylogenetic trees. The figtree bundle packages version 1.4.4. 1.4.4
Flye Flye is a de novo assembler for single-molecule sequencing reads. The flye bundle packages version 2.9.2. 2.9.2
FragGeneScan FragGeneScan finds fragmented genes in short reads. It can also be used to predict prokaryotic genes in incomplete assemblies or complete genomes. The fraggenescan bundle packages version 1.31. 1.31
freebayes freebayes is a Bayesian genetic variant detector. The freebayes bundle packages version 1.3.6. 1.3.6
Genome Analysis Toolkit (GATK) The Genome Analysis Toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. The gatk bundle packages version 4.4.0.0. 4.4.0.0
Gblocks Gblocks eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences. The gblocks bundle packages version 0.91b. 0.91b
Geneious Prime Geneious Prime is a comprehensive suite of molecular biology and sequence analysis tools. The bundle packages version 2023.2.1. 2023.2.1
Gephi Gephi is software that cab be used to visualise and explore all kinds of graphs and networks. The version packaged is 0.10.1. 0.10.1
GrADS GrADS, Grid Analysis and Display System, is used to access, manipulate and visualize geophysical data. The latest and only version available, on the eRN, is 2.2.1. 2.2.1
GRRG - Grid Related Research Group The grrg bundle is used for data and network analysis of electrical grids. The module makes variants with Python 3.9 and 3.10 available. unknown
Gubbins Gubbins is an algorithm that iteratively identifies loci containing elevated densities of base substitutions while concurrently constructing a phylogeny based on the putative point mutations outside of these regions. The gubbins bundle packages version 3.3. 3.3
Guppy Guppy is a data toolkit for Oxford Nanopore Technologies' sequencing platforms. The bundle packages version 6.5.7. 6.5.7
HDF5 Tools HDF5 Tools, a collection of tools to view and manipulate h5 files. The hdf5tools bundle packages the following software: 1.14.3
Hifiasm Hifiasm is a fast haplotype-resolved de novo assembler for PacBio HiFi reads. The hifiasm bundle packages version 0.19.6. 0.19.6
HISAT2 HISAT2 is an alignment program for mapping NGS reads against a reference genome. The hisat2 bundle packages version 2.2.1. 2.2.1
HMMER HMMER provides tools for making probabilistic models of protein and DNA sequence domain families. The hmmer bundle packages version 3.3.2. 3.3.2
Canu Canu is a fork of the Celera Assembler designed for high-noise single-molecule sequencing. The canu bundle packages version 2.2. 2.2
IDBA IDBA is the basic iterative de Bruijn graph assembler for second-generation sequencing reads. The idba bundle packages version 1.1.3. 1.1.3
IGV The Integrative Genomics Viewer (IGV) is a tool for the visual exploration of genomic data. The igv bundle packages version 2.13.2. 2.13.2
IQ-TREE IQ-TREE can be used for phylogenomic inference. The iqtree bundle packages version 2.2.2.3. 2.2.2.3
IVA IVA is a de novo assembler designed to assemble virus genomes that have no repeat sequences. The iva bundle packages version 1.0.8. 1.0.8
Julia Julia, is a programming language often used in computational science. 1.8.5
1.9.1
Kaiju Kaiju is used for the taxonomic classification of high-throughput sequencing reads. The kaiju bundle packages version 1.9.2. 1.9.2
Kraken 2 Kraken 2 is a taxonomic classification system using exact k-mer matches to achieve high accuracy and fast classification speeds. The kraken bundle packages the following software: 2.1.3
Krona Tools Krona Tools is a set of scripts to create Krona charts from several Bioinformatics tools as well as from text and XML files. The krona bundle packages version 2.8.1. 2.8.1
LoFreq LoFreq is a variant-caller for inferring SNVs and indels from next-generation sequencing data. The lofreq bundle packages version 2.1.5. 2.1.5
MAFFT MAFFT is used to align multiple amino acid or nucleotide sequences. The mafft bundle packages version 7.520. 7.520
MEGA MEGA is a program for conducting statistical analysis of molecular evolution and for constructing phylogenetic trees. The mega bundle packages version 11.0.13. 11.0.13
MEGAHIT MEGAHIT is an NGS assembler. The megahit bundle packages version 1.2.9. 1.2.9
MetaBAT MetaBAT is a statistical framework for reconstructing genomes from metagenomic data. The metabat bundle packages version 2.15. 2.15
Kraken 2 Kraken 2 is a taxonomic classification system using exact k-mer matches to achieve high accuracy and fast classification speeds. The kraken bundle packages the following software: 2.1.3
MetaPhlAn MetaPhlAn is a tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data with species level resolution. The metaphlan bundle packages version 4.0.6. 4.0.6
Metview Metview is an application designed to be a complete working environment for both the operational and research meteorologist. The metview bundle packages version 5.19.1. 5.19.1
Minimap2 Minimap2 is a sequence alignment program that aligns DNA or mRNA sequences against a large reference database. The minimap2 bundle packages version 2.26. 2.26
ModelTest-NG ModelTest-NG can be used to select the best-fit model of evolution for DNA and protein alignments. The modeltestng bundle packages version 0.1.7. 0.1.7
MrBayes MrBayes is used for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. The mrbayes bundle packages version 3.2.7. 3.2.7
MultiQC MultiQC aggregates results from bioinformatics analyses across many samples into a single report. The multiqc bundle packages version 1.14. 1.14
Muscle Muscle is used to align multiple amino acid or nucleotide sequences. The muscle bundle packages version 5.1. 5.1
Nextclade Nextclade is used for viral genome alignment, mutation calling, clade assignment, quality checks and phylogenetic placement. The nextclade bundle packages version 2.14.0. 2.14.0
nf-core nf-core is a community effort to collect a curated set of analysis pipelines built using Nextflow. The bundle packages the dependencies required by nf-core, see the table below for the versions of the included dependencies. 1.3.23
GNU Octave GNU Octave has a powerful mathematics-oriented syntax with built-in 2D/3D plotting and visualization tools It provides a convenient command line interface for solving linear and nonlinear problems numerically, and for performing other numerical experiments using a language that is mostly compatible with Matlab. The octave bundle packages version 8.3.0. 8.3.0
OrthoFinder OrthoFinder is a platform for comparative genomics. The orthofinder bundle packages version 2.5.5. 2.5.5
PEAR - Paired-End reAd mergeR PEAR is an ultrafast, memory-efficient and highly accurate pair-end read merger. The pear bundle packages version 0.9.6. 0.9.6
PhyML PhyML is a phylogeny software based on the maximum-likelihood principle. The phyml bundle packages version 3.3.20211231. 3.3.20211231
Picard Picard is a set of tools for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF. The picard bundle packages version 3.1.0. 3.1.0
PLINK PLINK is a whole genome association analysis toolset. The plink bundle has three variants one for version 1.07, one for version 1.90b6.21, and one for version 2.00a3.7. 2.00a3.7
Anaconda The conda bundle is used to quickly create bundles that package software available in the Anaconda repositories. The module makes variants using Rocky Linux and Fedora as operating systems available. Python 3.9 and 3.10 are supported. unknown
PRANK PRANK is a probabilistic multiple alignment program for DNA, codon, and amino-acid sequences. PRANK version 170427 is packaged. 170427
Prokka Prokka is a software tool to annotate bacterial, archaeal and viral genomes. The prokka bundle packages version 1.14.6. 1.14.6
Purge_Dups Purge_Dups can be used to purge haplotigs and overlaps in an assembly based on read depth. The purgedups bundle packages version 1.2.6. 1.2.6
PyMLST - A Python Mlst Local Search Tool PyMLST is used to perform core or whole genome Multilocus Sequence Typing. The bundle packages the dependencies required by PyMLST, see the table below for the versions of the included dependencies. 2.1.3
PyMOL PyMOL is a molecular visualization system. The pymol bundle packages version 2.5.0. 2.5.0
GRRG - Grid Related Research Group The grrg bundle is used for data and network analyis of electrical grids. The module makes variants with Python 3.9 and 3.10 available. unknown
QIIME2 The QIIME 2 bundle is used for next-generation microbiome analysis. 2023.2.0
2023.5.1
QualiMap QualiMap facilitates the quality control of alignment sequencing data and its derivatives like feature counts. unknown
RAxML RAxML, Randomized Axelerated Maximum Likelihood, can be used for maximum likelihood-based phylogenetic analyses. The raxml bundle packages version 8.2.12. 8.2.12
Remmina Remmina s a Remote Desktop Client that supports SSH, VNC and RDP. The bundle packages version 1.4.31. 1.4.31
Fermibottle Fermibottle is a docker container preloaded with Fermitools, the HEASARC FTOOLS, and python libraries. The module makes Fermibottle release 23.05.19.0 available. The list of preloaded software is as follows: 23.05.19.0
Roary Roary is a high speed stand alone pan genome pipeline. The roary bundle packages version 3.13.0. 3.13.0
Rocky Linux Rocky Linux is a distribution of Linux based on RHEL. The module makes variants of Rocky Linux 8 and 9, with and without developer tools preinstalled available. 9
Rstudio Rstudio is an IDE for R a language used for statistical programming and data science. The bundle also includes R with some often used packages preinstalled. The following versions are installed: 2023.03.0+386
2023.06.2+561
RTG Tools RTG Tools includes several useful utilities for dealing with VCF files and sequence data. The rtgtools bundle packages version 3.12.1. 3.12.1
S3A - Scalable and Accurate Annotated Assembly S3A is a gene-targeted assembler based on the Overlap-Layout Consensus paradigm to reconstruct coding regions from domain annotated metagenomic sequence reads. unknown
SageMath The sage, "System for Algebra and Geometry Experimentation", bundle can be used for algebra, combinatorics, graph theory, numerical analysis, number theory, calculus and statistics. The module makes variants with Python 3.9, with SageMath 9.7, and 3.10, with SageMath 9.8, available. 9.8
MAFFT Salmon is used to produce a highly-accurate, transcript-level quantification estimates from RNA-seq data. The salmon bundle packages version 1.10.1. 1.10.1
Samtools The samtools bundle can be used to manipulate NGS data. Samtools version 1.17 is packaged. 1.17
SeqKit SeqKit is a toolkit for FASTA/Q file manipulation. The seqkit bundle packages version 2.5.0. 2.5.0
sickle sickle is a windowed adaptive trimming tool for FASTQ files using quality. The sickle bundle packages version 1.33. 1.33
3D Slicer 3D Slicer is used for visualization, processing, segmentation, registration, and analysis of medical, biomedical, and other 3D images and meshes. The version packaged is 5.6.0. 5.6.0
SPAdes SPAdes, Saint-Petersburg genome assembler, is an assembly toolkit containing various assembly pipelines. The spades bundle packages version 3.15.5. 3.15.5
SRA Toolkit The SRA Toolkit and SDK is a collection of tools and libraries for using data in the INSDC Sequence Read Archives. The sratools bundle packages version 3.0.5. 3.0.5
Stress-NG The stressng bundle is used for benchmarking and stress testing hardware. unknown
StringTie StringTie can be used for transcript assembly and quantification for RNA-Seq. The stringtie bundle packages version 2.2.1. 2.2.1
TaxonKit TaxonKit is a practical and efficient NCBI taxonomy toolkit. The taxonkit bundle packages version 0.15.0. 0.15.0
TempEst TempEst is used to investigate the temporal signal and 'clocklikeness' of molecular phylogenies. The version packaged is 1.5.3. 1.5.3
TopHat TopHat is a fast splice junction mapper for RNA-Seq reads. The tophat bundle packages version 2.1.1. 2.1.1
Tracer Tracer is used to visualise and analyse the MCMC trace files generated through Bayesian phylogenetic inference. The tracer bundle packages version 1.7.2. 1.7.2
treePL treePL is a phylogenetic penalized likelihood program. treePL version 2.6.3 is packaged. 2.6.3
Trim Galore Trim Galore is a wrapper around Cutadapt and FastQC to consistently apply adapter and quality trimming to FastQ files. The trimgalore bundle packages version 0.6.10. 0.6.10
Trimmomatic Trimmomatic is a flexible trimmer for Illumina Sequence Data. The trimmomatic bundle packages version 0.39. 0.39
Trinity Trinity assembles transcript sequences from Illumina RNA-Seq data. The trinity bundle packages version 2.15. 2.15
Remmina Remmina s a Remote Desktop Client that supports SSH, VNC and RDP. The bundle packages version 1.4.31. 1.4.31
1.5.3
Unicycler Unicycler is an assembly pipeline for bacterial genomes. The unicycler bundle packages version 0.5.0. 0.5.0
VCFtools VCFtools provides easily accessible methods for working with complex genetic variation data in the form of VCF files. The vcftools bundle packages version 0.1.16. 0.1.16
Velvet Velvet is a short read de novo assembler using de Bruijn graphs. The velvet bundle packages version 1.2.10. 1.2.10
GrADS GrADS, Grid Analysis and Display System, is used to access, manipulate and visualize geophysical data. The latest and only version available, on the ERN, is 2.2.1. 2.2.1
TempEst TempEst is used to investigate the temporal signal and 'clocklikeness' of molecular phylogenies. The version packaged is 1.5.3. 1.5.3
Wine Wine is a compatibility layer capable of running Windows applications on Linux. The bundle packages version 8.0.1. 8.0.1