| R |
The R Project for Statistical Computing |
| BioPerl |
The Bioperl Project is an international association of users & developers of open source Perl tools for bioinformatics, genomics and life science |
| Biopython |
The Biopython Project is an international association of developers of freely available Python tools for computational molecular biology |
| NCBI BLAST+ |
Basic Local Alignment Search Tool |
| DIAMOND |
Fast and sensitive protein alignment using DIAMOND |
| HMMER |
Accelerated Profile HMM Searches |
| CD-HIT |
CD-HIT: a fast program for clustering and comparing large sets of protein or nucleotide sequences |
| VSEARCH |
VSEARCH: a versatile open source tool for metagenomics |
| MUSCLE |
MUSCLE: multiple sequence alignment with high accuracy and high throughput |
| MAFFT |
MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform |
| JAligner |
Open-source Java implementation of the Smith-Waterman algorithms for biological pairwise sequence alignment |
| BWA |
Fast and accurate short read alignment with Burrows–Wheeler transform |
| HISAT2 |
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype |
| Bowtie2 |
Fast gapped-read alignment with Bowtie 2 |
| STAR |
STAR: ultrafast universal RNA-seq aligner |
| Salmon |
Alignment and mapping methodology influence transcript abundance estimation |
| kallisto |
Near-optimal probabilistic RNA-seq quantification |
| BBMap |
BBMerge – Accurate paired shotgun read merging via overlap |
| FASTX |
Collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing |
| Trimmomatic |
Trimmomatic: a flexible trimmer for Illumina sequence data |
| SeqKit |
SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation |
| seqtk |
Toolkit for processing sequences in FASTA/Q formats |
| fastp |
fastp: an ultra-fast all-in-one FASTQ preprocessor |
| HTStream |
A toolset for high throughput sequence analysis using a streaming approach facilitated by Linux pipes |
| fqtrim |
trimming & filtering of next-gen reads |
| TreeTime |
TreeTime: Maximum-likelihood phylodynamic analysis |
| FastTree |
FastTree 2–approximately maximum-likelihood trees for large alignments |
| RAxML |
RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies |
| RAxML-NG |
RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference |
| PhyML |
Estimating maximum likelihood phylogenies with PhyML |
| Pplacer |
pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree |
| SAMtools |
The Sequence Alignment/Map format and SAMtools |
| BCFtools |
https://www.ncbi.nlm.nih.gov/pubmed/28205675 |
| Bamtools |
BamTools provides both a programmer’s API and an end-user’s toolkit for handling BAM files |
| VCFtools |
The Variant Call Format and VCFtools |
| BEDTools |
BEDTools: a flexible suite of utilities for comparing genomic features |
| deepTools |
deepTools2: a next generation web server for deep-sequencing data analysis |
| BEDOPS |
BEDOPS: high-performance genomic feature operations |
| Sambamba |
Sambamba: fast processing of NGS alignment formats |
| SPAdes |
SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing |
| ABySS |
ABySS: a parallel assembler for short read sequence data |
| Velvet |
Velvet: algorithms for de novo short read assembly using de Bruijn graphs |
| MEGAHIT |
MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph |
| MetaVelvet |
MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads |
| Prodigal |
Prodigal: prokaryotic gene recognition and translation initiation site identification |
| Infernal |
inference of RNA alignments |
| antiSMASH |
antiSMASH: Rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters |
| DeepBGC |
A deep learning genome-mining strategy for biosynthetic gene cluster prediction |
| GECCO |
Accurate de novo identification of biosynthetic gene clusters with GECCO |
| Miniconda |
Package, dependency and environment management for any language |
| CD-HIT |
Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences |
| Nextflow |
Nextflow enables reproducible computational workflows |
| GATK |
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data |
| Centrifuge |
Centrifuge: rapid and sensitive classification of metagenomic sequences |
| Pavian |
Pavian: interactive analysis of metagenomics data for microbiome studies and pathogen identification |
| Kraken2 |
Metagenome analysis using the Kraken software suite |
| Bracken |
Bracken: estimating species abundance in metagenomics data |
| MetaPhlAn |
Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4 |
| HUMAnN |
Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3 |
| mothur |
Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities |
| UCHIME |
UCHIME improves sensitivity and speed of chimera detection |