ahab

Add limitless scale to your pipelines.

Work like you’ve got 8 arms. 🦑

  • With our quick deployment templates, you'll be up and running with Kubernetes and other complementary Azure services in no time. All you need is an Azure subscription and we'll handle the rest.

  • ahab will kick off analysis jobs automatically when new data becomes available. Plus, with our Power BI report templates, you can easily see what's running/completed/failed.

  • ahab can pull data from your data lake and write results back, all while playing nicely with the data lake's security.

Big on the Whale. 🐳

ahab works by packaging up your existing pipeline logic as a Docker container. It’s easy to use and helps to ensure scientific reproducibility.

Supercharged Kubernetes with the power of Azure.

Kubernetes is an open-source container orchestration tool that organizations use to run and manage clusters of machines, with each machine running one or more parts of an application.

It turns out that this platform also works well with computationally intensive bioinformatics pipelines. Using a cloud-based version of Kubernetes in Azure, we can orchestrate thousands of processes with ease.
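For readers new to Kubernetes, here is a minimal sketch of what running one containerized pipeline step as a Kubernetes Job can look like. It builds a standard `batch/v1` Job manifest as a Python dictionary. The job name, image name, command, and resource values are made up for illustration, and this is not a description of how ahab creates jobs internally.

```python
import json


def build_job_manifest(job_name: str, image: str, command: list[str],
                       cpu: str = "4", memory: str = "16Gi") -> dict:
    """Return a Kubernetes batch/v1 Job manifest for one containerized step."""
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": job_name},
        "spec": {
            "backoffLimit": 2,  # retry a failed pod at most twice
            "template": {
                "spec": {
                    "restartPolicy": "Never",  # Jobs allow only Never or OnFailure
                    "containers": [{
                        "name": "pipeline",
                        "image": image,
                        "command": command,
                        "resources": {
                            "requests": {"cpu": cpu, "memory": memory},
                        },
                    }],
                }
            },
        },
    }


if __name__ == "__main__":
    manifest = build_job_manifest(
        "align-sample-001",                      # illustrative job name
        "example.azurecr.io/rnaseq:1.0",         # hypothetical image name
        ["bash", "run_pipeline.sh", "sample-001"],  # hypothetical entry point
    )
    print(json.dumps(manifest, indent=2))
```

Kubernetes accepts manifests in JSON as well as YAML, so the printed output could be saved to a file and submitted to a cluster. One manifest per sample is what lets the cluster schedule many independent steps in parallel.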

A Modern Approach to High Performance Computing.

No longer do we have to fuss with clunky HPC environments with schedulers and finite resources. ahab in the Azure cloud breaks through the previously frustrating limitations.

Scalability

Need a single machine for 10 minutes or 1,000 machines for 10 hours? No problem. Dynamic scaling of the Kubernetes cluster will meet your computational needs, no matter how big or small.

Automation

Integrating your orchestration tools with ahab's API allows for automated job creation.

This is perfect for standardized tasks like processing FASTQ files through an RNA-seq or WES pipeline as the samples trickle in.
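As a rough sketch of what that automation might look like on the orchestration side, the snippet below pairs newly arrived paired-end FASTQ files by sample and builds one job request per complete pair. The `_R1`/`_R2` file naming convention and the payload fields (`pipeline`, `sample`, `inputs`) are assumptions made for illustration. They are not ahab's actual API schema, and the step that would send each payload to the API is deliberately left out.

```python
import re
from collections import defaultdict

# Assumed naming convention: "<sample>_R1.fastq.gz" / "<sample>_R2.fastq.gz"
FASTQ_PATTERN = re.compile(r"^(?P<sample>.+)_R(?P<read>[12])\.(fastq|fq)(\.gz)?$")


def pair_fastq_files(filenames):
    """Group FASTQ filenames by sample; keep only samples with both reads."""
    reads = defaultdict(dict)
    for name in filenames:
        match = FASTQ_PATTERN.match(name)
        if match:  # silently skip files that are not FASTQ reads
            reads[match["sample"]][match["read"]] = name
    return {
        sample: (found["1"], found["2"])
        for sample, found in sorted(reads.items())
        if "1" in found and "2" in found
    }


def build_job_request(sample, r1, r2, pipeline="rnaseq"):
    """Build a hypothetical job payload (field names are illustrative only)."""
    return {"pipeline": pipeline, "sample": sample, "inputs": {"r1": r1, "r2": r2}}


if __name__ == "__main__":
    arrived = ["s1_R1.fastq.gz", "s1_R2.fastq.gz", "s2_R1.fastq.gz", "notes.txt"]
    for sample, (r1, r2) in pair_fastq_files(arrived).items():
        print(build_job_request(sample, r1, r2))
```

Waiting for both reads before building a request means a sample whose second file is still uploading is simply picked up on a later pass.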

Visibility

Visibility is key to keeping track of how things are going. The ahab platform is designed to handle container logs and to keep an eye on a job's progress.

You'll be able to integrate with your reporting tool (or use our Power BI report template) to see job statuses and to find your results.
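If you are wiring job statuses into your own reporting tool, the summary step could be as simple as the sketch below. It assumes each job is available as a record with an `id` and a `status` of running, completed, or failed. That record shape is hypothetical and is not taken from ahab's documentation.

```python
from collections import Counter

# The three statuses the report is expected to show
STATUSES = ("running", "completed", "failed")


def summarize_statuses(jobs):
    """Count jobs per status, reporting zero for statuses with no jobs."""
    counts = Counter(job["status"] for job in jobs)
    return {status: counts.get(status, 0) for status in STATUSES}


def failed_job_ids(jobs):
    """Return the ids of failed jobs so they can be inspected or retried."""
    return [job["id"] for job in jobs if job["status"] == "failed"]


if __name__ == "__main__":
    # Hypothetical job records for illustration
    jobs = [
        {"id": "job-1", "status": "completed"},
        {"id": "job-2", "status": "running"},
        {"id": "job-3", "status": "failed"},
        {"id": "job-4", "status": "completed"},
    ]
    print(summarize_statuses(jobs))
    print(failed_job_ids(jobs))
```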

BYODocker Containers.

Data used in bioinformatics is often large, messy, and in esoteric file formats. Luckily for us, cloud-based data lakes are the perfect choice for storing and organizing data at scale. Plus, having your data available in your cloud environment unlocks its use with other cloud tools for scalable bioinformatics pipelines, machine learning, reporting, and more!

It’s like it was built for Bioinformatics?

Getting bioinformatics pipelines up and running can sometimes feel like taping random things together and hoping they don't break. Docker has really revolutionized scientific reproducibility by containerizing all the software dependencies together so that your workflow just... works.

Building on the strengths of Docker and Kubernetes, ahab provides a platform that excels at enterprise-scale bioinformatics.

Nextflow Summit 2024 Video