asimov: Built for Complex Scientific Workflows

Purpose-built for computational research with multiple interdependent codes, long-running analyses, and publication-grade reproducibility requirements.

Quick Feature Comparison

How asimov stacks up against general-purpose workflow tools

Feature asimov Nextflow Snakemake Galaxy Cromwell
Primary Language Python Groovy/Java Python Web UI / YAML WDL
Workflow Definition Python API + YAML Nextflow DSL Snakefile (Python) Graphical or YAML WDL
Target Users Researchers / Developers Bioinformaticians / DevOps Bioinformaticians / Data Scientists Bench Scientists WDL Community / Genomics
Execution Backends HTCondor, Local, Docker Nextflow Tower, Cloud native SLURM, PBS, Local, Cloud Local, Galaxy Server Google Cloud, AWS, Azure
Job Submission Direct + Managed Native Cloud Cluster-aware Web UI Cloud-focused
Multi-language Support ✓ (Python-first) ✓ (Groovy-first) ✓ (Python-first) ✓ (Tools) ✓ (WDL tasks)
Containerization Docker, Singularity Docker, Singularity, Podman Docker, Conda Tool containers Docker
Dependency Resolution Manual + Automatic Automatic Automatic Manual Manual
Monitoring UI Web dashboard (in development) Nextflow Tower (managed) Limited Web UI Cromwell Server
Extensibility Plugin system Custom modules Custom rules Tool integration Custom tasks

Built for Complex Scientific Research

Whether in physics, chemistry, biology, climate science, or computational engineering—asimov addresses the needs of research that demands multi-code orchestration and reproducibility

🔬 Multi-Code Orchestration

Seamlessly integrate Python, C++, Fortran, and shell pipelines in a single workflow. Define complex dependencies between heterogeneous codes without preprocessing or manual orchestration.

🖥️ Academic HPC Integration

Native support for HTCondor, SLURM, and PBS—the infrastructure that powers university research. Works with your existing HPC cluster, not cloud-only platforms.

📊 Computational Scale

Orchestrate thousands of jobs across heterogeneous resources. Built for parameter sweeps, ensemble methods, statistical inference at scale, and any workflow with massive computational demands.

🔐 Research Governance

Audit trails, role-based access, and reproducibility built-in. Essential for collaborative research, multi-institution projects, and grant-mandated archival requirements.

🐍 Python-Native for Science

Built entirely in Python, asimov integrates directly with NumPy, SciPy, Astropy, and domain-specific packages. No context-switching between DSLs—your analysis logic and orchestration logic are one.

📈 Reproducible Science

Every workflow execution produces a complete audit trail. Version-tracked configurations, reproducible event data, and containerized codes ensure your research meets modern reproducibility standards.

The Research Orchestration Problem

Generic workflow tools weren't designed for the realities of computational research:

The Research Challenge
  • Multi-code workflows: Analyses flow through multiple interdependent analysis codes and simulations
  • Long-running computations: Individual jobs can take days or weeks; need robust management and recovery
  • Complex dependencies: Results from one code feed into the next; intricate orchestration needed
  • Heterogeneous compute: Mix of Python packages, compiled binaries, shell scripts, remote resources
  • Reproducibility at scale: Publication requirements demand complete audit trails across thousands of jobs
asimov's Solution
  • Orchestration framework built for multi-code scientific pipelines
  • Stateful job management with recovery from partial failures
  • Monitoring dashboards for tracking hundreds or thousands of jobs
  • Plugin architecture for adding analysis codes without forking
  • Research infrastructure designed for publication-grade reproducibility

Current Focus & Future Directions

asimov currently powers gravitational wave astronomy research, with infrastructure designed to extend to any domain requiring multi-code orchestration

🌊 Current: Gravitational Wave Astronomy

asimov is actively used for:

  • Multi-code parameter estimation pipelines (signal processing → Bayesian inference → population inference)
  • Thousands of nested sampling jobs coordinated across LIGO and research institutions
  • Ensemble simulations for synthetic data generation and detector characterization
🔮 Expanding To: Computational Science

The architecture applies directly to:

  • Computational Biology: Molecular dynamics ensembles, free energy calculations, structure prediction
  • Climate Science: Model ensemble runs, coupled simulations, large-scale data analysis
  • Materials Science: Parameter sweeps, simulation pipelines, cross-code validation
  • ML in Science: Training dataset generation, hyperparameter optimization, model inference

Why Specialized Tools Fall Short

Bioinformatics-First Tools

Nextflow, Snakemake, Galaxy assume cloud infrastructure and pre-built tool ecosystems:

  • ✗ Built for "docker pull biotools"; your Fortran solver isn't in Bioconda
  • ✗ Cloud-native designs miss HTCondor/SLURM resources universities depend on
  • ✗ Assume quick-running tasks; not designed for week-long parameter estimation
  • ✗ Limited state management for complex inter-code dependencies
Generic Job Schedulers

Slurm, PBS, HTCondor can run individual jobs, but lack research-grade orchestration:

  • ✗ No monitoring dashboards for tracking thousand-job surveys
  • ✗ Manual dependency management across codes
  • ✗ No audit trails for reproducibility
  • ✗ No integration with analysis packages (Bilby, LALInference, etc.)
Python Task Frameworks

Celery, Dask, Ray are built for distributed computing, not research orchestration:

  • ✗ No stateful workflow management
  • ✗ Poor support for HPC cluster integration
  • ✗ No built-in reproducibility or audit features
  • ✗ No monitoring UI or research-grade monitoring
asimov Fills the Gap

Purpose-built for physics research:

  • ✓ Orchestrates multi-code scientific pipelines
  • ✓ Native HPC + Python package integration
  • ✓ Monitoring and state management for long-running analyses
  • ✓ Built-in reproducibility and audit trails

Ready to Get Started?

Explore asimov with our tutorials and documentation.