Skip to content
@BiocPy

BiocPy

Facilitate Bioconductor Workflows in Python

BiocPy logo

BiocPy: Facilitate Bioconductor Workflows in Python

BiocPy brings Bioconductor's core data structures and analysis tools to the Python ecosystem. These structures, including BiocFrame and GenomicRanges, serve as essential and foundational data structures, acting as the building blocks for extensive and complex representations. For example, container classes like SummarizedExperiment, SingleCellExperiment, and MultiAssayExperiment represent single or multi-omic experimental data and metadata.

Core Packages

For a complete list of packages, visit our GitHub organization.

Data Structures

PackageDescriptionPyPILinks
BiocFrameBioconductor-like data framesPyPIGitHub | Docs
IRangesInterval arithmetic operationsPyPIGitHub | Docs | Bioconductor
GenomicRangesGenomic location analysisPyPIGitHub | Docs | Bioconductor

Containers

PackageDescriptionPyPILinks
SummarizedExperimentGenomic experiments containerPyPIGitHub | Docs | Bioconductor
SingleCellExperimentSingle-cell genomics containerPyPIGitHub | Docs | Bioconductor
SpatialExperimentSpatial transcriptomics containerPyPIGitHub | Docs | Bioconductor
SpatialFeatureExperimentExtends Spatial transcriptomics containerPyPIGitHub | Docs | Bioconductor
MultiAssayExperimentMulti-omics data frameworkPyPIGitHub | Docs | Bioconductor

R Interoperability

PackageDescriptionPyPILinks
rds2pyRead RDS files directly in PythonPyPIGitHub | Docs
BiocUtilsCommon utilities mirroring R's base functionalityPyPIGitHub | Docs
mopsyMatrix operations with R-like syntaxPyPIGitHub | Docs
pyBiocFileCacheResource caching systemPyPIGitHub | Docs | Bioconductor
txdbGenome annotations from TxDB objectsPyPIGitHub | Docs

Delayed Operations

PackageDescriptionPyPILinks
DelayedArrayDelayed operations in PythonPyPIGitHub | Docs | Bioconductor
HDF5ArrayHDF5-backed arraysPyPIGitHub | Docs | Bioconductor
TileDBArrayTileDB-backed arraysPyPIGitHub | Docs | Bioconductor

Get Started

All packages in the BiocPy are published to BiocPy PyPI org. Install the core packages using the biocpy wrapper:

pip install biocpy

Individual packages can be installed separately. See each package's documentation for specific installation instructions.

Environments

We provide conda/mamba configuration files to create environments containing most BiocPy (& friends) packages. Check out the environments repository for more information.


Friends of BiocPy

BiocPy integrates with several analysis tools and frameworks

Analysis Tools

  • libscran: Multi-model single-cell analysis in R, Python and JavaScript.
  • SingleR-inc: Cell type annotation for single-cell data.

Data Management

  • ArtifactDB: Language-agnostic access to data across computational environments.
  • tatami-inc: Read various matrix representations through a common interface.

Model Training

  • CellArr: TileDB-based genomic data storage with AI/ML dataloaders.

Contributing

We welcome contributions! Check out our developer guide to get started.

Pinned Loading

  1. rds2pyrds2pyPublic

    Read RDS files, in Python

    Python 25 3

  2. GenomicRangesGenomicRangesPublic

    Container class to represent genomic locations and support genomic analysis

    Python 27 4

  3. SummarizedExperimentSummarizedExperimentPublic

    Container class for genomic experiments

    Python 6 2

  4. SingleCellExperimentSingleCellExperimentPublic

    Container class for single-cell experiments

    Python 9 2

  5. scrnaseqscrnaseqPublic

    Collection of public single-cell RNA-seq datasets

    Python 3 1

  6. DelayedArrayDelayedArrayPublic

    DelayedArrays, in Python

    Python 1

Repositories

Showing 10 of 35 repositories

Top languages

Loading…

Most used topics

Loading…