BANKSY unifies cell typing and tissue domain segmentation for scalable spatial omics data analysis
Author(s): Vipul Singhal,Nigel Chou,Joseph Lee,Kok Hao Chen,Shyam Prabhakar
Affiliation(s): Genome Institute of Singapore
A core property of solid tissue is the spatial arrangement of cell types into stereotypical spatial patterns. These cell types can be investigated with spatial omics technologies to reveal both their omics features (transcriptomes, proteomes, etc), and their spatial coordinates. Because a cell’s state can be influenced by interactions with other cells, it is informative to cluster cells using both omics features and spatial locations. We present BANKSY, an algorithm with R and Python implementations that identifies both cell types and tissue domains from spatially-resolved -omics data. It does so by embedding cells in a product space of their own and neighbourhood omics features. In our tests, BANKSY revealed niche-dependent cell states in the mouse brain, and outperformed competing methods on domain segmentation and cell-typing benchmarks. BANKSY can also be used for quality control of spatial transcriptomics data and for spatially aware batch correction. Critically, it is substantially faster and more scalable than existing methods, enabling the processing of millions of cell datasets. In R, BANKSY can be used in the Bioconductor ecosystem via the SingleCellExperiment class.