Statistics or biology: the zero-inflation controversy about scRNA-seq data

oleh: Ruochen Jiang, Tianyi Sun, Dongyuan Song, Jingyi Jessica Li

Format: Article
Diterbitkan: BMC 2022-01-01

Deskripsi

Abstract Researchers view vast zeros in single-cell RNA-seq data differently: some regard zeros as biological signals representing no or low gene expression, while others regard zeros as missing data to be corrected. To help address the controversy, here we discuss the sources of biological and non-biological zeros; introduce five mechanisms of adding non-biological zeros in computational benchmarking; evaluate the impacts of non-biological zeros on data analysis; benchmark three input data types: observed counts, imputed counts, and binarized counts; discuss the open questions regarding non-biological zeros; and advocate the importance of transparent analysis.