Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2024 Feb 7;40(2):btae070. doi: 10.1093/bioinformatics/btae070

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s) 2024. Published by Oxford University Press.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

PMC Copyright notice

Figure 1. — Overview of Pycallingcards. (a) Pycallingcards workflow for scCC data. Pycallingcards reads insertion data from a qbed file and then calls peaks (to create a bed file, left column). It then creates a cells-by-peaks Anndata object (h5ad file) Pycallingcards interfaces with Scanpy to complete preprocessing, clustering, and differential expression analysis of the RNA-seq data collected for each cell (right column). Pycallingcards then uses Mudata object to store the combined scCC and scRNA-seq data (h5mu file). (b) Data structure in Pycallingcards for bulk CC data. Pycallingcards reads insertion data from a qbed file and calls peaks, which generates a bed file. It later creates a groups/samples-by-peaks Anndata object (h5ad file) (b, left column). If bulk RNA-seq is provided, it uses normalized counts and results from differential gene analysis (b, right column). (c) Downstream Analysis. Pycallingcards provides functionality to compare called CC peaks with Chip-seq signal (when available), perform a footprint analysis to narrow down TF binding regions, find motifs, allow for visualization of the dataset through the WashU Epigenome Browser, perform differential peak analysis, pair CC data with RNA-seq data, and identify related SNPs by intersecting peaks with a GWAS database.