Quantitative sequencing using BID-seq uncovers abundant pseudouridines in mammalian mRNA at base resolutionDai, Qing; Zhang, Li-Sheng; Sun, Hui-Lung; Pajdzik, Kinga; Yang, Lei; Ye, Chang; Ju, Cheng-Wei; Liu, Shun; Wang, Yuru; Zheng, Zhong; Zhang, Linda; Harada, Bryan T.; Dou, Xiaoyang; Irkliyenko, Iryna; Feng, Xinran; Zhang, Wen; Pan, Tao; He, Chuan
doi: 10.1038/s41587-022-01505-wpmid: 36302989
Functional characterization of pseudouridine (Ψ) in mammalian mRNA has been hampered by the lack of a quantitative method that maps Ψ in the whole transcriptome. We report bisulfite-induced deletion sequencing (BID-seq), which uses a bisulfite-mediated reaction to convert pseudouridine stoichiometrically into deletion upon reverse transcription without cytosine deamination. BID-seq enables detection of abundant Ψ sites with stoichiometry information in several human cell lines and 12 different mouse tissues using 10–20 ng input RNA. We uncover consensus sequences for Ψ in mammalian mRNA and assign different ‘writer’ proteins to individual Ψ deposition. Our results reveal a transcript stabilization role of Ψ sites installed by TRUB1 in human cancer cells. We also detect the presence of Ψ within stop codons of mammalian mRNA and confirm the role of Ψ in promoting stop codon readthrough in vivo. BID-seq will enable future investigations of the roles of Ψ in diverse biological processes.
Precision mitochondrial DNA editing with high-fidelity DddA-derived base editorsLee, Seonghyun; Lee, Hyunji; Baek, Gayoung; Kim, Jin-Soo
doi: 10.1038/s41587-022-01486-wpmid: 36229610
Bacterial toxin DddA-derived cytosine base editors (DdCBEs)—composed of split DddAtox (a cytosine deaminase specific to double-stranded DNA), custom-designed TALE (transcription activator-like effector) DNA-binding proteins, and a uracil glycosylase inhibitor—enable mitochondrial DNA (mtDNA) editing in human cells, which may pave the way for therapeutic correction of pathogenic mtDNA mutations in patients. The utility of DdCBEs has been limited by off-target activity, which is probably caused by spontaneous assembly of the split DddAtox deaminase enzyme, independent of DNA-binding interactions. We engineered high-fidelity DddA-derived cytosine base editors (HiFi-DdCBEs) with minimal off-target activity by substituting alanine for amino acid residues at the interface between the split DddAtox halves. The resulting domains cannot form a functional deaminase without binding of their linked TALE proteins at adjacent sites on DNA. Whole mitochondrial genome sequencing shows that, unlike conventional DdCBEs, which induce hundreds of unwanted off-target C-to-T conversions in human mtDNA, HiFi-DdCBEs are highly efficient and precise, avoiding collateral off-target mutations, and as such, they will probably be desirable for therapeutic applications.
Multi-omic single-cell velocity models epigenome–transcriptome interactions and improves cell fate predictionLi, Chen; Virgilio, Maria C.; Collins, Kathleen L.; Welch, Joshua D.
doi: 10.1038/s41587-022-01476-ypmid: 36229609
Multi-omic single-cell datasets, in which multiple molecular modalities are profiled within the same cell, offer an opportunity to understand the temporal relationship between epigenome and transcriptome. To realize this potential, we developed MultiVelo, a differential equation model of gene expression that extends the RNA velocity framework to incorporate epigenomic data. MultiVelo uses a probabilistic latent variable model to estimate the switch time and rate parameters of chromatin accessibility and gene expression and improves the accuracy of cell fate prediction compared to velocity estimates from RNA only. Application to multi-omic single-cell datasets from brain, skin and blood cells reveals two distinct classes of genes distinguished by whether chromatin closes before or after transcription ceases. We also find four types of cell states: two states in which epigenome and transcriptome are coupled and two distinct decoupled states. Finally, we identify time lags between transcription factor expression and binding site accessibility and between disease-associated SNP accessibility and expression of the linked genes. MultiVelo is available on PyPI, Bioconda and GitHub (https://github.com/welch-lab/MultiVelo).
Haplotype-aware analysis of somatic copy number variations from single-cell transcriptomesGao, Teng; Soldatov, Ruslan; Sarkar, Hirak; Kurkiewicz, Adam; Biederstedt, Evan; Loh, Po-Ru; Kharchenko, Peter V.
doi: 10.1038/s41587-022-01468-ypmid: 36163550
Genome instability and aberrant alterations of transcriptional programs both play important roles in cancer. Single-cell RNA sequencing (scRNA-seq) has the potential to investigate both genetic and nongenetic sources of tumor heterogeneity in a single assay. Here we present a computational method, Numbat, that integrates haplotype information obtained from population-based phasing with allele and expression signals to enhance detection of copy number variations from scRNA-seq. Numbat exploits the evolutionary relationships between subclones to iteratively infer single-cell copy number profiles and tumor clonal phylogeny. Analysis of 22 tumor samples, including multiple myeloma, gastric, breast and thyroid cancers, shows that Numbat can reconstruct the tumor copy number profile and precisely identify malignant cells in the tumor microenvironment. We identify genetic subpopulations with transcriptional signatures relevant to tumor progression and therapy resistance. Numbat requires neither sample-matched DNA data nor a priori genotyping, and is applicable to a wide range of experimental settings and cancer types.