publications
*These authors contributed equally to this work
2024
- Relapse to cocaine seeking is regulated by medial habenula NR4A2/NURR1 in miceJessica E. Childs* , Samuel Morabito* , Sudeshna Das , Caterina Santelli , Victoria Pham , and 7 more authorsCell Reports, 2024
Drugs of abuse can persistently change the reward circuit in ways that contribute to relapse behavior, partly via mechanisms that regulate chromatin structure and function. Nuclear orphan receptor subfamily4 groupA member2 (NR4A2, also known as NURR1) is an important effector of histone deacetylase 3 (HDAC3)-dependent mechanisms in persistent memory processes and is highly expressed in the medial habenula (MHb), a region that regulates nicotine-associated behaviors. Here, expressing the Nr4a2 dominant negative (Nurr2c) in the MHb blocks reinstatement of cocaine seeking in mice. We use single-nucleus transcriptomics to characterize the molecular cascade following Nr4a2 manipulation, revealing changes in transcriptional networks related to addiction, neuroplasticity, and GABAergic and glutamatergic signaling. The network controlled by NR4A2 is characterized using a transcription factor regulatory network inference algorithm. These results identify the MHb as a pivotal regulator of relapse behavior and demonstrate the importance of NR4A2 as a key mechanism driving the MHb component of relapse.
- BIN1K358R suppresses glial response to plaques in mouse model of Alzheimer’s diseaseLaura Fernandez Garcia‐Agudo , Zechuan Shi , Ian F. Smith , Enikö A. Kramár , Katelynn Tran , and 22 more authorsAlzheimer’s & Dementia, 2024
The BIN1 coding variant rs138047593 (K358R) is linked to Late‐Onset Alzheimer’s Disease (LOAD) via targeted exome sequencing. To elucidate the functional consequences of this rare coding variant on brain amyloidosis and neuroinflammation, we generated BIN1K358R knock‐in mice using CRISPR/Cas9 technology. These mice were subsequently bred with 5xFAD transgenic mice, which serve as a model for Alzheimer’s pathology. The presence of the BIN1K358R variant leads to increased cerebral amyloid deposition, with a dampened response of astrocytes and oligodendrocytes, but not microglia, at both the cellular and transcriptional levels. This correlates with decreased neurofilament light chain in both plasma and brain tissue. Synaptic densities are significantly increased in both wild‐type and 5xFAD backgrounds homozygous for the BIN1K358R variant. The BIN1 K358R variant modulates amyloid pathology in 5xFAD mice, attenuates the astrocytic and oligodendrocytic responses to amyloid plaques, decreases damage markers, and elevates synaptic densities. BIN1 rs138047593 (K358R) coding variant is associated with increased risk of LOAD. BIN1 K358R variant increases amyloid plaque load in 12‐month‐old 5xFAD mice. BIN1 K358R variant dampens astrocytic and oligodendrocytic response to plaques. BIN1 K358R variant decreases neuronal damage in 5xFAD mice. BIN1 K358R upregulates synaptic densities and modulates synaptic transmission.
2023
- bioRxivSpatial and single-nucleus transcriptomic analysis of genetic and sporadic forms of Alzheimer’s DiseaseEmily Miyoshi* , Samuel Morabito* , Caden M. Henningfield , Negin Rahimzadeh , Sepideh Kiani Shabestari , and 19 more authorsbioRxiv, 2023
The majority of mammalian genes encode multiple transcript isoforms that result from differential promoter use, changes in exonic splicing, and alternative 3′ end choice. Detecting and quantifying transcript isoforms across tissues, cell types, and species has been extremely challenging because transcripts are much longer than the short reads normally used for RNA-seq. By contrast, long-read RNA-seq (LR-RNA-seq) gives the complete structure of most transcripts. We sequenced 264 LR-RNA-seq PacBio libraries totaling over 1 billion circular consensus reads (CCS) for 81 unique human and mouse samples. We detect at least one full-length transcript from 87.7% of annotated human protein coding genes and a total of \textbackslashtextasciitilde200,000 full-length transcripts, \textbackslashtextasciitilde40% of which have novel exon junction chains. To capture and compute on the three sources of transcript structure diversity, we introduce a gene and transcript annotation framework that uses triplets representing the transcript start site, exon junction chain, and transcript end site of each transcript. Using triplets in a simplex representation demonstrates how promoter selection, splice pattern, and 3′ processing are deployed across human tissues, with nearly half of multitranscript protein coding genes showing a clear bias toward one of the three diversity mechanisms. Evaluated across samples, the predominantly expressed transcript changes for 74% of protein coding genes. In evolution, the human and mouse transcriptomes are globally similar in types of transcript structure diversity, yet among individual orthologous gene pairs, more than half (57.8%) show substantial differences in mechanism of diversification in matching tissues. This initial large-scale survey of human and mouse long-read transcriptomes provides a foundation for further analyses of alternative transcript usage, and is complemented by short-read and microRNA data on the same samples and by epigenome data elsewhere in the ENCODE4 collection.
- hdWGCNA identifies co-expression networks in high-dimensional transcriptomics dataSamuel Morabito , Fairlie Reese , Negin Rahimzadeh , Emily Miyoshi , and Vivek SwarupCell Reports Methods, 2023
Biological systems are immensely complex, organized into a multi-scale hierarchy of functional units based on tightly regulated interactions between distinct molecules, cells, organs, and organisms. While experimental methods enable transcriptome-wide measurements across millions of cells, popular bioinformatic tools do not support systems-level analysis. Here we present hdWGCNA, a comprehensive framework for analyzing co-expression networks in high-dimensional transcriptomics data such as single-cell and spatial RNA sequencing (RNA-seq). hdWGCNA provides functions for network inference, gene module identification, gene enrichment analysis, statistical tests, and data visualization. Beyond conventional single-cell RNA-seq, hdWGCNA is capable of performing isoform-level network analysis using long-read single-cell data. We showcase hdWGCNA using data from autism spectrum disorder and Alzheimer’s disease brain samples, identifying disease-relevant co-expression network modules. hdWGCNA is directly compatible with Seurat, a widely used R package for single-cell and spatial transcriptomics analysis, and we demonstrate the scalability of hdWGCNA by analyzing a dataset containing nearly 1 million cells
- bioRxivThe ENCODE4 long-read RNA-seq collection reveals distinct classes of transcript structure diversityFairlie Reese , Brian Williams , Gabriela Balderrama-Gutierrez , Dana Wyman , Muhammed Hasan Çelik , and 40 more authorsbioRxiv, 2023
The majority of mammalian genes encode multiple transcript isoforms that result from differential promoter use, changes in exonic splicing, and alternative 3′ end choice. Detecting and quantifying transcript isoforms across tissues, cell types, and species has been extremely challenging because transcripts are much longer than the short reads normally used for RNA-seq. By contrast, long-read RNA-seq (LR-RNA-seq) gives the complete structure of most transcripts. We sequenced 264 LR-RNA-seq PacBio libraries totaling over 1 billion circular consensus reads (CCS) for 81 unique human and mouse samples. We detect at least one full-length transcript from 87.7% of annotated human protein coding genes and a total of \textbackslashtextasciitilde200,000 full-length transcripts, \textbackslashtextasciitilde40% of which have novel exon junction chains. To capture and compute on the three sources of transcript structure diversity, we introduce a gene and transcript annotation framework that uses triplets representing the transcript start site, exon junction chain, and transcript end site of each transcript. Using triplets in a simplex representation demonstrates how promoter selection, splice pattern, and 3′ processing are deployed across human tissues, with nearly half of multitranscript protein coding genes showing a clear bias toward one of the three diversity mechanisms. Evaluated across samples, the predominantly expressed transcript changes for 74% of protein coding genes. In evolution, the human and mouse transcriptomes are globally similar in types of transcript structure diversity, yet among individual orthologous gene pairs, more than half (57.8%) show substantial differences in mechanism of diversification in matching tissues. This initial large-scale survey of human and mouse long-read transcriptomes provides a foundation for further analyses of alternative transcript usage, and is complemented by short-read and microRNA data on the same samples and by epigenome data elsewhere in the ENCODE4 collection.
- The Type 2 Diabetes Knowledge Portal: An open access genetic resource dedicated to type 2 diabetes and related traitsMaria C. Costanzo , Marcin von Grotthuss , Jeffrey Massung , Dongkeun Jang , Lizz Caulkins , and 187 more authorsCell Metabolism, 2023
Associations between human genetic variation and clinical phenotypes have become a foundation of biomedical research. Most repositories of these data seek to be disease-agnostic and therefore lack disease-focused views. The Type 2 Diabetes Knowledge Portal (T2DKP) is a public resource of genetic datasets and genomic annotations dedicated to type 2 diabetes (T2D) and related traits. Here, we seek to make the T2DKP more accessible to prospective users and more useful to existing users. First, we evaluate the T2DKP’s comprehensiveness by comparing its datasets with those of other repositories. Second, we describe how researchers unfamiliar with human genetic data can begin using and correctly interpreting them via the T2DKP. Third, we describe how existing users can extend their current workflows to use the full suite of tools offered by the T2DKP. We finally discuss the lessons offered by the T2DKP toward the goal of democratizing access to complex disease genetic results.
2022
- Molecular signatures underlying neurofibrillary tangle susceptibility in Alzheimer’s diseaseMarcos Otero-Garcia , Sameehan U. Mahajani , Debia Wakhloo , Weijing Tang , Yue-Qiang Xue , and 12 more authorsNeuron, 2022
Tau aggregation in neurofibrillary tangles (NFTs) is closely associated with neurodegeneration and cognitive decline in Alzheimer’s disease (AD). However, the molecular signatures that distinguish between aggregation-prone and aggregation-resistant cell states are unknown. We developed methods for the high-throughput isolation and transcriptome profiling of single somas with NFTs from the human AD brain, quantified the susceptibility of 20 neocortical subtypes for NFT formation and death, and identified both shared and cell-type-specific signatures. NFT-bearing neurons shared a marked upregulation of synaptic transmission-related genes, including a core set of 63 genes enriched for synaptic vesicle cycling. Oxidative phosphorylation and mitochondrial dysfunction were highly cell-type dependent. Apoptosis was only modestly enriched, and the susceptibilities of NFT-bearing and NFT-free neurons for death were highly similar. Our analysis suggests that NFTs represent cell-type-specific responses to stress and synaptic dysfunction. We provide a resource for biomarker discovery and the investigation of tau-dependent and tau-independent mechanisms of neurodegeneration.
- Absence of microglia promotes diverse pathologies and early lethality in Alzheimer’s disease miceSepideh Kiani Shabestari , Samuel Morabito , Emma Pascal Danhash , Amanda McQuade , Jessica Ramirez Sanchez , and 16 more authorsCell Reports, 2022
Microglia are strongly implicated in the development and progression of Alzheimer’s disease (AD), yet their impact on pathology and lifespan remains unclear. Here we utilize a CSF1R hypomorphic mouse to generate a model of AD that genetically lacks microglia. The resulting microglial-deficient mice exhibit a profound shift from parenchymal amyloid plaques to cerebral amyloid angiopathy (CAA), which is accompanied by numerous transcriptional changes, greatly increased brain calcification and hemorrhages, and premature lethality. Remarkably, a single injection of wild-type microglia into adult mice repopulates the microglial niche and prevents each of these pathological changes. Taken together, these results indicate the protective functions of microglia in reducing CAA, blood-brain barrier dysfunction, and brain calcification. To further understand the clinical implications of these findings, human AD tissue and iPSC-microglia were examined, providing evidence that microglia phagocytose calcium crystals, and this process is impaired by loss of the AD risk gene, TREM2.
- Protocol for single-nucleus ATAC sequencing and bioinformatic analysis in frozen human brain tissueZechuan Shi* , Sudeshna Das* , Samuel Morabito , Emily Miyoshi , and Vivek SwarupSTAR Protocols, 2022
Single-nucleus ATAC sequencing (snATAC-seq) employs a hyperactive Tn5 transposase to gain precise information about the cis-regulatory elements in specific cell types. However, the standard protocol of snATAC-seq is not optimized for all tissues, including the brain. Here, we present a modified protocol for single-nuclei isolation from postmortem frozen human brain tissue, followed by snATAC-seq library preparation and sequencing. We also describe an integrated bioinformatics analysis pipeline using an R package (ArchRtoSignac) to robustly analyze snATAC-seq data. For complete details on the use and execution of this protocol, please refer to Morabito et al. (2021).
- Sci RepCortical diurnal rhythms remain intact with microglial depletionRocio A. Barahona , Samuel Morabito , Vivek Swarup , and Kim N. GreenScientific Reports, 2022
Microglia are subject to change in tandem with the endogenously generated biological oscillations known as our circadian rhythm. Studies have shown microglia harbor an intrinsic molecular clock which regulates diurnal changes in morphology and influences inflammatory responses. In the adult brain, microglia play an important role in the regulation of condensed extracellular matrix structures called perineuronal nets (PNNs), and it has been suggested that PNNs are also regulated in a circadian and diurnal manner. We sought to determine whether microglia mediate the diurnal regulation of PNNs via CSF1R inhibitor dependent microglial depletion in C57BL/6J mice, and how the absence of microglia might affect cortical diurnal gene expression rhythms. While we observe diurnal differences in microglial morphology, where microglia are most ramified at the onset of the dark phase, we do not find diurnal differences in PNN intensity. However, PNN intensity increases across many brain regions in the absence of microglia, supporting a role for microglia in the regulation of PNNs. Here, we also show that cortical diurnal gene expression rhythms are intact, with no cycling gene changes without microglia. These findings demonstrate a role for microglia in the maintenance of PNNs, but not in the maintenance of diurnal rhythms.
2021
- Single-nucleus chromatin accessibility and transcriptomic characterization of Alzheimer’s diseaseSamuel Morabito* , Emily Miyoshi* , Neethu Michael , Saba Shahin , Alessandra Cadete Martini , and 5 more authorsNature Genetics, 2021
The gene-regulatory landscape of the brain is highly dynamic in health and disease, coordinating a menagerie of biological processes across distinct cell types. Here, we present a multi-omic single-nucleus study of 191,890 nuclei in late-stage Alzheimer’s disease (AD), accessible through our web portal, profiling chromatin accessibility and gene expression in the same biological samples and uncovering vast cellular heterogeneity. We identified cell-type-specific, disease-associated candidate cis-regulatory elements and their candidate target genes, including an oligodendrocyte-associated regulatory module containing links to APOE and CLU. We describe cis-regulatory relationships in specific cell types at a subset of AD risk loci defined by genome-wide association studies, demonstrating the utility of this multi-omic single-nucleus approach. Trajectory analysis of glial populations identified disease-relevant transcription factors, such as SREBF1, and their regulatory targets. Finally, we introduce single-nucleus consensus weighted gene coexpression analysis, a coexpression network analysis strategy robust to sparse single-cell data, and perform a systems-level analysis of the AD transcriptome.
- Systems biology approaches to unravel the molecular and genetic architecture of Alzheimer’s disease and related tauopathiesEmily Miyoshi* , Samuel Morabito* , and Vivek SwarupNeurobiology of Disease, 2021
Over the years, genetic studies have identified multiple genetic risk variants associated with neurodegenerative disorders and helped reveal new biological pathways and genes of interest. However, genetic risk variants commonly reside in non-coding regions and may regulate distant genes rather than the nearest gene, as well as a gene’s interaction partners in biological networks. Systems biology and functional genomics approaches provide the framework to unravel the functional significance of genetic risk variants in disease. In this review, we summarize the genetic and transcriptomic studies of Alzheimer’s disease and related tauopathies and focus on the advantages of performing systems-level analyses to interrogate the biological pathways underlying neurodegeneration. Finally, we highlight new avenues of multi-omics analysis with single-cell approaches, which provides unparalleled opportunities to systematically explore cellular heterogeneity, and present an example of how to integrate publicly available single-cell datasets. Systems-level analysis has illuminated the function of many disease risk genes, but much work remains to study tauopathies and to understand spatiotemporal gene expression changes of specific cell types.
2020
- Integrative genomics approach identifies conserved transcriptomic networks in Alzheimer’s diseaseSamuel Morabito , Emily Miyoshi , Neethu Michael , and Vivek SwarupHuman Molecular Genetics, Aug 2020
Alzheimer’s disease (AD) is a devastating neurological disorder characterized by changes in cell-type proportions and consequently marked alterations of the transcriptome. Here we use a data-driven systems biology meta-analytical approach across three human ad cohorts, encompassing six cortical brain regions, and integrate with multi-scale datasets comprising of DNA methylation, histone acetylation, transcriptome- and genome-wide association studies, and quantitative trait loci to further characterize the genetic architecture of ad. We perform co-expression network analysis across more than twelve hundred human brain samples, identifying robust ad-associated dysregulation of the transcriptome, unaltered in normal human aging. We assess the cell-type specificity of ad gene co-expression changes and estimate cell-type proportion changes in human ad by integrating co-expression modules with single-cell transcriptome data generated from 27 321 nuclei from human postmortem prefrontal cortical tissue. We also show that genetic variants of ad are enriched in a microglial ad-associated module and identify key transcription factors regulating co-expressed modules. Additionally, we validate our results in multiple published human ad gene expression datasets, which can be easily accessed using our online resource (https://swaruplab.bio.uci.edu/consensusAD).
2019
- Comms BioSingle-cell landscape in mammary epithelium reveals bipotent-like cells associated with breast cancer risk and outcomeWeiyan Chen , Samuel J. Morabito , Kai Kessenbrock , Tariq Enver , Kerstin B. Meyer , and 1 more authorCommunications Biology, Dec 2019
Adult stem-cells may serve as the cell-of-origin for cancer, yet their unbiased identification in single cell RNA sequencing data is challenging due to the high dropout rate. In the case of breast, the existence of a bipotent stem-like state is also controversial. Here we apply a marker-free algorithm to scRNA-Seq data from the human mammary epithelium, revealing a high-potency cell-state enriched for an independent mammary stem-cell expression module. We validate this stem-like state in independent scRNA-Seq data. Our algorithm further predicts that the stem-like state is bipotent, a prediction we are able to validate using FACS sorted bulk expression data. The bipotent stem-like state correlates with clinical outcome in basal breast cancer and is characterized by overexpression of YBX1 and ENO1, two modulators of basal breast cancer risk. This study illustrates the power of a marker-free computational framework to identify a novel bipotent stem-like state in the mammary epithelium.