Publication:
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data.

dc.contributor.authorGundogdu, Pelin
dc.contributor.authorLoucera, Carlos
dc.contributor.authorAlamo-Alvarez, Inmaculada
dc.contributor.authorDopazo, Joaquin
dc.contributor.authorNepomuceno, Isabel
dc.contributor.funderSpanish Ministry of Science and Innovation
dc.contributor.funderInstituto de Salud Carlos III (ISCIII)
dc.contributor.funderEuropean Regional Development Funds (ERDF)
dc.contributor.funderEuropean Social Fund (FSE)
dc.date.accessioned2023-05-03T13:34:44Z
dc.date.available2023-05-03T13:34:44Z
dc.date.issued2022-01-03
dc.description.abstractSingle-cell RNA sequencing (scRNA-seq) data provide valuable insights into cellular heterogeneity which is significantly improving the current knowledge on biology and human disease. One of the main applications of scRNA-seq data analysis is the identification of new cell types and cell states. Deep neural networks (DNNs) are among the best methods to address this problem. However, this performance comes with the trade-off for a lack of interpretability in the results. In this work we propose an intelligible pathway-driven neural network to correctly solve cell-type related problems at single-cell resolution while providing a biologically meaningful representation of the data. In this study, we explored the deep neural networks constrained by several types of prior biological information, e.g. signaling pathway information, as a way to reduce the dimensionality of the scRNA-seq data. We have tested the proposed biologically-based architectures on thousands of cells of human and mouse origin across a collection of public datasets in order to check the performance of the model. Specifically, we tested the architecture across different validation scenarios that try to mimic how unknown cell types are clustered by the DNN and how it correctly annotates cell types by querying a database in a retrieval problem. Moreover, our approach demonstrated to be comparable to other less interpretable DNN approaches constrained by using protein-protein interactions gene regulation data. Finally, we show how the latent structure learned by the network could be used to visualize and to interpret the composition of human single cell datasets. Here we demonstrate how the integration of pathways, which convey fundamental information on functional relationships between genes, with DNNs, that provide an excellent classification framework, results in an excellent alternative to learn a biologically meaningful representation of scRNA-seq data. In addition, the introduction of prior biological knowledge in the DNN reduces the size of the network architecture. Comparative results demonstrate a superior performance of this approach with respect to other similar approaches. As an additional advantage, the use of pathways within the DNN structure enables easy interpretability of the results by connecting features to cell functionalities by means of the pathway nodes, as demonstrated with an example with human melanoma tumor cells.
dc.description.versionSi
dc.identifier.citationGundogdu P, Loucera C, Alamo-Alvarez I, Dopazo J, Nepomuceno I. Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data. BioData Min. 2022 Jan 3;15(1):1.
dc.identifier.doi10.1186/s13040-021-00285-4
dc.identifier.issn1756-0381
dc.identifier.pmcPMC8722116
dc.identifier.pmid34980200
dc.identifier.pubmedURLhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8722116/pdf
dc.identifier.unpaywallURLhttps://biodatamining.biomedcentral.com/track/pdf/10.1186/s13040-021-00285-4
dc.identifier.urihttp://hdl.handle.net/10668/20345
dc.issue.number1
dc.journal.titleBioData mining
dc.journal.titleabbreviationBioData Min
dc.language.isoen
dc.organizationHospital Universitario Virgen del Rocío
dc.organizationFundación Pública Andaluz Progreso y Salud-FPS
dc.organizationInstituto de Biomedicina de Sevilla-IBIS
dc.page.number21
dc.provenanceRealizada la curación de contenido 04/06/2025.
dc.publisherBioMed Central Ltd.
dc.pubmedtypeJournal Article
dc.relation.projectID2014–2020
dc.relation.projectIDPID2020-117979RB-I00
dc.relation.projectIDIMP/0019
dc.relation.publisherversionhttps://biodatamining.biomedcentral.com/articles/10.1186/s13040-021-00285-4
dc.rightsAttribution 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectDeep neural network
dc.subjectGene expression
dc.subjectMachine learning
dc.subjectSignaling pathway
dc.subjectSingle cell
dc.subjectTranscriptomics
dc.subjectscRNA-seq
dc.subject.decsAnálisis de expresión génica de una sola célula
dc.subject.decsProteínas
dc.subject.decsNeoplasias
dc.subject.decsMelanoma
dc.subject.decsBiología
dc.subject.decsAnálisis de secuencia de ARN
dc.subject.meshMelanoma
dc.subject.meshSingle-Cell Gene Expression Analysis
dc.subject.meshSignal Transduction
dc.subject.meshNeural Networks, Computer
dc.subject.meshSequence Analysis, RNA
dc.titleIntegrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data.
dc.typeresearch article
dc.type.hasVersionVoR
dc.volume.number15
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
PMC8722116.pdf
Size:
1.13 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
Gundogdu_IntegratingPathway_MaterialSuplementario.pdf
Size:
648.76 KB
Format:
Adobe Portable Document Format