Publication:
KnowSeq R-Bioc package: The automatic smart gene expression tool for retrieving relevant biological knowledge.

Loading...
Thumbnail Image

Date

2021-04-05

Authors

Castillo-Secilla, Daniel
Galvez, Juan Manuel
Carrillo-Perez, Francisco
Verona-Almeida, Marta
Redondo-Sanchez, Daniel
Ortuno, Francisco Manuel
Herrera, Luis Javier
Rojas, Ignacio

Advisors

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier Ltd
Metrics
Google Scholar
Export

Research Projects

Organizational Units

Journal Issue

Abstract

KnowSeq R/Bioc package is designed as a powerful, scalable and modular software focused on automatizing and assembling renowned bioinformatic tools with new features and functionalities. It comprises a unified environment to perform complex gene expression analyses, covering all the needed processing steps to identify a gene signature for a specific disease to gather understandable knowledge. This process may be initiated from raw files either available at well-known platforms or provided by the users themselves, and in either case coming from different information sources and different Transcriptomic technologies. The pipeline makes use of a set of advanced algorithms, including the adaptation of a novel procedure for the selection of the most representative genes in a given multiclass problem. Similarly, an intelligent system able to classify new patients, providing the user the opportunity to choose one among a number of well-known and widespread classification and feature selection methods in Bioinformatics, is embedded. Furthermore, KnowSeq is engineered to automatically develop a complete and detailed HTML report of the whole process which is also modular and scalable. Biclass breast cancer and multiclass lung cancer study cases were addressed to rigorously assess the usability and efficiency of KnowSeq. The models built by using the Differential Expressed Genes achieved from both experiments reach high classification rates. Furthermore, biological knowledge was extracted in terms of Gene Ontologies, Pathways and related diseases with the aim of helping the expert in the decision-making process. KnowSeq is available at Bioconductor (https://bioconductor.org/packages/KnowSeq), GitHub (https://github.com/CasedUgr/KnowSeq) and Docker (https://hub.docker.com/r/casedugr/knowseq).

Description

MeSH Terms

Algorithms
Computational Biology
Humans
Software
Transcriptome

DeCS Terms

Algoritmos
Biología computacional
Humanos
Programas informáticos
Transcriptoma

CIE Terms

Keywords

Bioconductor, Bioinformatics, Classification, Enrichment, Gene expression

Citation

Castillo-Secilla D, Gálvez JM, Carrillo-Perez F, Verona-Almeida M, Redondo-Sánchez D, Ortuno FM, et al. KnowSeq R-Bioc package: The automatic smart gene expression tool for retrieving relevant biological knowledge. Comput Biol Med. 2021 Jun;133:104387.