Publication:
PyCellBase, an efficient python package for easy retrieval of biological data from heterogeneous sources.

dc.contributor.authorPerez-Gil, Daniel
dc.contributor.authorLopez, Francisco J
dc.contributor.authorDopazo, Joaquin
dc.contributor.authorMarin-Garcia, Pablo
dc.contributor.authorRendon, Augusto
dc.contributor.authorMedina, Ignacio
dc.date.accessioned2023-01-25T13:32:22Z
dc.date.available2023-01-25T13:32:22Z
dc.date.issued2019-03-28
dc.description.abstractBiological databases and repositories are incrementing in diversity and complexity over the years. This rapid expansion of current and new sources of biological knowledge raises serious problems of data accessibility and integration. To handle the growing necessity of unification, CellBase was created as an integrative solution. CellBase provides a centralized NoSQL database containing biological information from different and heterogeneous sources. Access to this information is done through a RESTful web service API, which provides an efficient interface to the data. In this work we present PyCellBase, a Python package that provides programmatic access to the rich RESTful web service API offered by CellBase. This package offers a fast and user-friendly access to biological information without the need of installing any local database. In addition, a series of command-line tools are provided to perform common bioinformatic tasks, such as variant annotation. CellBase data is always available by a high-availability cluster and queries have been tuned to ensure a real-time performance. PyCellBase is an open-source Python package that provides an efficient access to heterogeneous biological information. It allows to perform tasks that require a comprehensive set of knowledge resources, as for example variant annotation. Queries can be easily fine-tuned to retrieve the desired information of particular biological features. PyCellBase offers the convenience of an object-oriented scripting language and provides the ability to integrate the obtained results into other Python applications and pipelines.
dc.identifier.doi10.1186/s12859-019-2726-4
dc.identifier.essn1471-2105
dc.identifier.pmcPMC6438028
dc.identifier.pmid30922213
dc.identifier.pubmedURLhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6438028/pdf
dc.identifier.unpaywallURLhttps://doi.org/10.1186/s12859-019-2726-4
dc.identifier.urihttp://hdl.handle.net/10668/13767
dc.issue.number1
dc.journal.titleBMC bioinformatics
dc.journal.titleabbreviationBMC Bioinformatics
dc.language.isoen
dc.organizationFundación Pública Andaluz Progreso y Salud-FPS
dc.organizationHospital Universitario Virgen del Rocío
dc.page.number159
dc.pubmedtypeJournal Article
dc.rightsAttribution 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectAnnotation
dc.subjectCellBase
dc.subjectDatabase
dc.subjectPython
dc.subjectRESTful
dc.subjectRepository
dc.subjectVariant
dc.subject.meshComputational Biology
dc.subject.meshDatabases, Factual
dc.subject.meshSoftware
dc.subject.meshUser-Computer Interface
dc.titlePyCellBase, an efficient python package for easy retrieval of biological data from heterogeneous sources.
dc.typeresearch article
dc.type.hasVersionVoR
dc.volume.number20
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
PMC6438028.pdf
Size:
702.22 KB
Format:
Adobe Portable Document Format