Publication:
Developing robust protein analysis profiles to identify bacterial acid phosphatases in genomes and metagenomic libraries.

dc.contributor.authorUdaondo, Zulema
dc.contributor.authorDuque, Estrella
dc.contributor.authorDaddaoua, Abdelali
dc.contributor.authorCaselles, Carlos
dc.contributor.authorRoca, Amalia
dc.contributor.authorPizarro-Tobias, Paloma
dc.contributor.authorRamos, Juan L
dc.date.accessioned2023-02-09T09:35:53Z
dc.date.available2023-02-09T09:35:53Z
dc.date.issued2020-07-16
dc.description.abstractPhylogenetic analysis of more than 4000 annotated bacterial acid phosphatases was carried out. Our analysis enabled us to sort these enzymes into the following three types: (1) class B acid phosphatases, which were distantly related to the other types, (2) class C acid phosphatases and (3) generic acid phosphatases (GAP). Although class B phosphatases are found in a limited number of bacterial families, which include known pathogens, class C acid phosphatases and GAP proteins are found in a variety of microbes that inhabit soil, fresh water and marine environments. As part of our analysis, we developed three profiles, named Pfr-B-Phos, Pfr-C-Phos and Pfr-GAP, to describe the three groups of acid phosphatases. These sequence-based profiles were then used to scan genomes and metagenomes to identify a large number of formerly unknown acid phosphatases. A number of proteins in databases annotated as hypothetical proteins were also identified by these profiles as putative acid phosphatases. To validate these in silico results, we cloned genes encoding candidate acid phosphatases from genomic DNA or recovered from metagenomic libraries or genes synthesized in vitro based on protein sequences recovered from metagenomic data. Expression of a number of these genes, followed by enzymatic analysis of the proteins, further confirmed that sequence similarity searches using our profiles could successfully identify previously unknown acid phosphatases.
dc.identifier.doi10.1111/1462-2920.15138
dc.identifier.essn1462-2920
dc.identifier.pmid32564477
dc.identifier.unpaywallURLhttps://digital.csic.es/bitstream/10261/221172/1/2020_Udaondo_EM_preprint.pdf
dc.identifier.urihttp://hdl.handle.net/10668/15787
dc.issue.number8
dc.journal.titleEnvironmental microbiology
dc.journal.titleabbreviationEnviron Microbiol
dc.language.isoen
dc.organizationIBS
dc.page.number3561-3571
dc.pubmedtypeJournal Article
dc.pubmedtypeResearch Support, Non-U.S. Gov't
dc.rights.accessRightsopen access
dc.subject.meshAcid Phosphatase
dc.subject.meshAmino Acid Sequence
dc.subject.meshBacteria
dc.subject.meshBase Sequence
dc.subject.meshGene Expression Regulation, Bacterial
dc.subject.meshGenome, Bacterial
dc.subject.meshMetagenome
dc.subject.meshMetagenomics
dc.subject.meshPhylogeny
dc.titleDeveloping robust protein analysis profiles to identify bacterial acid phosphatases in genomes and metagenomic libraries.
dc.typeresearch article
dc.type.hasVersionSMUR
dc.volume.number22
dspace.entity.typePublication

Files