Publication:
A New Pipeline for the Normalization and Pooling of Metabolomics Data.

dc.contributor.authorViallon, Vivian
dc.contributor.authorHis, Mathilde
dc.contributor.authorRinaldi, Sabina
dc.contributor.authorBreeur, Marie
dc.contributor.authorGicquiau, Audrey
dc.contributor.authorHemon, Bertrand
dc.contributor.authorOvervad, Kim
dc.contributor.authorTjønneland, Anne
dc.contributor.authorRostgaard-Hansen, Agnetha Linn
dc.contributor.authorRothwell, Joseph A
dc.contributor.authorLecuyer, Lucie
dc.contributor.authorSeveri, Gianluca
dc.contributor.authorKaaks, Rudolf
dc.contributor.authorJohnson, Theron
dc.contributor.authorSchulze, Matthias B
dc.contributor.authorPalli, Domenico
dc.contributor.authorAgnoli, Claudia
dc.contributor.authorPanico, Salvatore
dc.contributor.authorTumino, Rosario
dc.contributor.authorRicceri, Fulvio
dc.contributor.authorVerschuren, W M Monique
dc.contributor.authorEngelfriet, Peter
dc.contributor.authorOnland-Moret, Charlotte
dc.contributor.authorVermeulen, Roel
dc.contributor.authorNøst, Therese Haugdahl
dc.contributor.authorUrbarova, Ilona
dc.contributor.authorZamora-Ros, Raul
dc.contributor.authorRodriguez-Barranco, Miguel
dc.contributor.authorAmiano, Pilar
dc.contributor.authorHuerta, José Maria
dc.contributor.authorArdanaz, Eva
dc.contributor.authorMelander, Olle
dc.contributor.authorOttoson, Filip
dc.contributor.authorVidman, Linda
dc.contributor.authorRentoft, Matilda
dc.contributor.authorSchmidt, Julie A
dc.contributor.authorTravis, Ruth C
dc.contributor.authorWeiderpass, Elisabete
dc.contributor.authorJohansson, Mattias
dc.contributor.authorDossus, Laure
dc.contributor.authorJenab, Mazda
dc.contributor.authorGunter, Marc J
dc.contributor.authorLorenzo Bermejo, Justo
dc.contributor.authorScherer, Dominique
dc.contributor.authorSalek, Reza M
dc.contributor.authorKeski-Rahkonen, Pekka
dc.contributor.authorFerrari, Pietro
dc.contributor.funderInternational Agency for Research on Cancer (IARC)
dc.contributor.funderHealth Research Fund (FIS)—Instituto de Salud Carlos III (ISCIII)
dc.contributor.funderRegional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, and the Catalan Institute of Oncology—ICO (Spain)
dc.date.accessioned2023-02-09T11:51:08Z
dc.date.available2023-02-09T11:51:08Z
dc.date.issued2021-09-13
dc.description.abstractPooling metabolomics data across studies is often desirable to increase the statistical power of the analysis. However, this can raise methodological challenges as several preanalytical and analytical factors could introduce differences in measured concentrations and variability between datasets. Specifically, different studies may use variable sample types (e.g., serum versus plasma) collected, treated, and stored according to different protocols, and assayed in different laboratories using different instruments. To address these issues, a new pipeline was developed to normalize and pool metabolomics data through a set of sequential steps: (i) exclusions of the least informative observations and metabolites and removal of outliers; imputation of missing data; (ii) identification of the main sources of variability through principal component partial R-square (PC-PR2) analysis; (iii) application of linear mixed models to remove unwanted variability, including samples' originating study and batch, and preserve biological variations while accounting for potential differences in the residual variances across studies. This pipeline was applied to targeted metabolomics data acquired using Biocrates AbsoluteIDQ kits in eight case-control studies nested within the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort. Comprehensive examination of metabolomics measurements indicated that the pipeline improved the comparability of data across the studies. Our pipeline can be adapted to normalize other molecular data, including biomarkers as well as proteomics data, and could be used for pooling molecular datasets, for example in international consortia, to limit biases introduced by inter-study variability. This versatility of the pipeline makes our work of potential interest to molecular epidemiologists.
dc.description.sponsorshipThe coordination of EPIC is financially supported by International Agency for Research on Cancer (IARC) and by the Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London which has additional infrastructure support provided by the NIHR Imperial Biomedical Research Centre (BRC). The national cohorts are supported by: Danish Cancer Society (Denmark); Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l’Education Nationale, Institut National de la Santé et de la Recherche Médicale (INSERM) (France); German Cancer Aid, German Cancer Research Center (DKFZ), German Institute of Human Nutrition Potsdam-Rehbruecke (DIfE), Federal Ministry of Education and Research (BMBF) (Germany); Associazione Italiana per la Ricerca sul Cancro-AIRC-Italy, Compagnia di SanPaolo and National Research Council (Italy); Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), Statistics Netherlands (The Netherlands); Health Research Fund (FIS)—Instituto de Salud Carlos III (ISCIII), Regional Governments of Andalucía, Asturias, Basque Country, Murcia and Navarra, and the Catalan Institute of Oncology—ICO (Spain); Swedish Cancer Society, Swedish Research Council and County Councils of Skåne and Västerbotten (Sweden); Cancer Research UK (14136 to EPIC-Norfolk; C8221/A29017 to EPIC-Oxford), Medical Research Council (1000143 to EPIC-Norfolk; MR/M012190/1 to EPIC-Oxford) (United Kingdom). IDIBELL acknowledges support from the Generalitat de Catalunya through the CERCA Program. R.Z.-R. would like to thank the “Miguel Servet” program (CPII20/00009) from the Institute of Health Carlos III (Spain) and the European Social Fund (ESF). The breast cancer study (BREA) was funded by the French National Cancer Institute (grant number 2015-166). The colorectal cancer studies (CLRT1 and CRLT2) were funded by World Cancer Research Fund (MG; reference: 2013/1002; www.wcrf.org/, accessed on 14 September 2021), the European Commission (MG; FP7: BBMRI-LPC; reference: 313010; https://ec.europa.eu/, accessed on 14 September 2021). The endometrial cancer study (ENDO) was funded by Cancer Research UK (grant number C19335/A21351). The kidney study (KIDN) was funded by the World Cancer Research Fund (MJ; reference: 2014/1193; www.wcrf.org/, accessed on 14 September 2021) and the European Commission (MJ; FP7: BBMRI-LPC; reference: 313010; https://ec.europa.eu/, accessed on 14 September 2021). The generation of metabolomics data in the gallbladder cancer study (GLBD) was supported by the European Union within the initiative “Biobanking and Biomolecular Research Infrastructure—Large Prospective Cohorts” (Collaborative study “Identification of biomarkers for gallbladder cancer risk prediction—Towards personalized prevention of an orphan disease”) under grant agreement no. 313010 (BBMRI-LPC). The liver cancer study (LIVE) was supported in part by the French National Cancer Institute (L’Institut National du Cancer; INCa; grant numbers 2009-139 and 2014-1-RT-02-CIRC-1; PI: M. Jenab) and by internal funds of the IARC. For the participants in the prostate cancer study (PROS), sample retrieval and preparation and assays of metabolites were supported by Cancer Research UK (C8221/A19170), and funding for grant 2014/1183 was obtained from the World Cancer Research Fund (WCRF UK), as part of the World Cancer Research Fund International grant program. Mathilde His’ work reported here was undertaken during the tenure of a postdoctoral fellowship awarded by the International Agency for Research on Cancer, financed by the Fondation ARC.
dc.description.versionSi
dc.identifier.citationViallon V, His M, Rinaldi S, Breeur M, Gicquiau A, Hemon B, et al. A New Pipeline for the Normalization and Pooling of Metabolomics Data. Metabolites. 2021 Sep 17;11(9):631.
dc.identifier.doi10.3390/metabo11090631
dc.identifier.issn2218-1989
dc.identifier.pmcPMC8467830
dc.identifier.pmid34564446
dc.identifier.pubmedURLhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467830/pdf
dc.identifier.unpaywallURLhttps://www.mdpi.com/2218-1989/11/9/631/pdf?version=1631874206
dc.identifier.urihttp://hdl.handle.net/10668/18539
dc.issue.number9
dc.journal.titleMetabolites
dc.journal.titleabbreviationMetabolites
dc.language.isoen
dc.organizationEscuela Andaluza de Salud Pública-EASP
dc.organizationInstituto de Investigación Biosanitaria de Granada (ibs.GRANADA)
dc.provenanceRealizada la curación de contenido 24/07/2024
dc.publisherMDPI AG
dc.pubmedtypeJournal Article
dc.relation.publisherversionhttps://www.mdpi.com/resolver?pii=metabo11090631
dc.rightsAttribution 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectcancer epidemiology
dc.subjectmetabolites
dc.subjectmetabolomics
dc.subjectnormalization
dc.subjectpooling
dc.subjecttechnical variability
dc.subject.decsBiomarcadores
dc.subject.decsEpidemiólogos
dc.subject.decsEstudios prospectivos
dc.subject.decsLaboratorios
dc.subject.decsNeoplasias
dc.subject.decsProteómica
dc.subject.decsSesgo
dc.subject.meshEpidemiologists
dc.subject.meshLaboratories
dc.subject.meshProspective Studies
dc.subject.meshProteomics
dc.subject.meshBiomarkers
dc.subject.meshBias
dc.subject.meshNeoplasms
dc.titleA New Pipeline for the Normalization and Pooling of Metabolomics Data.
dc.typeresearch article
dc.type.hasVersionVoR
dc.volume.number11
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
PMC8467830.pdf
Size:
3.06 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Viallon_ANew_MaterialSuplementario.pdf
Size:
1.25 MB
Format:
Adobe Portable Document Format