Publication: Targeted maximum likelihood estimation for a binary treatment: A tutorial.
dc.contributor.author | Luque-Fernandez, Miguel Angel | |
dc.contributor.author | Schomaker, Michael | |
dc.contributor.author | Rachet, Bernard | |
dc.contributor.author | Schnitzer, Mireille E | |
dc.contributor.funder | Cancer Research UK | |
dc.date.accessioned | 2023-01-25T10:07:15Z | |
dc.date.available | 2023-01-25T10:07:15Z | |
dc.date.issued | 2018-01-09 | |
dc.description.abstract | When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G-formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In contrast propensity score methods require the correct specification of an exposure model. Double-robust methods only require correct specification of either the outcome or the exposure model. Targeted maximum likelihood estimation is a semiparametric double-robust method that improves the chances of correct model specification by allowing for flexible estimation using (nonparametric) machine-learning methods. It therefore requires weaker assumptions than its competitors. We provide a step-by-step guided implementation of TMLE and illustrate it in a realistic scenario based on cancer epidemiology where assumptions about correct model specification and positivity (ie, when a study participant had 0 probability of receiving the treatment) are nearly violated. This article provides a concise and reproducible educational introduction to TMLE for a binary outcome and exposure. The reader should gain sufficient understanding of TMLE from this introductory tutorial to be able to apply the method in practice. Extensive R-code is provided in easy-to-read boxes throughout the article for replicability. Stata users will find a testing implementation of TMLE and additional material in the Appendix S1 and at the following GitHub repository: https://github.com/migariane/SIM-TMLE-tutorial. | |
dc.description.sponsorship | This work was supported by Cancer Research UK grant number C7923/A18525. M.A.L.F. is supported by a Miguel Servet I Investigator Award (grant CP17/00206) from the Carlos III Institute of Health, and M.E.S. is supported by a Canadian Institutes of Health Research New Investigator Award. The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the funding agencies. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication | |
dc.description.version | Si | |
dc.identifier.citation | Luque-Fernandez MA, Schomaker M, Rachet B, Schnitzer ME. Targeted maximum likelihood estimation for a binary treatment: A tutorial. Stat Med. 2018 Jul 20;37(16):2530-2546 | |
dc.identifier.doi | 10.1002/sim.7628 | |
dc.identifier.essn | 1097-0258 | |
dc.identifier.pmc | PMC6032875 | |
dc.identifier.pmid | 29687470 | |
dc.identifier.pubmedURL | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6032875/pdf | |
dc.identifier.unpaywallURL | https://onlinelibrary.wiley.com/doi/pdfdirect/10.1002/sim.7628 | |
dc.identifier.uri | http://hdl.handle.net/10668/12381 | |
dc.issue.number | 16 | |
dc.journal.title | Statistics in medicine | |
dc.journal.titleabbreviation | Stat Med | |
dc.language.iso | en | |
dc.organization | Escuela Andaluza de Salud Pública-EASP | |
dc.organization | Instituto de Investigación Biosanitaria de Granada (ibs.GRANADA) | |
dc.page.number | 2530-2546 | |
dc.publisher | John Wiley & Sons Ltd. | |
dc.pubmedtype | Journal Article | |
dc.pubmedtype | Research Support, Non-U.S. Gov't | |
dc.relation.projectID | C7923/A18525 | |
dc.relation.publisherversion | onlinelibrary.wiley.com/doi/full/10.1002/sim.7628 | |
dc.rights | Attribution 4.0 International | |
dc.rights.accessRights | open access | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | causal inference | |
dc.subject | ensemble Learning | |
dc.subject | machine learning | |
dc.subject | observational studies | |
dc.subject | targeted maximum likelihood estimation | |
dc.subject.decs | Algoritmos | |
dc.subject.decs | Aprendizaje automático | |
dc.subject.decs | Funciones de verosimilitud | |
dc.subject.decs | Humanos | |
dc.subject.decs | Interpretación estadística de datos | |
dc.subject.decs | Métodos epidemiológicos | |
dc.subject.decs | Neoplasias | |
dc.subject.decs | Puntaje de propensión | |
dc.subject.decs | Simulación por computador | |
dc.subject.mesh | Algorithms | |
dc.subject.mesh | Computer Simulation | |
dc.subject.mesh | Data Interpretation, Statistical | |
dc.subject.mesh | Epidemiologic Methods | |
dc.subject.mesh | Humans | |
dc.subject.mesh | Likelihood Functions | |
dc.subject.mesh | Machine Learning | |
dc.subject.mesh | Neoplasms | |
dc.subject.mesh | Propensity Score | |
dc.title | Targeted maximum likelihood estimation for a binary treatment: A tutorial. | |
dc.type | research article | |
dc.type.hasVersion | VoR | |
dc.volume.number | 37 | |
dspace.entity.type | Publication |