RT Journal Article
T1 Inter-Reader Reliability of Early FDG-PET/CT Response Assessment Using the Deauville Scale after 2 Cycles of Intensive Chemotherapy (OEPA) in Hodgkin's Lymphoma.
A1 Kluge, Regine
A1 Chavdarova, Lidia
A1 Hoffmann, Martha
A1 Kobe, Carsten
A1 Malkowski, Bogdan
A1 Montravers, Françoise
A1 Kurch, Lars
A1 Georgi, Thomas
A1 Dietlein, Markus
A1 Wallace, W Hamish
A1 Karlen, Jonas
A1 Fernández-Teijeiro, Ana
A1 Cepelova, Michaela
A1 Wilson, Lorrain
A1 Bergstraesser, Eva
A1 Sabri, Osama
A1 Mauz-Körholz, Christine
A1 Körholz, Dieter
A1 Hasenclever, Dirk
K1 Randomized controlled trial
K1 Hodgkin disease
K1 Musculoskeletal system
K1 Multicenter study
K1 Brown adipose tissue
K1 Chemotherapy
K1 Positron emission tomography
K1 Lymphomas
K1 Pediatrics
K1 Hodgkin lymphoma
K1 Lymph nodes
K1 Thymus
AB PURPOSE: The five-point Deauville (D) scale is widely used to assess interim PET metabolic response to chemotherapy in Hodgkin lymphoma (HL) patients. An International Validation Study reported good concordance among reviewers in ABVD-treated advanced-stage HL patients for the binary discrimination between scores D1,2,3 and scores D4,5. Inter-reader reliability of the whole scale is not well characterised. METHODS: Five international expert readers scored 100 interim PET/CT scans from paediatric HL patients. Scans were acquired in 51 European hospitals after two courses of OEPA chemotherapy (according to the EuroNet-PHL-C1 study). Images were interpreted in direct comparison with staging PET/CTs. RESULTS: The probability that two random readers agree on the five-point D score of a random case is only 42% (global kappa = 0.24). Aggregating to a three-point scale (D1,2 vs. D3 vs. D4,5) improves concordance to 60% (kappa = 0.34). Concordance if one of two readers assigns a given score is 70% for score D1,2, only 36% for score D3, and 64% for D4,5. Concordance for the binary decisions is 67% for D1,2 vs. D3,4,5 and 86% for D1,2,3 vs. D4,5 (kappa = 0.36 and 0.56, respectively). If one reader assigns D1,2,3, concordance probability is 92%, but only 64% if D4,5 is called. Discrepancies occur mainly in the mediastinum, neck and skeleton. CONCLUSION: Inter-reader reliability of the five-point D scale is poor in this interobserver analysis of paediatric patients who underwent OEPA. Inter-reader variability is maximal in cases assigned to D2 or D3. The binary distinction D1,2,3 versus D4,5 is the most reliable criterion for clinical decision making.
PB Public Library of Science
YR 2016
FD 2016-03-10
LK http://hdl.handle.net/10668/2738
UL http://hdl.handle.net/10668/2738
LA en
NO Kluge R, Chavdarova L, Hoffmann M, Kobe C, Malkowski B, Montravers F, et al. Inter-Reader Reliability of Early FDG-PET/CT Response Assessment Using the Deauville Scale after 2 Cycles of Intensive Chemotherapy (OEPA) in Hodgkin's Lymphoma. PLoS ONE. 2016; 11(3):e0149072
NO Journal Article
DS RISalud
RD Apr 19, 2025