Objective The aim of the study was to compare the distribution of Prostate Imaging and Reporting Data System (PI-RADS) scores, interreader agreement, and diagnostic performance of PI-RADS v2.0 and v2.1… Click to show full abstract
Objective The aim of the study was to compare the distribution of Prostate Imaging and Reporting Data System (PI-RADS) scores, interreader agreement, and diagnostic performance of PI-RADS v2.0 and v2.1 for transition zone (TZ) lesions. Methods The study included 202 lesions in 202 patients who underwent 3T prostate magnetic resonance imaging showing a TZ lesion that was later biopsied with magnetic resonance imaging/ultrasound fusion. Five abdominal imaging faculty reviewed T2-weighted imaging and high b value/apparent diffusion coefficient images in 2 sessions. Cases were randomized using a crossover design whereby half in the first session were reviewed using v2.0 and the other half using v2.1, and vice versa for the 2nd session. Readers provided T2-weighted imaging and DWI scores, from which PI-RADS scores were derived. Results Interreader agreement for all PI-RADS scores had κ of 0.37 (v2.0) and 0.26 (v2.1). For 4 readers, the percentage of lesions retrospectively scored PI-RADS 1 increased greater than 5% and PI-RADS 2 score decreased greater than 5% from v2.0 to v2.1. For 2 readers, the percentage scored PI-RADS 3 decreased greater than 5% and, for 2 readers, increased greater than 5%. The percentage of PI-RADS 4 and 5 lesions changed less than 5% for all readers. For the 4 readers with increased frequency of PI-RADS 1 using v2.1, 4% to 16% were Gleason score ≥3 + 4 tumor. Frequency of Gleason score ≥3 + 4 in PI-RADS 3 lesions increased for 2 readers and decreased for 1 reader. Sensitivity of PI-RADS of 3 or greater for Gleason score ≥3 + 4 ranged 76% to 90% (v2.0) and 69% to 96% (v2.1). Specificity ranged 32% to 64% (v2.0) and 25% to 72% (v2.1). Positive predictive value ranged 43% to 55% (v2.0) and 41% to 58% (v2.1). Negative predictive value ranged 82% to 87% (v2.0) and 81% to 91% (v2.1). Conclusions Poor interreader agreement and lack of improvement in diagnostic performance indicate an ongoing need to refine evaluation of TZ lesions.
               
Click one of the above tabs to view related content.