Endometrial Pipelle Biopsy Computer-Aided Diagnosis: A Feasibility Study
Vermorgen, Sanne; Gelton, Thijs; Bult, Peter; Kusters-Vandevelde, Heidi V.N.; Hausnerová, Jitka; Van de Vijver, Koen; Davidson, Ben; Stefansson, Ingunn Marie; Kooreman, Loes F.S.; Qerimi, Adelina; Huvila, Jutta; Gilks, Blake; Shahi, Maryam; Zomer, Saskia; Bartosch, Carla; Pijnenborg, Johanna M.A.; Bulten, Johan; Ciompi, Francesco; Simons, Michiel
Journal article, Peer reviewed
Published version
View/ Open
Date
2023Metadata
Show full item recordCollections
- Department of Clinical Medicine [2099]
- Registrations from Cristin [10467]
Abstract
Endometrial biopsies are important in the diagnostic workup of women who present with abnormal uterine bleeding or hereditary risk of endometrial cancer. In general, approximately 10% of all endometrial biopsies demonstrate endometrial (pre)malignancy that requires specific treatment. As the diagnostic evaluation of mostly benign cases results in a substantial workload for pathologists, artificial intelligence (AI)-assisted preselection of biopsies could optimize the workflow. This study aimed to assess the feasibility of AI-assisted diagnosis for endometrial biopsies (endometrial Pipelle biopsy computer-aided diagnosis), trained on daily-practice whole-slide images instead of highly selected images. Endometrial biopsies were classified into 6 clinically relevant categories defined as follows: nonrepresentative, normal, nonneoplastic, hyperplasia without atypia, hyperplasia with atypia, and malignant. The agreement among 15 pathologists, within these classifications, was evaluated in 91 endometrial biopsies. Next, an algorithm (trained on a total of 2819 endometrial biopsies) rated the same 91 cases, and we compared its performance using the pathologist’s classification as the reference standard. The interrater reliability among pathologists was moderate with a mean Cohen’s kappa of 0.51, whereas for a binary classification into benign vs (pre)malignant, the agreement was substantial with a mean Cohen’s kappa of 0.66. The AI algorithm performed slightly worse for the 6 categories with a moderate Cohen’s kappa of 0.43 but was comparable for the binary classification with a substantial Cohen’s kappa of 0.65. AI-assisted diagnosis of endometrial biopsies was demonstrated to be feasible in discriminating between benign and (pre)malignant endometrial tissues, even when trained on unselected cases. Endometrial premalignancies remain challenging for both pathologists and AI algorithms. Future steps to improve reliability of the diagnosis are needed to achieve a more refined AI-assisted diagnostic solution for endometrial biopsies that covers both premalignant and malignant diagnoses.