TY - JOUR
T1 - Designing a Computer-Aided Detection system for Barrett ’s neoplasia
T2 - Insights in architectural choices, training strategies and inference approaches
AU - Kusters, Carolus H. J.
AU - Boers, Tim G. W.
AU - Jaspers, Tim J. M.
AU - Jong, Martijn R.
AU - van Eijck van Heslinga, Rixta A. H.
AU - Jukema, Jelmer B.
AU - Fockens, Kiki N.
AU - de Groof, Albert J.
AU - Bergman, Jacques J.
AU - van der Sommen, Fons
AU - de With, Peter H. N.
N1 - Publisher Copyright:
© 2025 The Authors
PY - 2025/9/1
Y1 - 2025/9/1
N2 - Background and Objective: Detecting early neoplasia in Barrett's Esophagus (BE) presents significant challenges due to the subtle endoscopic appearance of lesions. Computer-Aided Detection (CADe) systems have the potential to assist endoscopists by enhancing the identification and localization of these early-stage lesions. This study aims to provide comprehensive insights into the structured design and development of effective CADe systems for BE neoplasia detection, addressing unique challenges and complexities of endoscopic imaging and the nature of BE neoplasia. Methods: We conduct an extensive evaluation of architectural choices, training strategies, and inference approaches to optimize CADe systems for BE neoplasia detection. This evaluation includes 10 backbone architectures and 4 semantic segmentation decoders. Training strategies assessed are domain-specific pre-training with a self-supervised learning objective, data augmentation techniques, incorporation of additional video frames and utilization of variants for multi-expert segmentation ground-truth. Evaluation of inference approaches includes various model output fusion techniques and TensorRT conversion. The optimized model is benchmarked against 6 state-of-the-art CADe systems for BE neoplasia detection across 9 diverse test sets. Results: The experimental results demonstrate the impact of incorporating structured design considerations, leading to measurable and incremental performance gains of up to 7.8% on dedicated validation sets. The contributions particularly stand out for the domain-specific pre-training and the use of a hybrid CNN-Transformer architecture, which benefits robustness and overall performance. The model optimized through these design choices achieves statistically significant improvements over existing CADe systems, with p-values in the range p∈[0.0019,0.031]. It outperforms state-of-the-art models in classification and localization, with improvements of up to 12.8% over the second-best performing model. These gains demonstrate enhanced peak performance, generalization capabilities, and robustness across diverse test sets representative of real-world clinical challenges. Conclusion: This study provides critical insights into the structured development of effective CADe systems for Barrett's neoplasia detection. By addressing the specific challenges associated with endoscopic imaging and Barrett's neoplasia, the study demonstrates that careful consideration of architectural choices, training strategies, and inference approaches results in significantly improved CADe performance. These findings underscore the importance of tailored design and optimization in developing robust and clinically effective CADe systems. The code is made publicly available at: https://github.com/BONS-AI-VCA-AMC/Insights-CADe-BE.
AB - Background and Objective: Detecting early neoplasia in Barrett's Esophagus (BE) presents significant challenges due to the subtle endoscopic appearance of lesions. Computer-Aided Detection (CADe) systems have the potential to assist endoscopists by enhancing the identification and localization of these early-stage lesions. This study aims to provide comprehensive insights into the structured design and development of effective CADe systems for BE neoplasia detection, addressing unique challenges and complexities of endoscopic imaging and the nature of BE neoplasia. Methods: We conduct an extensive evaluation of architectural choices, training strategies, and inference approaches to optimize CADe systems for BE neoplasia detection. This evaluation includes 10 backbone architectures and 4 semantic segmentation decoders. Training strategies assessed are domain-specific pre-training with a self-supervised learning objective, data augmentation techniques, incorporation of additional video frames and utilization of variants for multi-expert segmentation ground-truth. Evaluation of inference approaches includes various model output fusion techniques and TensorRT conversion. The optimized model is benchmarked against 6 state-of-the-art CADe systems for BE neoplasia detection across 9 diverse test sets. Results: The experimental results demonstrate the impact of incorporating structured design considerations, leading to measurable and incremental performance gains of up to 7.8% on dedicated validation sets. The contributions particularly stand out for the domain-specific pre-training and the use of a hybrid CNN-Transformer architecture, which benefits robustness and overall performance. The model optimized through these design choices achieves statistically significant improvements over existing CADe systems, with p-values in the range p∈[0.0019,0.031]. It outperforms state-of-the-art models in classification and localization, with improvements of up to 12.8% over the second-best performing model. These gains demonstrate enhanced peak performance, generalization capabilities, and robustness across diverse test sets representative of real-world clinical challenges. Conclusion: This study provides critical insights into the structured development of effective CADe systems for Barrett's neoplasia detection. By addressing the specific challenges associated with endoscopic imaging and Barrett's neoplasia, the study demonstrates that careful consideration of architectural choices, training strategies, and inference approaches results in significantly improved CADe performance. These findings underscore the importance of tailored design and optimization in developing robust and clinically effective CADe systems. The code is made publicly available at: https://github.com/BONS-AI-VCA-AMC/Insights-CADe-BE.
KW - Architectures
KW - Barrett's Esophagus
KW - Endoscopic image analysis
KW - Training and inference strategies
UR - https://www.scopus.com/pages/publications/105008804413
U2 - 10.1016/j.cmpb.2025.108891
DO - 10.1016/j.cmpb.2025.108891
M3 - Article
C2 - 40561841
SN - 0169-2607
VL - 269
JO - Computer methods and programs in biomedicine
JF - Computer methods and programs in biomedicine
M1 - 108891
ER -