Skip to main navigation Skip to search Skip to main content

Multiple mask and boundary scoring R-CNN with cGAN data augmentation for bladder tumor segmentation in WLC videos

  • Nuno R. Freitas*
  • , Pedro M. Vieira
  • , Catarina Tinoco
  • , Sara Anacleto
  • , Jorge F. Oliveira
  • , A. Ismael F. Vaz
  • , M. Pilar Laguna
  • , Estêvão Lima
  • , Carlos S. Lima
  • *Corresponding author for this work
  • University of Minho
  • LABBELS – Associate Laboratory
  • Department of Gastroenterology, Hospital de Braga, Braga, Portugal.
  • University of Aveiro
  • Polytechnic Institute of Leiria
  • Istanbul Medipol University
  • University of Amsterdam
  • Hospital CUF Porto

Research output: Contribution to journalArticleAcademicpeer-review

13 Downloads (Pure)

Abstract

Automatic diagnosis systems capable of handling multiple pathologies are essential in clinical practice. This study focuses on enhancing precise lesion localization, classification and delineation in transurethral resection of bladder tumor (TURBT) to reduce cancer recurrence. Despite deep learning models success, medical applications face challenges like small and limited datasets and poor image characterization, including the absence lack of color/texture modeling. To address these issues, three solutions are proposed: (1) an improved texture-constrained version of the pix2pixHD cGAN for data augmentation, addressing the tradeoff of generating high-quality images with enough stochasticity using the Fréchet Inception Distance (FID) measure. (2) Introducing the Multiple Mask and Boundary Scoring R-CNN (MM&BS R-CNN), a new mask sub-net scheme where multiple masks are generated from the different levels of the mask sub-net pipeline, improving segmentation accuracy by including a new scoring module to refine object boundaries. (3) A novel accelerated training strategy based on the SGD optimizer with the second momentum. Experimental results show significant mAP improvements: the data generation scheme improves by more than 12 %; MM&BS R-CNN proposed architecture is responsible for an improvement of about 1.25 %, and the training algorithm based on the second-order momentum increases mAP by 2–3 %. The simultaneous use of all three proposals improved the state-of-the-art mAP by 17.44 %.
Original languageEnglish
Article number102723
JournalArtificial intelligence in medicine
Volume147
DOIs
Publication statusPublished - 1 Jan 2024
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Lesion localization
  • Multi-pathology detection
  • Texture-constrained GAN

Fingerprint

Dive into the research topics of 'Multiple mask and boundary scoring R-CNN with cGAN data augmentation for bladder tumor segmentation in WLC videos'. Together they form a unique fingerprint.

Cite this