• search hit 34 of 81
Back to Result List

Universal lesion detection utilising cascading R-CNNs and a novel video pretraining method

  • According to the WHO, approximately one in six individuals worldwide will develop some form of cancer in their lifetime. Therefore, accurate and early detection of lesions is crucial for improving the probability of successful treatment, reducing the need for more invasive treatments, and leading to higher rates of survival. In this work, we propose a novel R-CNN approach with pretraining and data augmentation for universal lesion detection. In particular, we incorporate an asymmetric 3D context fusion (A3D) for feature extraction from 2D CT images with Hybrid Task Cascade. By doing so, we supply the network with further spatial context, refining the mask prediction over several stages and making it easier to distinguish hard foregrounds from cluttered backgrounds. Moreover, we introduce a new video pretraining method for medical imaging by using consecutive frames from the YouTube VOS video segmentation dataset which improves our model’s sensitivity by 0.8 percentage points at a falseAccording to the WHO, approximately one in six individuals worldwide will develop some form of cancer in their lifetime. Therefore, accurate and early detection of lesions is crucial for improving the probability of successful treatment, reducing the need for more invasive treatments, and leading to higher rates of survival. In this work, we propose a novel R-CNN approach with pretraining and data augmentation for universal lesion detection. In particular, we incorporate an asymmetric 3D context fusion (A3D) for feature extraction from 2D CT images with Hybrid Task Cascade. By doing so, we supply the network with further spatial context, refining the mask prediction over several stages and making it easier to distinguish hard foregrounds from cluttered backgrounds. Moreover, we introduce a new video pretraining method for medical imaging by using consecutive frames from the YouTube VOS video segmentation dataset which improves our model’s sensitivity by 0.8 percentage points at a false positive rate of one false positive per image. Finally, we apply data augmentation techniques and analyse their impact on the overall performance of our models at various false positive rates. Using our introduced approach, it is possible to increase the A3D baseline’s sensitivity by 1.04 percentage points in mFROC.show moreshow less

Export metadata

Statistics

Number of document requests

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Shahin AmiriparianORCiDGND, Alexander Meiners, Daniel Rothenpieler, Alexander KathanORCiD, Maurice GerczukORCiD, Björn W. SchullerORCiDGND
Frontdoor URLhttps://opus.bibliothek.uni-augsburg.de/opus4/117368
ISBN:979-8-3503-2447-1OPAC
Parent Title (English):2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 24-27 July 2023, Sydney, Australia
Publisher:IEEE
Place of publication:Piscataway, NJ
Type:Conference Proceeding
Language:English
Year of first Publication:2023
Release Date:2024/12/09
First Page:1
Last Page:4
DOI:https://doi.org/10.1109/embc40787.2023.10340964
Institutes:Fakultät für Angewandte Informatik
Fakultät für Angewandte Informatik / Institut für Informatik
Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Embedded Intelligence for Health Care and Wellbeing
Nachhaltigkeitsziele
Nachhaltigkeitsziele / Ziel 3 - Gesundheit und Wohlergehen
Dewey Decimal Classification:6 Technik, Medizin, angewandte Wissenschaften / 61 Medizin und Gesundheit / 610 Medizin und Gesundheit