Breast Cancer QA Project

BC Cancer | Medical Physics Research

Developed an automated quality assurance (QA) tool to evaluate deep learning-generated lymph node contours for breast cancer radiotherapy planning.

Research Poster Thumbnail
Presented at BC Cancer Summit 2024 – Quality Assurance of AI-based IMN Contours

Background

Breast cancer can spread to the internal mammary lymph nodes (IMNs), located near the center of the chest. When involvement is confirmed, IMN irradiation (IMNI) can improve outcomes; however, its use in low-risk patients remains controversial due to potential radiation exposure to the heart, lungs, and contralateral breast. This project contributes to a retrospective population study at BC Cancer investigating how primary tumor location and IMN radiation dose relate to survival outcomes. To support this research, a data pipeline is being developed to automate the import and segmentation of approximately 19,000 patient CT datasets (2005–2014) using Limbus Contour, a deep learning-based auto-segmentation software. My contributions include developing C# scripts for contour transfer and a QA system to evaluate the accuracy of AI-generated contours for future pipeline integration.

Different doses
Example dose distributions for plans that exclude the internal mammary nodes (IMNs, cyan ovals) (left) and include the IMNs (right).

1.0 Project Objectives

Driving Challenges:

2.0 My Contributions

3.0 Methodology

3.1 Implementation

In-house scripts were written in C# using the Eclipse Scripting API (ESAPI) to analyze contour quality. The QA tool flagged patients with contours outside of statistically defined ranges (median ± 2SD, Min/Max, IQR) or with structural faults such as missing slices.

QA Tool Workflow Diagram
Pipeline for contour QA analysis and flagging process.

3.2 Evaluation Approaches

3.3 Technical Challenges

One of the main challenges was that filtering contours based purely on their absolute length along the z-axis could inadvertently exclude patients with atypical anatomy, such as very tall or very short individuals. To address this, we instead used ratios (such as IMN:Lung and IMN:Chest Wall) to normalize contour dimensions relative to each patient’s anatomy.

Ratios
Comparison of IMN:reference organ ratios: IMN:Chest wall (top), IMN:Lung (bottom)

4.0 Key Results

MethodContours Flagged (n=100)Most Common Error
Method A (Clinical)3Missing slices
Method B (AI-reviewed subset)32Z-length errors
Clinician Review47Z-length errors
Comparison of baseline contour sources used for QA evaluation.
spec
Comparison of filter sensitivity and specificity using absolute organ lengths vs length ratios

5.0 Findings & Discussion

Clinical contours were too inconsistent to serve as a reliable ground-truth. The reviewed subset of Limbus contours was more effective for QA. The most common failure mode was incorrect Z-dimension contouring. Absolute length metrics introduced patient-size bias, making normalized ratios a better approach.

6.0 Tools & Technologies

C# Eclipse Scripting API (ESAPI) Medical Imaging Deep Learning QA Data Normalization

7.0 Next Steps

8.0 Acknowledgments

This project was supported by the BC Cancer Foundation’s Sprakkar Award and a research agreement with Limbus AI/Radformation. Collaborators include Amy Frederick (PhD), Alanah Bergman (PhD), Tania Karan (MSc), and Alan Nichol (MD).

9.0 Reflection

This project taught me how to bridge machine learning with clinical workflows, emphasizing the importance of quality assurance in large-scale AI studies. It also deepened my understanding of radiotherapy planning and the practical challenges of integrating AI into healthcare.