
Introduction

There are two clinical scenarios in which delineation of the lumpectomy cavity (LC) is required during breast cancer radiotherapy: boost after whole breast irradiation (WBI) and accelerated partial breast irradiation (APBI). WBI after organ-sparing surgery reduces the risk of breast cancer recurrence and mortality.1,2 Delivery of a boost dose to the LC clinical target volume (CTVLC) is an important component of this treatment. It has been shown to improve local control, at the cost of an increased risk of moderate to severe fibrosis.3 APBI is increasingly utilized in selected groups of patients. Shorter overall treatment time, reduced radiation exposure of the organs at risk and comparable disease control make it a good alternative to WBI for early stage disease.4-6 High accuracy of contouring and precision of treatment delivery are needed to optimize the delicate therapeutic ratio between treatment benefit and side effects. This is especially important in the setting of highly conformal dose delivery to a small volume, such as boost after WBI, and becomes critical during APBI, where the entire dose is delivered to the CTVLC. Inter-observer variation (IOV) in contouring is one of the main contributors to the cumulative budget of uncertainties in radiotherapy.7 It may undermine the gain of high-precision technologies, blur the dose-effect relations and compromise treatment comparisons. For the individual patient, geographical miss of the target volume leads to an increased chance of relapse, while unnecessary irradiation of normal tissues increases the probability of side effects. Adherence to common contouring guidelines, adequate training and high quality imaging are the most important strategies to reduce contouring variation.7-11

Currently, CT is the standard imaging modality for CTVLC contouring. Because of its poor soft tissue depiction, placement of surgical clips or markers at the edges of the LC is recommended to improve tumour bed delineation.12-16 However, the reliability of inserted markers as a surrogate for the tumour bed is a matter of debate9,16-18 and omission of their placement in some patients poses a special challenge to the radiation oncologist during CTVLC delineation.19 The role of MRI for contouring in breast cancer radiotherapy is controversial11,20-23 and the evidence to support its use in patients without markers in the tumour bed is scarce.24,25 In our present study, we aimed to (1) quantify the IOV and (2) assess the accuracy of CT- and MRI-based CTVLC contouring in patients without clips in the LC. Our null hypothesis was that there is no statistically significant difference between MRI- and CT-based contouring in this subgroup of patients.

Patients and methods
Patients and images

Anonymized image data sets of patients with pathology-proven unilateral invasive ductal carcinoma of the breast, treated by breast conserving surgery and adjuvant radiotherapy in 2013 were considered for this study. Cases without surgical clips in the LC and available CT- and MRI-simulator data sets were eligible for inclusion. Adjuvant radiotherapy had to consist of WBI followed by CTVLC boost. Patients who underwent oncoplastic surgery were excluded. All radiotherapy was completed before initiation of the study and the presented work did not interfere with routine management of our patients. The study protocol was reviewed and given ethical approval by the Institutional Medical Research Centre which governs our Institutional Review Board (Trial registration number: 15329/15).

Acquisition of CT and MRI simulator images

During simulation and treatment, patients were placed in a comfortable and reproducible supine position with arms abducted over the head. For CT simulation, patients were placed on a breast board and wires were used to identify the surgical scars and drainage sites. A non-contrast volumetric CT study with contiguous slices of 5 mm thickness was obtained from the level of the body of the mandible to at least 5 cm below the inframammary fold (Siemens Somatom Sensation ® 16-slice scanner, 120 kVp, approximately 90 mAs, voxel size of 1.26 × 1.26 × 5 mm, matrix size of 512 × 512). MR images were obtained on a dedicated wide-bore 1.5T 450w MRI simulator (General Electric Optima ®) equipped with radiotherapy applications. The MRI in this study was a simulation procedure and was acquired supine as per CT planning, with efforts made to replicate the positioning as closely as achievable. The arms were elevated and cradled, and external alignment lasers were used to align the tattoos, although the incline was not applied because of the limited MRI bore diameter. The supine positioning achieved a deformation of the breast tissue more similar to that on the planning CT than a prone diagnostic arrangement would. General purpose Flex coils were used. Our breast MRI protocol included T2 weighted FSE PROPELLER, proton density with fat saturation, Dixon type LAVA-Flex and balanced steady state gradient echo FIESTA imaging sequences. All sequences were acquired axially with a matrix size of 288 × 288, a field of view of approximately 42 cm and a slice thickness of 5 mm. For the T2 FSE sequence, mean system related geometric distortions after the application of the vendor-provided correction algorithms were 0.5, 0.9 and 1.9 mm for radial distances of 100, 200 and 250 mm, respectively. Anonymized non-registered CT and MRI data sets were imported to the ECLIPSE workstation (Varian Medical Systems ®) for contouring.

Cavity visualization score and contouring

Cavity visualization score (CVS) was recorded by each observer for all cases and both modalities, using the standardized numeric scale ranging from 1 (cavity not visualized) to 5 (all cavity margins clearly visualized).26 CTVLC was contoured separately on CT and MRI by five experienced radiation oncologists (observers), who were blinded to each other’s delineations. The observers had access to clinical and imaging findings at the time of diagnosis and to surgical and pathology reports. They were asked to respect the following instructions during delineation:

Adjust window level to optimize visualization of the region of interest.

Contour on axial images.

When contouring on the MRI, use the T2 weighted FSE images as primary data set and take the information from other sequences into account.

Allow for a minimum interval of 2-weeks between CT- and MRI-based contouring to minimize bias resulting from familiarity with the cases.

Create CTVLC according to our departmental guidelines:

First, delineate the lumpectomy cavity (LC) as intra-mammary post-lumpectomy changes. During delineation, compare findings with contralateral anatomy to identify differences in geometry, tissue architecture, formation of seroma, hematoma or scar tissue, fat replacement on CT and decreased signal intensity on MRI. While contouring, take all available information into account to identify the LC (tumour location on preoperative imaging, pathology reports, lumpectomy scar on the skin, etc.).

To define CTVLC, add a 15 mm uniform margin around the LC and edit it to exclude the chest wall and skin.

Finally, the expert consensus (EC) contours of CTVLC were delineated on CT and MRI for all cases. EC contouring was led by the senior radiation oncologist, taking the opinions of all five observers into account.

Analysis of contouring uncertainties

Contouring uncertainties on CT and MRI were analysed from two perspectives, reflecting our study objectives: (1) to quantify the IOV, global variability between delineations was assessed and (2) to quantify contouring accuracy, deviations of observers from the EC contours were analysed. Contour analysis tool 1 (CAT 1) software and related methodology27,28 was used for volumetric and distance-based computations.

Inter-observer variation

Mean volumes and standard deviations (SD) of CTVLC were calculated for each study case for the CT- and MRI-based approaches. Inter-observer coefficients of variance (CoV, the ratio between SD and mean value) and ratios between the smallest and largest volume were determined for each case and modality. The inter-observer conformity index was calculated based on the generalized formalism (CIgen), which is independent of the number of analysed volumes.29 It equals the sum of intersections of all possible volume pairs divided by the sum of their unions.
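The volumetric measures described above can be illustrated with a minimal Python sketch. It is not the CAT 1 implementation used in the study: contours are represented here as sets of voxel indices, the toy observers and volumes are made up, and the use of the sample SD (rather than the population SD) is an assumption.

```python
from itertools import combinations
from statistics import mean, stdev


def coefficient_of_variance(volumes):
    """Ratio between SD and mean (sample SD is an assumption of this sketch)."""
    return stdev(volumes) / mean(volumes)


def smallest_to_largest_ratio(volumes):
    """Ratio between the smallest and the largest delineated volume."""
    return min(volumes) / max(volumes)


def generalized_conformity_index(contours):
    """Sum of pairwise intersections divided by the sum of pairwise unions.

    Each contour is a set of voxel indices, so volumes are voxel counts.
    """
    pairs = list(combinations(contours, 2))
    intersections = sum(len(a & b) for a, b in pairs)
    unions = sum(len(a | b) for a, b in pairs)
    return intersections / unions


# Hypothetical one-dimensional "contours" from three observers.
obs_a = set(range(0, 10))
obs_b = set(range(2, 12))
obs_c = set(range(1, 11))

ci_gen = generalized_conformity_index([obs_a, obs_b, obs_c])

# Hypothetical volumes in cm3 for one case.
toy_volumes = [198.0, 210.0, 186.0]
cov = coefficient_of_variance(toy_volumes)
ratio = smallest_to_largest_ratio(toy_volumes)
```

Because the numerator and denominator are both summed over all observer pairs, the index does not grow or shrink merely because more observers are added, which is the property the text attributes to CIgen.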

Contouring accuracy

We used the EC as a surrogate for the “ground truth” contour. Deviations from EC were measured on CT and MRI for all cases and observers. The accuracy index (AI) was determined according to the paired CI formalism.29 AI was calculated as the ratio between the common and the encompassing volume for each pair of EC and observer’s contour. Further, mean absolute distances between the contours of individual observers and EC were calculated in the contouring plane. This method has been used before and is described in detail elsewhere.27,28,30 Briefly, the inter-delineation distances (IDD) were calculated between each voxel of the observer’s contour and the nearest voxel of the EC contour in 72 angular steps of 5 degrees for all slices.27,28,30
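As a rough illustration of the two accuracy measures, the following Python sketch computes a paired overlap ratio and a mean nearest-point distance on a single slice. It is a simplified stand-in, not the published CAT 1 method: contours are assumed to be voxel sets (for AI) or lists of in-plane points sampled every 5 degrees (for IDD), and the concentric circles are hypothetical test shapes.

```python
import math


def accuracy_index(observer, consensus):
    """Common volume divided by encompassing volume for one contour pair."""
    return len(observer & consensus) / len(observer | consensus)


def circle_points(radius_mm, steps=72):
    """Sample a circle centred on the origin in `steps` equal angular steps."""
    return [
        (radius_mm * math.cos(2 * math.pi * k / steps),
         radius_mm * math.sin(2 * math.pi * k / steps))
        for k in range(steps)
    ]


def mean_nearest_distance(observer_pts, consensus_pts):
    """Mean distance from each observer point to its nearest consensus point."""
    nearest = [
        min(math.dist(p, q) for q in consensus_pts)
        for p in observer_pts
    ]
    return sum(nearest) / len(nearest)


# Hypothetical voxel sets: 8 shared voxels out of 12 in the union.
ai = accuracy_index(set(range(0, 10)), set(range(2, 12)))

# Hypothetical slice: observer contour 2 mm outside a circular EC contour.
idd = mean_nearest_distance(circle_points(12.0), circle_points(10.0))
```

The overlap ratio is insensitive to where the disagreement lies, whereas the distance measure reports it in millimetres, which is why the two are reported side by side.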

Statistical analysis

The statistical design of the study did not entail calculation of the sample size and the number of observers. Instead, all evaluable cases satisfying the inclusion criteria up to the point of study initiation and all available observers from our department were included to maximize the statistical power. Continuous variables were presented as mean values with standard deviations. The paired sample t-test was used to compare mean values of the analysed variables between CT and MRI. A p-value of < 0.05 was considered statistically significant. SPSS for Windows (SPSS Inc., 1989–2015, Chicago, Illinois) was used for data analysis.
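The paired comparison can be sketched in a few lines of Python using only the standard library. The analysis in the study was run in SPSS; this sketch merely recomputes the t statistic from the per-case CIgen values listed in Table 1 and compares it with the two-sided 5% critical value of Student's t distribution for 11 degrees of freedom (2.201). The function and variable names are illustrative.

```python
from math import sqrt
from statistics import mean, stdev

# Per-case CIgen values transcribed from Table 1 (cases 1 to 12).
ci_ct = [0.85, 0.69, 0.56, 0.64, 0.46, 0.76,
         0.48, 0.79, 0.77, 0.68, 0.66, 0.69]
ci_mri = [0.87, 0.77, 0.75, 0.69, 0.63, 0.76,
          0.66, 0.80, 0.71, 0.76, 0.79, 0.74]


def paired_t_statistic(x, y):
    """t statistic for the mean of the paired differences y - x."""
    diffs = [b - a for a, b in zip(x, y)]
    standard_error = stdev(diffs) / sqrt(len(diffs))
    return mean(diffs) / standard_error


t_stat = paired_t_statistic(ci_ct, ci_mri)

# Two-sided 5% critical value of Student's t for 12 - 1 = 11 degrees of freedom.
T_CRIT_DF11 = 2.201
significant = abs(t_stat) > T_CRIT_DF11
```

With these values the statistic comes out at roughly 3.3, above the critical value, in line with the significant difference in CIgen reported in the Results.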

Results
Cavity visualization score

The use of MRI improved the cavity visualization in 11 out of 12 (92%) cases (Figure 1). In the remaining case, mean CVS was equal (3.0) on both modalities. Mean CVS was 3.88 +/− 0.99 and 3.05 +/− 1.07 for MRI and CT, respectively (p = 0.001). The correlation of CIgen and AI with CVS is shown in Figure 1. CIgen and AI improved with increasing CVS for both contouring approaches. An example of contouring variation for two selected cases with a high and a low CVS is presented in Figure 2.

Figure 1. (A) Generalized conformity index (CIgen) and (B) accuracy index (AI) as a function of the cavity visualization score (CVS) for CT and MRI based contouring of lumpectomy cavity clinical target volume. None of the patients had surgical clips inserted in the tumor bed. Case numbers are indicated for each modality.

Figure 2. CT and MRI based contouring in two examples with high and low cavity visualization scores (CVS). Observers’ delineations are white and expert consensus (EC) contours black. (A) Case with a CVS of 4.8 on CT and 5 on MRI: mean generalized conformity index (CIgen), accuracy index (AI) and inter-delineation distance (IDD) were 0.79, 0.85 and 2.4 mm on CT and 0.80, 0.86 and 2.2 mm on MRI. (B) Case with a CVS of 2 on CT and 3 on MRI: mean CIgen, AI and IDD were 0.46, 0.61 and 6 mm for CT and 0.63, 0.73 and 4.5 mm for MRI.

Inter-observer variation

The results of the IOV analysis are presented in Table 1 and Figure 1A. Mean CIgen for MRI was significantly superior to CIgen for CT (0.74 +/− 0.07 vs. 0.67 +/− 0.12, p = 0.007). CIgen for MRI was higher than for CT in 10 (83%) cases. In case number 9, CT-based CIgen was superior to MRI (0.77 vs. 0.71) and in case number 6 they were identical (0.76). Mean volumes of CTVLC were 154 +/− 26 cm3 on CT and 152 +/− 19 cm3 on MRI (non-significant difference). Mean volumetric CoV was non-significantly lower for MRI when compared with CT (12% vs. 18%; p = 0.1). Similarly, the average ratio between the smallest and largest delineated volume was non-significantly higher for MRI when compared with CT (0.8 +/− 0.1 vs. 0.7 +/− 0.1; p = 0.1).

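Table 1. Results for inter-observer variation in contouring. The difference in mean generalized conformity index (CIgen) between the CT and MRI based contouring was statistically significant (p = 0.007)

| Case | CT: Mean V [cm3] (SD) | CT: CoV [%] | CT: CIgen | MRI: Mean V [cm3] (SD) | MRI: CoV [%] | MRI: CIgen |
|---|---|---|---|---|---|---|
| 1 | 198 (10) | 5 | 0.85 | 106 (4) | 4 | 0.87 |
| 2 | 241 (37.5) | 16 | 0.69 | 256 (28) | 11 | 0.77 |
| 3 | 108 (20.7) | 19 | 0.56 | 125 (10) | 8 | 0.75 |
| 4 | 75 (18.1) | 24 | 0.64 | 92 (12) | 13 | 0.69 |
| 5 | 175 (47.9) | 27 | 0.46 | 217 (68) | 31 | 0.63 |
| 6 | 140 (10.1) | 7 | 0.76 | 125 (14) | 11 | 0.76 |
| 7 | 103 (39.7) | 39 | 0.48 | 64 (2) | 4 | 0.66 |
| 8 | 180 (22.5) | 12 | 0.79 | 158 (14) | 9 | 0.8 |
| 9 | 135 (19.8) | 15 | 0.77 | 126 (20) | 16 | 0.71 |
| 10 | 204 (44.3) | 22 | 0.68 | 215 (19) | 9 | 0.76 |
| 11 | 99 (14.4) | 15 | 0.66 | 135 (11) | 8 | 0.79 |
| 12 | 195 (27.3) | 14 | 0.69 | 210 (29) | 14 | 0.74 |
| MEAN (SD) | 154 (26) | 18 | 0.67 (0.12) | 152 (19) | 12 | 0.74 (0.07) |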

CoV = Coefficient of Variance; CTVLC = Clinical Target Volume of Lumpectomy Cavity; SD = Standard Deviation

Contouring accuracy

Results of the analysis of deviations from EC contours are shown in Table 2 and Figure 1B. Observers placed all contours in the correct breast quadrant. Mean AI was higher for MRI when compared with CT (0.81 +/− 0.04 vs. 0.76 +/− 0.07; p = 0.004). MRI-based mean AI was superior to CT in 10 (83%) cases. In case number 9, CT-based AI was slightly superior to MRI (0.81 +/− 0.04 vs. 0.8 +/− 0.05) and in case number 1, AI was the same for both modalities (0.88 +/− 0.1) (Table 2, Figure 1B). There was a small but significant difference in mean IDD between CT and MRI (3.6 mm +/− 2.3 mm vs. 3 mm +/− 1.5 mm; p = 0.017). The corresponding mean CoV for CT was higher than for MRI (61% vs. 49%; p = 0.003). The mean value of maximal IDD was 13 +/− 6 mm for CT and 10 +/− 4 mm for MRI (p = 0.06).

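Table 2. Accuracy index (AI) and inter-delineation distances (IDD), based on the expert consensus (EC) delineation as the reference. The differences in AI and IDD between CT and MRI were statistically significant (p < 0.05)

| Case | CT IDD [mm]: Mean | CT IDD [mm]: SD | CT IDD: CoV [%] | CT AI: Mean | CT AI: SD | CT AI: CoV [%] | MRI IDD [mm]: Mean | MRI IDD [mm]: SD | MRI IDD: CoV [%] | MRI AI: Mean | MRI AI: SD | MRI AI: CoV [%] |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2.1 | 0.8 | 38 | 0.88 | 0.02 | 3 | 1.8 | 0.58 | 32 | 0.88 | 0.01 | 2 |
| 2 | 3.6 | 1.9 | 53 | 0.78 | 0.1 | 12 | 3.2 | 1.9 | 59 | 0.83 | 0.02 | 3 |
| 3 | 3.9 | 2.6 | 67 | 0.67 | 0.07 | 10 | 3.2 | 1.8 | 56 | 0.78 | 0.02 | 2 |
| 4 | 2.3 | 1 | 43 | 0.75 | 0.06 | 8 | 2.3 | 0.9 | 39 | 0.77 | 0.05 | 6 |
| 5 | 6 | 4 | 67 | 0.61 | 0.15 | 24 | 4.5 | 2.1 | 47 | 0.73 | 0.14 | 20 |
| 6 | 2.6 | 1.1 | 42 | 0.82 | 0.02 | 2 | 2.6 | 1 | 38 | 0.83 | 0.02 | 2 |
| 7 | 6 | 4.1 | 68 | 0.64 | 0.09 | 14 | 3.6 | 1.8 | 50 | 0.76 | 0.05 | 7 |
| 8 | 2.4 | 1.2 | 50 | 0.85 | 0.02 | 2 | 2.2 | 1 | 45 | 0.86 | 0.04 | 4 |
| 9 | 3 | 2.9 | 97 | 0.81 | 0.04 | 5 | 2.9 | 1.8 | 62 | 0.80 | 0.05 | 6 |
| 10 | 4.5 | 3.5 | 78 | 0.77 | 0.07 | 9 | 3.3 | 2 | 61 | 0.84 | 0.02 | 2 |
| 11 | 3.3 | 2 | 61 | 0.74 | 0.09 | 12 | 2.5 | 1.3 | 52 | 0.83 | 0.02 | 3 |
| 12 | 3.4 | 2.3 | 68 | 0.77 | 0.06 | 8 | 3.6 | 1.7 | 47 | 0.80 | 0.02 | 2 |
| MEAN | 3.6 | 2.3 | 61 | 0.76 | 0.07 | 9 | 3 | 1.5 | 49 | 0.81 | 0.04 | 5 |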

CoV = Coefficient of Variance; CTVLC = Clinical Target Volume of Lumpectomy Cavity; SD = Standard Deviation

Discussion

Results of the present study rejected our null hypothesis: MRI, when compared with CT, led to (1) reduced IOV and (2) improved accuracy for CTVLC contouring in patients without markers in the tumour bed. Keeping in mind the cost and complexity of utilizing MRI for radiotherapy planning, our findings justify its use in selected cases.

CT-based delineation of the LC is prone to IOV, even among experienced radiation oncologists.7,10,11,31-34 In various tumour sites, MRI has been shown to reduce contouring uncertainties when compared with CT.35-41 Based on these findings, MRI is increasingly implemented for contouring and is the recommended gold standard in some malignancies.42 However, many studies failed to demonstrate improved contouring with the use of MRI for various tumour sites.43-47 As far as breast cancer is concerned, several authors investigated the impact of adding MRI to CT for delineation of the lumpectomy cavity, with negative or inconclusive outcomes.11,16,20-23 Den Hartogh et al. found that addition of postoperative MRI to CT guided delineation marginally increased the target volumes and failed to reduce the IOV.22 Similarly, Kirby et al. reported that addition of MRI to CT resulted in tumour bed volumes that were discordant with those based on CT and clips alone. With the use of MRI, the tumour bed volume increased in 28 out of the 30 included cases, resulting in a median CTV increase of 10.3% (−33.6% to 80.9%).20 Mast et al. compared CT- and MRI-based delineations of breast and LC by four observers in 10 patients. The mean CI for the LC was 0.52 for CT and 0.48 for CT combined with MRI (p = 0.33).23 In another similar study, the inter-observer agreement was even lower. While MRI and CT enabled similar visualization of the LC, MRI resulted in a lower generalized CI (0.32 +/− 0.25) when compared with CT (0.52 +/− 0.21).21

The rationale for using MRI in our study was to improve contouring consistency for cases without surgical clips in the tumour bed. Mean CVS on MRI (3.88 +/− 0.99) was significantly superior to CVS on CT (3.05 +/− 1.07) (p = 0.001). CVS was improved in 92% of cases and this was accompanied by an increase of CI and AI in 83% of cases. For both modalities, we found an increase of CIgen and AI with increasing CVS (Figure 2). Therefore, inter-observer concordance depended directly on the ability to visualize the lumpectomy cavity, which was superior on MRI. Of note, in all of the reports which failed to show a benefit of MRI, clips were placed at the edges of the LC.11,16,20-23 In a study by Giezen et al., four observers (2 radiologists and 2 radiation oncologists) obtained a mean CVS of 2.8 +/− 1.7 for MRI and 2.9 +/− 1.7 for CT. In contrast to our findings, Giezen et al. demonstrated superiority of CT over MRI for contouring, especially at low CVS.21 With increasing CVS values, both modalities performed better and the CIgen from MRI approached that from CT. The lack of added value of MRI in this and other published studies20-23 could be attributed to better visibility of the clips on CT, introducing a bias in its favour, as acknowledged by the authors.21 This effect becomes especially important at low CVS values. Our positive findings could also be attributed to the fact that MRI was performed as a simulation procedure, replicating the CT planning supine position as closely as achievable.

To our knowledge, there are only two publications in addition to our present study which demonstrated added value of MRI for delineation of the post-lumpectomy tumour bed.24,25 In the study by Jolicoeur et al., no surgical clips were implanted at the time of lumpectomy. Three observers delineated the post-lumpectomy tumour bed in 70 patients. Highly significant IOV was demonstrated for CT-based contouring of the tumour bed (p < 0.0001), while agreement was high for the MRI-based approach. The volumes of MRI-based contours were 30–40% smaller than the CT-derived volumes. In another study with three observers and 36 cases, mean CVS for the LC was 3.3 and 4.3 for CT and T2 MRI, respectively (p < 0.0001). Better CVS was reflected in superior inter-observer consistency and volumetric agreement of contours. The authors stated that surgical clips were occasionally, but not routinely, placed by the referring surgeons.24

Based on our results, addition of MRI to CT could be justified as a good alternative to CT alone for selected patients in whom the placement of surgical clips in the tumour bed was omitted. However, despite concerns regarding their reliability as a surrogate for the tumour bed17,18,48, placement of clips followed by CT-based contouring of the LC should currently be considered the gold standard.16 This approach has been shown to improve the accuracy of LC contouring, reduce the overall boost volume and help prevent geographical miss and underdosage of the LC.13-16,49-52 The technique of placement and the number of inserted markers differ between institutions and surgeons, however, and clip placement is even omitted in some cases. Kirwan et al. recently reported on a retrospective study of 196 cases, assessing the compliance with recommendations for clip insertion. Although recommended by the clinical guidelines, clip insertion was omitted in 56% of cases, while an additional 7% of patients had only two or fewer clips inserted. Ten of 31 referring surgeons routinely omitted clips and the omission rate was significantly higher for centres with a low (≤ 1 patient) when compared with a high (≥ 14 patients) rate of recruitment to IMRT clinical trials (67% vs. 27%, respectively; p < 0.001).19 These results emphasize the need for good collaboration between radiation oncologists and surgeons and for standardization of clip placement.9 Auditing of clip insertion has been suggested as one of the key performance indicators for quality control of breast cancer surgery.19

Based on their study, which demonstrated a reduction of IOV when adding MRI to CT, Jolicoeur et al. proposed that the use of CT-MRI fusion may obviate the need for surgical clips altogether.25 However, while a reduction of IOV indicates increased contouring agreement, it does not necessarily imply improved accuracy. To assess the accuracy, individual delineations would in theory need to be compared with the ground truth or correct delineation. In the absence of histopathological proof, the ground truth is an elusive concept. Different approaches, including simultaneous truth and performance level estimation (STAPLE), expert consensus (EC) or their combination, have been used as surrogates for correct delineation.27,53 In our current study, the concept of EC delineation was applied. Keeping in mind the limitations of the “ground truth” definition, our results indicate that adding MRI to CT improves contouring accuracy in cases without surgical clips in the LC.

Comparison of our results with the findings of other studies is challenging because of the variable conditions under which contouring was performed and the diversity of methods used for IOV assessment. The impact of variables such as experience and specialty of observers, use of guidelines, type of surgery, etc. should be kept in mind when comparing reports.16 As far as the methods for IOV assessment are concerned, CI is one of the most commonly used quantifiers. In general, CI is a measure of overlap between analyzed volumes, but there is a diversity of formalisms used in the literature which cannot be directly compared. The generalized CI (CIgen) formalism is independent of the number of delineations, enabling comparisons between studies with different numbers of observers.29 Regardless of the CI formalism used, the impact of contouring variation on CI is inversely proportional to the size of the analyzed volume. Therefore, the same absolute deviation between analyzed contours will result in a lower CI for small volumes (e.g. tumour bed) when compared with larger volumes (e.g. tumour bed with a margin). The effect of margins on CI is particularly relevant in breast cancer, where the contours are typically cropped to exclude the skin and chest wall, improving the apparent conformity between observers.
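The dependence of CI on volume size can be made concrete with a small worked example in Python. The geometry is a deliberate simplification and not taken from the study: two identical cubes offset by the same 3 mm along one axis, once with a 20 mm side (standing in for a tumour bed) and once with a 50 mm side (the same cube expanded by 15 mm on each side). All names and dimensions are hypothetical.

```python
def shifted_cube_overlap(side_mm, shift_mm):
    """Intersection over union of two equal cubes offset along one axis.

    The intersection is a box of side * side * (side - shift) and the
    union is side * side * (side + shift).
    """
    if not 0 <= shift_mm <= side_mm:
        raise ValueError("shift must lie between 0 and the cube side")
    intersection = side_mm ** 2 * (side_mm - shift_mm)
    union = side_mm ** 2 * (side_mm + shift_mm)
    return intersection / union


# Same 3 mm disagreement applied to a small and a large volume.
ci_small = shifted_cube_overlap(side_mm=20, shift_mm=3)  # 17/23
ci_large = shifted_cube_overlap(side_mm=50, shift_mm=3)  # 47/53
```

In this toy geometry the ratio reduces to (side − shift)/(side + shift), so an identical 3 mm offset yields about 0.74 for the small cube and about 0.89 for the large one, which mirrors the caution needed when comparing CI values for the tumour bed with those for margin-expanded volumes.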

In our study, mean CIgen values of 0.67 (+/− 0.12) and 0.74 (+/− 0.07) were obtained for CT- and MRI-based contouring of CTVLC, respectively. Major et al. studied the impact of contouring guidelines on the consistency of LC and planning target volume (PTV) contouring for multi-catheter partial breast irradiation. When contouring was performed on pre-implant scans by experienced observers and according to the guidelines (conditions similar to those in our study), they obtained a CIgen of 0.59 and 0.73 for LC and PTV, respectively. The margins for PTV were similar to our margins for CTVLC, making the resulting volume sizes comparable between the two studies. Of note, CIgen for PTV, obtained by CT and clip-based contouring54, was similar to our CIgen for CTVLC, obtained by MRI in patients without clips. The lower CIgen for LC when compared with PTV54 reflects the sensitivity of CIgen to the volume size, as described above. The majority of other published studies reported on contouring uncertainties for the tumour bed, with a CIgen ranging from 0.32 to 0.52.21-23 Our results compare favourably with the existing literature. This can be attributed to strict compliance with contouring guidelines, participation of experienced observers and use of high quality imaging.

The low number of observers and cases entered in the analysis can be considered the main limitation of our study. Considering the need for specific expertise in breast radiotherapy, experience in interpretation of MRI and the relative rarity of cases without clips in the LC, a higher number of observers and cases is challenging to obtain outside a multi-institutional setting. This challenge is reflected in the limited number of observers and cases in studies published by several authors before us.20-23 Multi-centre collaborative projects may represent the optimal approach to overcome this limitation and shed more light on the subject of contouring uncertainties in general.

Conclusions

In breast cancer patients without clips in the tumour bed after breast conserving surgery, MRI improved the visualization of the lumpectomy cavity when compared with CT. Consequently, inter-observer agreement and accuracy of contouring of the lumpectomy cavity clinical target volume were improved. Placement of surgical clips, followed by CT-based contouring, is the gold standard for contouring of the boost volume for postoperative irradiation in breast cancer. However, in patients without clips, addition of MRI to CT simulator images should be considered to improve delineation accuracy. Further studies with a higher number of observers and cases are required to confirm our findings.

Declarations

The study protocol was reviewed and given ethical approval by the Institutional Medical Corporation Medical Research Centre. Datasets generated and analysed during the study are not publicly available due to patient confidentiality but are available from the corresponding author on reasonable request and after institutional approval. This study was not funded.

eISSN:
1581-3207
Language:
English
Publication timeframe:
4 times per year
Journal Subjects:
Medicine, Clinical Medicine, Radiology, Internal Medicine, Haematology, Oncology