Innovative Applications of O.R.Evaluation of breast cancer mammography screening policies considering adherence behavior
Introduction
Breast cancer is the most common cancer worldwide in women contributing more than 25 percent of the total number of cancer cases diagnosed in 2012 (World Cancer Research Fund International, 2014). It is also the most common non-skin cancer affecting women in the United States. According to the American Cancer Society, about one in eight US women will develop breast cancer in her lifetime. In 2015, an estimated 231,840 invasive breast cancer cases will be diagnosed, and about 40,290 will die from breast cancer (American Cancer Society, 2015).
Mammography plays an important role in detection of breast cancer when the cancer is in its early stage. It is known to be an effective method and the current standard practice for early diagnosis of breast cancer. On average, mammography can detect breast cancer 1.7 years before a woman can feel a lump in her breast (Center for Disease Control and Prevention, 2013). In addition, it can reduce breast cancer mortality rate by a reasonable estimate of 15 percent (Gøtzsche & Nielsen, 2009).
However, mammography screening recommendations have been the subject of significant debate in recent years. There are varying screening guidelines from different organizations about when to start and end mammograms and how frequently a woman should undergo mammography screenings. For example, the American Cancer Society (ACS), the department of Health and Human Services (HHS), the American Medical Association (AMA), and the American College of Radiology (ACR) recommend screening mammography every year, beginning at age 40. In 2009, the U.S. Preventive Services Task Force (USPSTF) issued revised screening mammography guidelines, which resulted in a significant controversy. According to the USPSTF guidelines, screening mammograms should be done every two years between age 50 and 75 for women at average risk of breast cancer (The US Preventive Services Task Force, 2013).
Although mammography is known to be one of the most effective methods of detecting breast cancer, there are many concerns on its adverse effects on women's quality of life. Unreliability of mammography, i.e., false negative and false positive results and over-diagnosis are among the deficiencies of mammography (Epstein, Bertell, & Seaman, 2001). A prospective cohort study of 7 mammography registries of the Breast Cancer Surveillance Consortium (BCSC) showed that the adjusted sensitivity increased with age from 68.6 percent in women ages 40–44 to 83.3 percent in women ages 80–89 (Breast Cancer Surveillance Consortium, 2013). Similarly, the specificity rate increased from 88.2 percent for women aged 40 through 44 years to 93.4 percent in women older than 75 (Carney et al., 2003). Moreover, about one-third of all aggressive cancers are diagnosed in the interval between successive annual mammograms (Epstein, Steinman, & LeVert, 1998). These cancers are known as interval cancers and are defined as cancers detected within 12 months after a negative mammogram (Burhenne et al., 1994). Some of the interval cancers are present at the time of mammography screening (false-negatives) while others grow rapidly in the interval between a mammogram and detection (Hébert-Croteau, Théberge, Langlois, Major, & Brisson, 2005). Generally, interval cancers grow rapidly and are frequently diagnosed at advanced stages (Caumo et al., 2010). In addition, studies have shown that receiving mammograms increases a woman's chance of developing breast cancer due to exposure to radiation (Epstein, Bertell and Seaman, 2001, Yaffe and Mainprize, 2011, Gøtzsche and Nielsen, 2009). Each radiation-absorbed dose (rad) of exposure increases breast cancer risk by 1 percent (Epstein et al., 2001). Moreover, Bleyer and Welch (2012) also found that up to a third of diagnosed breast cancers are overdiagnosed cases and do not need treatment.
Various studies have been conducted to evaluate and compare different screening policies or to identify optimal policies. Kirch and Klein (1974) developed an inspection strategy for the detection of an age-dependent disease with the objective of minimizing detection delay. To illustrate their methodology, they developed optimal schedules for breast cancer examinations. Shwartz (1978) developed a mathematical model to evaluate life expectancy gain associated with different screening policies under different assumptions about the rate of disease progression, the characteristics of the screening technique, and the frequency of screenings. Parmigiani (1993) presented a continuous time non-Markovian stochastic process model of disease progression to analyze which age groups and what part of the population should undergo breast cancer screening, while minimizing the total expected loss or risk including financial costs, side effects, wasted time, stress due to false-negative test results, etc. Zelen (1993) formulated a screening scheduling problem to maximize a weighted utility function in a continuous time setting. In Zelen's model all the parameters were assumed to be stationary except for the incidence rate. Baker (1998) used a mathematical parametric model to assess different screening policies in terms of minimizing the cost of cancer plus the cost of carrying out any screening. Baker's model was developed based on a small data set used to estimate the model parameters, and compares a small set of cost optimal policies under different sets of constraints. Maillart, Ivy, Ransom, and Diehl (2008) formulated a partially observable Markov chain model to evaluate a wide range of dynamic mammography screening policies as well as current practices. They compared different policies in terms of the resulting lifetime breast cancer mortality risk and the expected number of mammograms women should undergo for each policy and generated a frontier of efficient policies. In their formulation, they considered age-based dynamics of breast cancer (i.e., increasing incidence, decreasing aggression). They also incorporated the imperfect nature of the screening outcomes and dynamics of test result accuracy (increasing sensitivity and specificity rates with age). Ayer, Alagoz, and Stout (2011) developed a finite-horizon partially observable Markov decision process (POMDP) model to determine the optimal personalized mammography screening strategy based on personal risk characteristics of women such as their prior screening history. Ahern, Cheng, and Shen (2011) developed a mathematical model to optimize cancer screening schedules, taking into account the trade-off between the benefits and costs of screenings in their proposed utility function. They considered two different optimization frameworks: optimize the number of screening examinations with equal screening intervals between exams but without a pre-fixed total cost; and optimize the ages at which screening should be given for a fixed total cost and prove that the optimal solution exists under each of the two frameworks. Li, Zhu, Klein, and Kong (2014) developed a finite-horizon discrete-time partially observable Markov chain model to assess colonoscopy screening strategies in terms of the expected cumulative quality-adjusted life years (QALYs) and the expected cumulative cost. They also calculated the incremental cost-effectiveness ratio for different screening policies.
However, none of the abovementioned studies has taken patient behavior into consideration. They all assume that patients adhere to the guidelines perfectly and undergo the prescribed screening mammograms. This is also true for the current screening recommendations which are based on the assumption of 100 percent adherence to the guidelines. However, not all women have the same attitudes toward breast cancer screening. A recent study by the Centers for Disease Control and Prevention (CDC) revealed that about 67.1 percent of women aged 40 and older had one mammogram between 2008 and 2010 (Center for Disease Control and Prevention, 2014). In another longitudinally data analysis of the BCSC data registry, more than 75 percent of women have had less than five mammograms in the 14-year interval, between 1996 and 2009, which implies a low compliance with the current guidelines (Breast Cancer Surveillance Consortium, 2013).
There are a limited number of studies taking individual adherence behavior into consideration. Brailsford, Harper, and Sykes (2012) used a three-phase discrete event simulation to model breast cancer and screening policies incorporating women's behavioral factors in their model. They assigned behavioral attributes to each simulated woman to control her compliance with the prescribed mammograms in their model. They compared a limited number of screening policies, including the current UK policy, in terms of the number of screen detected cancers, and life yeas saved. In another study, Ayer, Alagoz, Stout, and Burnside (2015) analyzed the role of behavioral heterogeneity in women's adherence on optimal mammography screening recommendations.
In this research, incorporating women's adherence behavior to mammography recommendations, a mathematical framework is proposed to evaluate and compare various screening policies in terms of QALYs and the lifetime mortality risk of breast cancer. Our study is different from the two studies discussed above in the approach of incorporating patient adherence to the screening tests. In contrast to earlier studies, we allow uncertainty in a patient's adherence probabilities in the model. Adherence is also a function of the length of the interval between two subsequent screenings. Moreover, assuming a patient's adherence behavior is related to and can be estimated from her perception/planning toward mammography, not only can we evaluate and compare screening policies for a specific patient knowing her personal characteristics, but we can identify efficient policies for the general population as well. In addition, we incorporate in our model the possibility of interval cancer detection, and the increased risk of developing breast cancer due to exposure to X-ray radiation during a mammography test.
The remainder of this paper is organized as follows. In Section 2, the proposed model is presented. In Section 3, model inputs and parameters estimation, and computational results are presented. Finally, Section 4 summarizes the findings and discusses future work.
Section snippets
Model formulation
A randomized discrete-time finite-horizon partially observable Markov chain model is developed to evaluate various mammography screening policies in terms of the expected QALYs and lifetime breast cancer mortality risk. This problem is formulated as a partially observable Markov chain because the women's true health states are not outwardly observable due to the imperfect nature of mammography screening tests.
The model takes into account two methods of detection: screening mammography and
Numerical studies
A wide range of routine screening policies and two-phase policies (with changing screening intervals) are evaluated in this section. Policies are defined as a quintuple (starting age, first screening interval, switching age, second screening interval, stopping age). For example, policy (40,1,50,2,80) represents a policy that recommends women start getting mammograms at age 40 and undergo the screening test every year up to age 50 and then undergo mammogram every 2 years up to age 80. Another
Analyses on QALYs
Table 4 presents the top five policies for the three different adherence cases in terms of QALYs. It also presents the associated lifetime breast cancer mortality risk for each policy.
The best policies for both Case 1 and the general population include the ACS policy. However, Case 1 has lower QALYs and higher breast cancer mortality risk comparing to the general population due to lower adherence probabilities. The best policies for the perfectly adherent case, however, does not include the
Conclusion
Current screening mammography guidelines assume that women's compliance with the recommendations is perfect, i.e., women undergo their mammograms as prescribed by their physicians/health providers. However, this is not the case in reality. Women skip mammograms for different reasons. This study investigates the effect of imperfect adherence for a wide range of screening mammography policies. Two different methods of breast cancer detection are considered: mammography examination and
References (46)
- et al.
Incorporating human behavior in simulation models of screening for breast cancer
European Journal of Operational Research
(2012) - et al.
Using a partially observable Markov chain model to assess colonoscopy screening strategies—A cohort study
European Journal of Operational Research
(2014) - et al.
Risk-specific optimal cancer screening schedules: An application to breast cancer early detection
Statistics in Bioscience
(2011) - American Cancer Society, 2015. http://www.cancer.org/cancer/breastcancer/detailedguide/breast-cancer-key-statistics...
- et al.
National vital statistics reports
(2012) - et al.
A POMDP approach to personalize mammography screening decisions
Operations Research
(2011) - Ayer, T., Alagoz, O., Stout, N.K., Burnside E.S. (2015). Heterogeneity in women's adherence and its role on optimal...
Use of a mathematical model to evaluate breast cancer screening policy
Health Care Management Science
(1998)Should women be routinely taught breast self-examination to screen for breast cancer?
Canadian Medical Association Journal
(2001)- et al.
Effect of three decades of screening mammography on breast-cancer incidence
New England Journal of Medicine
(2012)
Interval breast cancers in the Screening Mammography Program of British Columbia: Analysis and classification
AJR. American Journal of Roentgenology
Individual and combined effects of age, breast density, and hormone replacement therapy use on the accuracy of screening mammography
Annals of Internal Medicine
Interval cancers in breast cancer screening: Comparison of stage and biological characteristics with screen-detected cancers or incident cancers in the absence of screening
Tumori
A Markov chain method to estimate the tumour progression rate from preclinical to clinical phase, sensitivity and positive predictive value for mammography in breast cancer screening
Statistician
Dangers and unreliability of mammography: Breast examination is a safe, effective, and practical alternative
International Journal of Health Services
The breast cancer prevention program
Interval cancer and survival in a randomized breast cancer screening trial in Stockholm
Breast Cancer Research and Treatment
Weighing the risks and benefits of tamoxifen treatment for preventing breast cancer
Journal of the National Cancer Institute
Cited by (19)
Patient adherence in healthcare operations: A narrative review
2024, Socio-Economic Planning SciencesPartially observable collaborative model for optimizing personalized treatment selection
2023, European Journal of Operational ResearchDerivative-free optimization of combinatorial problems – A case study in colorectal cancer screening
2021, Computers and Chemical EngineeringCitation Excerpt :Simulation optimization searches for the optimal screening strategy using Derivative-Free Optimization (DFO) algorithms, with the output of a microsimulation as the objective function. The utilization of POMDPs for screening strategy determination have been used for numerous forms of cancer in addition to CRC, such as cervical (Namén León, 2015), stomach (Saumoy et al., 2018), prostate (Zhang and Denton, 2018), breast (Cevik et al., 2018; Madadi et al., 2015; Molani et al., 2019), liver (Lee et al., 2019), and lung (Petousis et al., 2019) cancer. As the literature available for the use of POMDPs is vast when considering the general topic of cancer screening, we focus our review on POMDP studies specific to CRC (Erenay et al., 2014; Li et al., 2015).
A partially observable Markov chain framework to estimate overdiagnosis risk in breast cancer screening: Incorporating uncertainty in patients adherence behaviors
2019, Omega (United Kingdom)Citation Excerpt :Please note that incorporation of CBE and BSE does not affect the mammography-induced overdiagnosis risk estimation, which is the focus of this study. This estimate is also comparable with 3.57% estimate reported by Maillart et al. [29], and 2.56% reported by Madadi et al. [27]. In this section, we present overdiagnosis analysis results for in-practice and some alternative policies.
Investigating the effectiveness of supplemental breast cancer screening tests considering radiologists’ bias
2023, IISE Transactions on Healthcare Systems Engineering