This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
Introduction
In this paper, we shall focus on topics where methods from optimal control theory will be applied to biomedical problems arising from cancer treatments. Specifically, we will focus on optimal control problems for systems of differential equations. Optimal control for partial differential equations, stochastic differential equations, discrete difference equations and so on are not considered in this document.
There exists a vast literature in which optimal control methods are applied to cancer treatment. The early literature on applications of optimal control to cancer treatment is almost exclusively on cancer chemotherapy (see references [14, 32, 45]). Much of this work is on cell-cycle nonspecific mathematical models and focusing on different growth models [32]. The monograph [31] by Martin and Teo is entirely devoted to such models, but including increasingly more comprehensive medical conditions such as single and multi-drug resistance.
Later, a greater emphasis in research has been put on cell-cycle specific models. In reference [10], the authors considered phase-specific administration of cytotoxic agents to increase the selectivity of therapy. Reference [46] models the proliferation cycle of leukemia. Eisen [14] also gives some introductory discussion of cell-cycle specific chemotherapy and compartmental models (distinguishing between proliferating and quiescent cells), but mostly in the context of pharmacokinetics and clearly the emphasis is more on modeling than on analysis.
After that, a great effort was done to study the development and analysis of compartmental models which describe and analyze the actions of drugs on specific compartments from a control theory point of view [49, 50, 51]. In these models, the emphasis is on the cancerous cells and side effects are only measured indirectly through the total dose of drugs administered.
Thus, from the mathematical point of view, the modeling of chemotherapy for cancer using the optimal control theory has 40 years of history. Different frameworks have been considered in the literature, depending on the study approach, from macroscopic levels [45] to microscopic levels [25,26], including systems of differential equations (v. [9,11,16,40] and their references) and equations in partial derivatives (v. [45] and their references). Normally, the control variable represents the dose of the drug and the state variables represent the tumor cells. In general, the main objective is to determine the optimal application of the drugs that minimizes the number of tumor cells at the end of treatment, while maintaining the toxic effects of the drug at an acceptable level. To reduce the toxicity of the anticancer drug, which also affects normal cells, it is customary to add some control restrictions, isoperimetric or state. There are different types of restrictions depending on the assumptions of the model, mainly punctual or integral. For example, a maximum rate is prescribed at which the chemotherapeutic drug is administered at each time or a limit is imposed on the total dose of the drug in the therapy interval (see [9, 12, 27]).
On the other hand, mathematical modeling of tumor growth in general is very important when it comes to having a model that fits the biology of the problem [35]. There are many mathematical models that deal with the different aspects of the growth of a tumor. They use ordinary differential equations [2, 6, 13, 54], partial differential equations [1, 3, 34], stochastic processes [38], game theory [52], etc.
In this review, we focus on 4 applications of the control theory to the growth of tumors:
The first is referred to the application of the theory of optimal control to compartmental models.
The second deals with the theory of optimal control of brain tumors.
The third deals with a topic that is becoming more and more important: the resistance in tumors to different treatments and, therefore, to optimize such kind of treatments.
The latest studies the application of optimal control to antiangiogenic and chemotherapeutic treatments.
Section 2 shows several compartmental models, which have been useful in the understanding of the application of optimal control theory to the growth of tumors.
Another topic of great importance is the study of the administration of chemotherapy in brain tumors [6, 20, 37]. Thus, in this review we study a model developed in the reference [40] based on a work by Ribba [37]. This revision can be found in Section 3.
Next topic is the drug administration in presence of multi-drug resistance cells. There exists a vast literature on optimal administration of chemotherapeutical drugs in the presence of drug resistance [7, 8, 18, 26, 33]. A model of sensitive and resistant cells to the treatment can be found in Section 4.
Optimal control is useful as a methodology in response to mathematical models for novel cancer treatments as anti-angiogenic treatments [27, 39]. A mathematical model with anti-angiogenic and chemotherapeutical treatment can be found in Section 5.
Finally, the review ends with some Conclusions in Section 6.
Cell-cycle specific cancer chemotherapy
In this section, we analyze a set of differential equations for a class of cell-cycle specific compartmental models for cancer chemotherapy.
Each cell goes through a sequence of phases from birth to cell division: G1→G0→S→G2→M. While the simplest mathematical models for the treatment of cancer with chemotherapy, applying the theory of optimal control, treat the complete cycle as a compartment, the multicompartmental models group the phases of the cell cycle with the purpose of representing the action of the different types of chemotherapeutic agents used. Cytotoxic agents generally act in the phase G2/M because the cell walls become thin and porous in mitosis M and, therefore, the cell is more vulnerable to attack. The cytostatic agents aim to synchronize the transitions of cells through the cell cycle causing a brief and invisible inhibition of the DNA synthesis in the S phase and the retention of the cells in the G1 phase. Recruitment agents cause the cells to leave the inactive G0 compartment where they are not vulnerable to attacks by any chemotherapeutic drug. This classification is general, being able to find agents that have more than one effect depending on their concentration [43]. Depending on the specific type of tumor that is considered, and the drugs that are administered, an appropriate grouping of the phases of the cell cycle is carried out.
A 2– compartment model with a cytotoxic agent
This model deals with the application of a single cytotoxic agent that is activated in the G2/M phase of the cell cycle. Taking into account the sensitivity of the drug to the phase, the cell cycle is divided into two compartments, one combining the second growth phase G2 and the mitosis M and the other formed by the rest of the phases. The state of the system is described by a two-dimensional vector where N1(t) represents the mean number of cancer cells in the first compartment at time t (composed of the phases G0, G1 and S) and N2(t) the average number of carcinogenic cells in the second compartment at time t (composed of G2 and M). The equation for the second compartment has the form
Now, we consider a cytotoxic agent whose concentration in blood is represented by the function u acting in the G2/M phase. It is assumed that all cells are sensitive to treatment (homogeneous population). According to the log-kill hypothesis, it is assumed that the concentration of the drug u(t) kills a fraction of the output flow a2N2(t) of cells of the G2/M compartment and, therefore, the number of cells dead is given by fu(t)a2N2(t), ϕ being a parameter of constant chemotherapeutic death. The control set is a compact interval [0,umax], with umax the maximum dose rate or the maximum concentration. In the model, the u control always appears together with the constant ϕ and so, in order to keep the number of free parameters as low as possible, it is combined with the maximum dose rate in an amount represented as umax under the assumption that umax≤ 1. If the concentration is high enough, then really umax = 1 is realistic: almost all the cancer cells in that compartment can be killed. The cells that die in G2/ M leave this compartment, that is, they are counted as outputs from the second compartment, but they no longer enter the first compartment. In this sense, it is considered to kill the cell even if apoptosis is not induced. The remaining fraction (1-u)a2N2 suffers cell division and therefore the controlled mathematical model becomes
where T is the final fixed time, r = (r1,r2) is a positive vector of weights and q = (q1,q2) is a vector row of weights not negative. The penalty term rN(T) = r1N1(T)+r2N2(T) represents an average weight of the total number of cancer cells at the end of treatment and the term qN(t) = q1N1(t)+q2N2(t) measures the volume of the tumor during treatment. Side effects of the treatment are only indirectly included in the model through the minimization of the total dose, $\int^{T}_{0}u(t)dt$. The positive coefficient s in this integral is redundant since it could be absorbed by the other weights. Recall that the number of cancer cells that do not undergo cell division at time t and that are consider eliminated is given by u(t)a2N2, that is, u(t) is proportional to the fraction of inefficient divisions. Since the drug kills healthy cells at a similar rate, the integral $\int^{T}_{0}u(t)dt$ represents the cumulative negative effects of the treatment on the normal tissue or its toxicity. The aim is to minimize tumor volume in both compartments at the end of treatment, but taking into account the tumor volume in both compartments during treatment, while minimizing side effects. Thus, the following optimal control problem arises:
Problem 1
For a fixed time T > 0, minimize J subject to the system of equations
for all measurable functions Lebesgue (respectively, piecewise continuous) u : [0,T] → [0,umax] with A and B given by equation (6), [41, 47].
The necessary conditions for optimality for Problem 1 are given by the Pontryagin maximum principle [28, 36, 42]. The main necessary conditions for optimality is the statement that optimal controls minimize (respectively, maximize) the Hamiltonian function
over the control set [0,umax] pointwise along the optimal controlled trajectory (N*,u*) and the multipliers l.
The conditions of the Pontryagin maximum principle for our optimal control problem are
Theorem 1
If u* is an optimal control with corresponding trajectory N* then there exist a continuous function λ : [0,T]→ (ℝ2)* called the adjoint function, such the following conditions are satisfied:
Nontriviality: λ(t) ≠ 0 for all t ϵ [0,T].
Adjoint equations and transversality conditions: the multiplier λ is a solution to the terminal value problem
Minimum condition: the optimal control minimizes the Hamiltonian H pointwise over the control set [0,umax] along the optimal controlled trajectory and the multiplier λ(t), i. e,
For control-affine problems as is our case, the best candidates for optimal controls turn out to be finite concatenations of bang and singular control. The precise concatenation sequences need to be determined through an analysis of the switching function. Then, it is straightforward, although somewhat complicated and tedious, to calculate the singular control [22]. It is given by
However, singular controls are not necessarily minimizing, but they can also be maximizing. Now, if we use the Legendre-Clebsch condition, a higher-order necessary condition for optimality of singular controls [42], allows us to distinguish between these two classes. For the case of a minimizing control u, the Legendre-Clebsch condition states that
$$\begin{equation}\frac{\partial}{\partial u}\frac{d^2}{dt^2}\frac{\partial H}{\partial u}(\lambda(t),N_*(t),u_*(t))\leq 0,\quad \forall t\in I. \end{equation}$$
In our problem, such Legendre-Clebsch condition becomes
Hence, the Legendre-Clebsch condition for minimality of a singular control is violated. Thus, singular controls are locally maximizing for this problem. This brings us to the next result
Theorem 2
If (N*,u*) is an optimal controlled trajectory for problem 1, then there does not exist an interval on which the control u* is singular.
We show some examples of extremal bang-bang controls that have been computed using the values in Table 1, which are shown in Figure 2.
Numerical values for the coefficients and parameters used in numerical computations for the optimal control problem 1, extracted from [41].
Coefficient
Interpretation
Numerical value
a1
Inverse transit time through G0/G1+ S
0.197
a2
Inverse transit time through G2/M
0.356
N10
Initial condition for N1
0.7012
N20
Initial condition for N2
0.2988
umax
Maximum dose rate/concentration effectiveness of the drug
0.90
s
Penalty/weight for the total dose of cytotoxic agent
0.50
q1
Penalty/weight in the objective for the average number of cancer cells in G0/G1+ S during therapy
0.10
q2
Penalty/weight in the objective for the average number of cancer cells in G2/M during therapy
0.10
r1
Penalty/weight in the objective for the average number of cancer cells in G0/G1+ S at the end of therapy
3
r2
Penalty/weight in the objective for the average number of cancer cells in G2/M at the end of therapy
3
T
Therapy horizon
T = 7,21,60
A 3– compartment model with a cytotoxic and recruiting agent
This model was originally formulated by Swierniak et al. [47], where a cytotoxic agent, which can act in S or G2/M phases, is combined with a recruitment agent, which attracts inactive cells in the G0 compartment to re-enter the active cell cycle and thus be attacked by the cytotoxic agent. One of the biggest problems in the chemotherapeutic treatment of some leukemias is a large residue of inactive cells in the G0 phase, which are not sensitive to most of the cytotoxic agents. This problem has also been reported in breast and ovarian cancer. In this model, three compartments are considered: G0, G1 and S +G2/M. The state space is thus the first orthant P in R3. The variables N0, N1 and N2 represent the states. The newborn cells initiate the process of cell division, but they can remain dormant and remain in the quiescent stage G0. Let p0 and p1 be positive numbers, with p0+ p1 =1, representing the probabilities that the daughter cells will pass through the first growth phase without obstacles (p1) or fall into the state of quiescence (p0). A recruiting agent is applied to reduce the average transit time through the G0 compartment, resulting in a higher outflow from G0. If w represents the concentration of the recruitment agent, it is assumed that the outflow is increased by a factor of 1 +w, 0 ≤ w ≤ wmax, where w = 0 represents that no drug is being applied and wmax corresponds to a full dose treatment. A cytotoxic agent u, 0 ≤ u ≤ umax, is applied in the third compartment, where umax corresponds to the maximum dose (see Figure 3 ). The combination of these drugs with the standard characteristics of cell division and cell death (and under the log-kill hypothesis), results in a bilinear control system, of the form
As in the 2– compartment model considered earlier, the ai are positive coefficients related to the inverse transit times of the cancer cells through the compartments.
Thus, we considere the following optimal control problem:
Problem 2
For a fixed therapy horizon [0,T], minimize the objective
$$J=rN(T)+\int_{0}^{T} qN(t)+s_1u(t)+s_2w(t) dt$$
over all Lebesgue-measurable (respectively, piecewise continuous) functions$(u,w):[0,T]\to[0,u_{max}]\times [0,w_{max}], u_{max}\neq \frac{1}{2}$,subject to equations [47]
The following theorem shows that the optimal control for the cytotoxic agent u can not be singular:
Theorem 3
Suppose (N*,u*,w*) is an optimal controlled trajectory for problem 2. Then there does not exist an interval on which the control u* is singular, i.e., the cytotoxic agent is not given at partial dose rates/concentrations.
The following theorem establishes what the behavior of the recruitment agent w should be:
Theorem 4
Let (N*,u*,w*) be an optimal controlled trajectory for problem 2. If the weight s2in the Lagrangian for the recruiting agent w is zero, s2 = 0 then there does not exist an interval on which the control w* is singular. However, if this coefficient is positive, s2> 0, then the Legendre-Clebsch condition for optimality of a singular control w is satisfied.
The proof of this theorem can be found again in [47].
For small positive weights s2, and this is the important case for this model, bang-bang controls are optimal. Figure 4 shows the graphs of such controls u and w and their corresponding trajectories when the weight s2 in the Lagrangian is chosen as s2 = 0 and s2 = 0:1 for parameter values specified in Table 2.
Numerical values for the coefficients and parameters used in numerical computations for the optimal control problem 2 with cytotoxic and recruitment agents, extracted from [41].
Coefficient
Interpretation
Numerical value
a0
Inverse transit time through G0
0.05
a1
Inverse transit time through G1
0.5
a2
Inverse transit time through S +G2/M
1
p0
Probability that cells enter G0
0.9
p1 = 1– p0
Probability that cells enter G1
0.1
N0(0)
Initial condition for N0
0.8589
N1(0)
Initial condition for N1
0.0954
N2(0)
Initial condition for N2
0.0456
umax
Maximum dose rate/concentration effectiveness
0.95
wmax
Maximum dose rate/concentration effectiveness
6
s1
Penalty/weight at the cytotoxic agent
1
s2
Penalty/weight at the recruiting agent
0, 0.1
q0
Penalty/weight in the objective for the average number of cancer cells in G0 during therapy
3
q1
Penalty/weight in the objective for the average number of cancer cells in G1 during therapy
1
q2
Penalty/weight in the objective for the average number of cancer cells in S +G2/M during therapy
1
r0
Penalty/weight in the objective for the average number of cancer cells in G0 at the end of therapy
3
r1
Penalty/weight in the objective for the average number of cancer cells in G1 at the end of therapy
1
r2
Penalty/weight in the objective for the average number of cancer cells in S +G2/M at the end of therapy
1
T
Therapy horizon
T = 21
A 3– compartment model with a cytotoxic and blocking agent
In this model the combination of two drugs is considered: a cytotoxic agent u, which is responsible for killing tumor cells and acts in the G2/M phase, and a blocking agent v (cytostatic), which slows the development of tumor cells in the S phase, preventing them from passing into the cell division phase [48]. Therefore, three compartments are considered: the first formed by the phases G0 and G1, the second by the S phase and the third is formed by the phases G2 and M. The space of states is the first orthant P in ℝ3, being Ni(t), i = 1,2,3, the number of cancer cells in the i-th compartment at time t.
As a result, the flow of tumor cells from the second compartment to the third is reduced by a factor given by v(t), from a value of a2N2(t) to (1-v(t))a2N2(t), where 0 ≤ v(t) ≤ vmax< 1. The value v(t) = 0 represents that the treatment is not being applied, while vmax represents a complete dose. The control u represents the dose of cytotoxic agent administered with u = 0 indicating the absence of treatment and u = 1 the maximum dose (see Figure 5). Under all assumed conditions for the previous compartmental models (homogenous tumor population, constituted by cells sensitive to therapy and under the log-kill hypothesis), we get back to a bilinear system again of the form (18), but now the matrices involved have the form
respectively. Once again, we analyze the optimality properties of singular controls, starting with the cytostatic agent [47].
Theorem 5
If (N*,u*,v*) is an optimal controlled trajectory for problem 3, then there does not exist an interval on which the control v* is singular.
For this model we can no longer assert in general that the cytotoxic agent u can not be singular, but need to impose an inequality relation on the inverse transit times in the cell cycle and the maximum reduction vmax that the cytostatic agent can achieve.
If (N*,u*,v*) is an optimal controlled trajectory for problem 3, there also does not exist an interval on which the control u* is singular.
We close this section with some numerical experiments of locally optimal controls for problem 3. The parameter values are given in Table 3 and condition (25) is satisfied. Figure 6 shows two different scenarios.
Numerical values for the coefficients and parameters used in computations for the optimal control problem 3 with cytostatic and cytotoxic agents, extracted from [41].
Coefficient
Interpretation
Numerical value
a1
Inverse transit time through G1/G0
0.197
a2
Inverse transit time through S
0.395
a3
Inverse transit time through G2/M
0.107
N1(0)
Initial condition for N1
0.3866
N2(0)
Initial condition for N2
0.1722
N3(0)
Initial condition for N3
0.4412
umax
Maximum dosage/concentration/effectiveness of cytotoxic agent
0.95
vmax
Maximum blocking effect of cytostatic agent
0.30
s1
Penalty/weight at the cytotoxic agent
1
s2
Penalty/weight at the cytostatic agent
0.01
q1
Penalty/weight in the objective for the average number of cancer cells in G1/G0 during therapy
1, resp. 0.1
q2
Penalty/weight in the objective for the average number of cancer cells in S during therapy
1, resp. 0.1
q3
Penalty/weight in the objective for the average number of cancer cells in G2/M during therapy
1, resp. 0.1
r1
Penalty/weight in the objective for the average number of cancer cells in G1/G0 at the end of therapy
1, resp. 8.25
r2
Penalty/weight in the objective for the average number of cancer cells in S at the end of therapy
1, resp. 8.25
r3
Penalty/weight in the objective for the average number of cancer cells in G2/M at the end of therapy
1, resp. 8.25
T
Therapy horizon
T = 21
Low grade glioma and chemotherapy
In this section, we consider a low grade glioma (LGG), which is a term denoting a set of primary progressive brain tumor of astrocytic or oligodendroglial origin characterized by slow and continuous growth preceding the anaplastic transformation [30]. Such model was considered by Ribba and colaborators [37] and assumes that a LGG is composed of proliferative P and nonproliferative quiescent tissue Q. The chemotherapy treatment u(t) eliminates proliferative cells and the quiescent cells are also affected by the treatment and become damaged quiescent cells Qp. Damaged quiescent cells, when re-entering the cell cycle, be able to repair their DNA and become proliferative once again or can die because of unrepaired damages. For simplicity, here it is assumed that the dosage, the concentration and the effect of the drug are equal. Thus, it is assumed that the drug kills a proportion of u of proliferating cells, γuP, and damages a proportion of u of quiescent cells, γuQ. Therefore, the system of differential equations that describes the proliferative, non-proliferative and damaged cells and the effect of chemotherapy on these populations is given by [40]
where w is a weight parameter. We obtain two different functionals with biological relevance for w = 0 and w = 1. Thus, for w = 0 we obtain a functional in the form of Mayer that focuses on the tumor cells at the end of the treatment,
$$J_0(u)=P(T)+Q(T),$$
while for w =1 we obtain a functional in the form of Lagrange that takes into account the tumor cells throughout the dynamic process:
$$J_1(u)=\int_{0}^{T}P(t)+Q(t)dt.$$
In addition, we introduce a restriction for additional control by imposing a limit on the total dose of the drug:
$$\begin{equation}\label{restr1} \int_{0}^{T}u(t)dt\leq M. \end{equation}$$
Thus, we have the following optimal control problem:
Problem 4
For a fixed time T > 0, minimize Jw(u) subject to the system of equations (26) for all measurable functions Lebesgue (respectively, piecewise continuous) u : [0,T]→[0,umax] and satisfying the constraint (30).
First of all, existence of optimal controls are proved:
Theorem 7
Suppose that the initial conditions of the control system (26) satisfy P(0) = P0> 0, Q(0) = Q0> 0, Qp(0)=0 and N (0)=P0+ Q0+Qp0< K. Then there is an optimal control for the function Jw(u), 0 ≤w ≤1, in (29).
The proof of this theorem can be found in [40]. On the other hand, system (26) can be written in a matrix form as
The control is not determined a priori by the minimum condition when Φ(t) = 0. Thus, we consider two cases:
If the switching function Φ(t) has only a finite number of zeros in an interval Ib ⊂ [0,T], the control u* is called bang-bang over Ib. In this case we have u*(t) ϵ {0,uM} for all t ϵ Ib.
If Φ(t) ≡ 0 is maintained over a time interval Is = [t1,t2], 0 ≤t1< t2≤ T, we say that u* is singular about Is.
For the optimal control problems that are linear in the control, it is typical that the optimal solutions are concatenations of bang-bang and singular controls, but this does not derive directly from the necessary conditions. In principle, since the controls are measurable Lebesgue, more degenerate situations can not be excluded a priori.
As in the previous section, we can use the strict Legendre-Clebsch condition as a necessary condition for the existence of singular controls. Thus, if we assume that the strict Legendre-Clebsch condition is satisfied
the relation $\frac{d^2}{dt^2}\phi(t)=0$gives rise to the following formula for singular control in terms of the state variable x and the adjoint variable λ:
We solve the optimal control problem 4 using AMPL software [19], which is linked to the IPOPT Interior Point optimization code. We use a high number N 2 [3000,10000] of grid points and implement the trapezoidal integration scheme [5].
The objective of this section is to calculate a standard chemotherapy protocol for a LGG, where the total amount of TMZ dose is M = 5 and the treatment time is T = 1 (30 days). We use the parameters of Table 4 and the initial conditions
Values of the biological parameters in the model of LGG evolution.
Variable
Value (Units)
P0
0.924 mm
Q0
42.3 mm
Qp0
0 mm
K
100 mm
cp
0.114 mo -1
cPQ
0.0226 mo-1
cQP
0.0045 mo-1
δQP
0.0214 mo-1
γ
0.842
$$P(0)=0.924, \quad Q(0)=42.3, \quad Q_p(0)=0.$$
w = 0. The state variables, the optimal control and the change function are shown in Figure 7. The optimal treatment consists of DMT therapy. Initially, the dose level is zero and then jump to the maximum tolerated dose (DMT) for 5 days. Initially, the proliferative and quiescent cells are growing, but after applying the chemotherapy, both types of cells are drastically decreasing until the end of the treatment period. We obtain the objective value and terminal states
w =1. Figure 8 shows the optimal control and the state variables for the weight w =1 in the objective (29). The optimal treatment is based on DMT therapy, but now, in contrast to the solution for w = 0, the initial dose is the maximum dose until the total dose M = 5 is reached. Subsequently, no dose is administered. We get the following final value for the objective and the state variables
Therefore, for w ≠ ws all controls are bang-bang with a single switching that occurs at the time when the total amount of dose, M =5, has been administered. But the structure of the control "switch" in the weight ws = 0:004067 being, for this value, a totally singular optimal control.
The singular control using is obtained in (36). On the other hand, we can obtain the change times t1 = 20:1 and t2 = 25:2. The control and state variables are shown in Figure 10. We get the following final value for the objective and the state variables
The switching times for are t1 =3:00 and t2 =18:4 showing a fairly long singular arc. We get the following final value for the objective and the state variables
Figure 11 shows the state variable and the optimal control.
Cancer chemotherapy and drug resistance
Resistance to drugs is one of the most important problems in the treatment of cancer, causing a high number of deaths. There are several probabilistic models for the development of drug resistance [8, 53]. The amplification of a gene is an increase in the number of copies of that gene present in the cell after cell division, the depletion corresponds to a decrease in its number of copies. The hypothesis of amplification of a gene to a copy establishes that in cell division at least one of the two daughter cells will be an exact copy of the mother cell while the second, with some positive probability, will undergo genetic amplifications. Next, we present a model studied by Ledzewicz et al. in [26].
A 2– compartment model with sensitive and resistant subpopulations
Here, we consider two compartments consisting of drug sensitive and resistant cells, and we denote the numbers of cells in the sensitive and resistant compartments by S and R, respectively. Within the one copy forward gene amplification model, once a sensitive cell undergoes cell division, the mother cell dies and one of the daughters will remain sensitive. The other daughter, with probability q, 0 < q < 1, changes into a resistant cell. Similarly, if a resistant cell undergoes cell division, then the mother cell dies, and one of the daughters remains resistant. However, for cancer cells it is possible that a resistant cell may mutate back into a sensitive cell. This phenomenon is well-documented in the medical literature where experiments have shown that the resistant cell population decreases in a drug free medium [44]. We therefore include a probability r ≥ 0 that one of the daughters of a resistant cell may become sensitive. Generally r will be small.
Thus, the controlled dynamics can be represented as
and the initial condition is given by positive numbers S0 and R0 at time t = 0. Setting N = (S,R)T , the dynamics is described again by a bilinear system
The problem that we want to solve is to delay the onset of the time when drugs are no longer effective. Mathematically, we consider the problem to minimize an objective of the form
where k and l are row-vectors of weights with the components of k positive and those of l non-negative. The penalty term kN(T) represents a weighted average of the total number of cancer cells at the end of an assumed fixed therapy interval [0,T] and the term lN(t) models cumulative effects during the therapy. The control term u(t) in the Lagrangian models the negative side effects of the drugs, measured in the L1 norm. The parameters k and l can also be used to put a stronger emphasis on the number of cancer cells since it can be argued that a duplication of the cancer or the drug dose is more hazardous than a duplication of the objective would represent.
Thus, the optimal control problem has the form
Problem 5
For a fixed therapy interval [0,T], minimize the objective J (46), over all Lebesgue measurable functions (respectively, piecewise continuous) u : [0,T]→[0,umax] subject to the equations (42) and (43).
Using the Pontryagin Maximum Principle, there exists a continuous function λ satisfying the adjoint equation
over the control set [0,umax] along (λ(t),N*(t)). Since the Hamiltonian is linear in u and the control set is an interval, defining the switching function Φ by
Whether the singular control will be admissible, i.e., whether it will obey the control bounds 0 ≤ u ≤ umax, will depend on the parameter values, but also on the region in the state space where the state (S,R)T lies and cannot be answered in general. In this case the singular control is called of order 1 and it is again where we have to use the generalized Legendre-Clebsch condition as before we did:
Since (2-q)λ0+ qλ1> 0 the generalized Legendre-Clebsch condition therefore implies:
Proposition 8
Singular controls are not optimal in regions of the state space where qS > (2-q)R.
This holds as long as the resistant population is very small and then optimal controls will be bang-bang, i.e., correspond to sessions of full dose chemotherapy with rest periods interlaced. However, as the portion of resistant cells R increases, the Legendre-Clebsch condition will be satisfied and if admissible, there are mathematical reasons to suspect that it indeed will be the optimal control in this case. It is also quite intuitive that a full dose may do more harm than good once resistance builds up and thus the optimal strategy may switch to give partial doses, i.e., use singular controls.
Cancer chemotherapy and antiangiogenic treatment
Cancer is fundamentally a genetic disease: it arises as a consequence of pathological changes in the information contained in the DNA. That is, the uncontrolled growth of the cells is the result of alterations or mutations in the genetic material. These mutations lead to a process of altering the cell division and the inability to undergo programmed cell death (or apoptosis). The growth of a single cancer (or primary cancer) does not usually lead to the death of the organism. However, some cancer cells can acquire the ability to enter the bloodstream, travel to another part of the body and begin to grow in a different organ. This process is known as metastasis. Generally, metastatic growth is what kills the body. When a cancer cell is created, it can multiply and lead to a tumor of small size without the need for blood supply. At this stage, the growth of an avascular tumor stops. Subsequently, to continue growing, since the tumor needs oxygen and nutrients to grow, provided by the blood, the tumor sends out chemical signals called promoters. This induces the blood vessels to grow towards the tumor and leads to the complete vascularization of the tumor, which allows it to increase in size. The formation of new blood vessels is called angiogenesis. The understanding of the role of angiogenesis in the development of cancer has advanced significantly due to the studies conducted by Folkman [17]. We now know that without angiogenesis, the size of most primary tumors does not exceed 3 mm. One type of very important promoters are the so-called Vascular Endothelial Growth Factors (VEGF). These factors stimulate cellular responses joining the tyrosine kinase receptors (a type of enzyme) on the surface of the endothelial cells and stimulate their proliferation and migration to the tumor. Other pro-angiogenic factors are PDGF, PIGF or FGF. Therefore, anti-angiogenic therapies are important in the treatment of certain cancers. But, by itself, the antiangiogenic treatment only prevents the tumor from developing its support of blood vessels and does not directly destroy the cancer cells. In fact, the benefits of using anti-angiogenic therapies alone are, at best, transient and are followed by the restoration of tumor growth and progression [4]. As a consequence, it is common not to limit the treatment to a body, but to combine different treatments in the hope of achieving a synergistic effect. In this way, antiangiogenic treatment should be combined with other treatments, such as radiotherapy or chemotherapy. In this section, we combine antiangiogenic therapy with chemotherapy.
where p(t) is the size of the tumor, s(t) is the number of stable vessels, q(t) is the number of unstable vessels and u(t) and v(t) are the antiangiogenic and cytotoxic agents respectively. Table 5 shows the values of the parameters of the model.
Values of the biological parameters in the model.
Variable
Description
Value (Units)
ξ
Tumor growth parameter
0.084 day-1
b
Tumor-induced stimulation parameter
5.85 day-1
d
Tumor-induced inhibition parameter
0.00873 mm-2 day-1
μ
Loss of vascular support
0.02 day-1
χ
Maduration of unstable vessels parameter
0.025 day-1
γ
Anti-angiogenic elimination parameter
0.15 kg/mg day-1
φ
Cytotoxic killing parameter for the tumor
0.1 kg/mg day-1
Thus, the problem that we want to solve is to minimize the size of the tumor p(t):
For a free final time T, minimize the objective J, equation (55), over all Lebesgue measurable functions (respectively, piecewise continuous) u : [0,T]→[0,uM] and v : [0,T]→[0,vM] subject to the equations (52), (53) and (54) and the constraints (56).
As was done in previous problems, the switching functions can be calculated to get
Therefore, the generalized Legendre-Clebsch condition is satisfied if λ3(t) > 0, ∀t 2 I. We have then the following proposition:
Proposition 9
If the optimal control u* is singular over an interval I =(a,b), then λ2,λ3,λ4,λ5>0. In addition, the control u* ends in an interval (t1,T) where u* = 0.
Then, we suppose the control u is singular and let’s calculate that control. After some tedious calculations, we get the following singular optimal control
$$\begin{equation}u^{*}_{\text{sing}}=\frac{\lambda_1\psi_1+\lambda_2\psi_2+\lambda_3(\varphi b v+\psi_3)}{2b\gamma\lambda_3}, \end{equation}$$
Regarding to the control v, analytically it is complicated to characterize it for what the numerical methods are used. Thus, in all the numerical simulations, ϕ< 0 and thus the control v is of the bang-bang type.
In this survey, a series of problems of optimal control theory applied to cancer treatments have been addressed. First of all, a class of cell cycle specific compartmental models for cancer chemotherapy was analyzed. By considering the phases of the cell cycle separately, it is possible to appropriately model the different actions of various drugs involved. A common characteristic of the models analyzed in the second section is that it is implicitly assumed that the cancer population is homogeneous and consists of cells that are sensitive to the chemotherapy applied. If we then minimize a weighted average of the tumor population and the total dose of chemotherapy given over a fixed therapy interval, it will be seen that in this scenario it indeed is optimal to give chemotherapy in one full dose session upfront at the beginning of the therapy interval. These results are fully consistent with and confirm as optimal the classical MTD (maximum tolerated dose) regimen. Essentially, the underlying scenario in these models is one where cancer is growing rapidly at a critical level, but sensitive to chemotherapeutic agents. The intuitive optimal solution then is to hit it as hard as possible, as soon as possible.
In the third section, low grade glioma, a type of brain tumor, together with the action of a cytotoxic agent, as the temozolomide, is studied [39]. For this problem, the optimal control solutions depend on the treatment time. If the treatment time is long enough (T = 30 months), the optimal control will be unique. On the other hand, if the treatment time is T = 30 days, the control will be of the bang-bang type.
Many tumors consist of heterogeneous agglomerations of subpopulations of cells that show widely varying sensitivities toward the actions of a particular chemotherapeutic agent [15, 29]. Coupled with the fact that growing tumors also exhibit considerable evolutionary ability to enhance cell survival in an environment that is becoming hostile, this leads to multi-drug resistance of some strains of the cells. Naturally, it makes sense to combine drugs with different activation mechanisms to reach a larger population of the tumor cells-and this is what is being done-but the sad fact remains that some cells develop multi-drug resistance to a wide variety of even structurally unrelated drugs. There may even exist subpopulations of cells that are not sensitive to the treatment from the beginning (ab initio, intrinsic resistance). For certain types of cancer cells, there are simply no effective agents known. We have explored this aspect in section four, studying a 2– compartment model with sensitive and resistant populations.
Finally, we have considered tumor’s microenvironment. One of the most important component of a tumor’s microenvironment is its vasculature. In order to grow beyond a small size, a growing tumor needs to develop its own network of blood vessels and capillaries that will provide it with nutrients and oxygen. This process is called angiogenesis and was already pointed out as a therapeutic target by J. Folkman in the early 1970s [23,24]. Antiangiogenenic treatments aim at depriving the tumor of this needed vasculature by either disrupting the signaling process that the tumor uses to recruit surrounding, mature, host blood vessels or by directly inhibiting the growth of endothelial cells that form the lining of the newly developing blood vessels and capillaries. We have studied this kind of treatment together with the chemotherapy [39] which is based on this model by Hahnfeldt et al. [21].
We hope that this survey will be useful in understanding the optimal treatment protocols in tumors. Mathematical models need to be developed to aid in our understanding of how to implement this work and hopefully to show how to develop new optimal treatment strategies.