Section: Overall Objectives
General strategy
The general strategy consists of the interactions of several stages:
-
Stage 1: Derivation of mechanistic models based on the biological knowledge and the available observations. The construction of such models relies on the up-to-date biological knowledge at the cellular level including description of the cell-cycle, interaction with the microenvironement (angiogenesis, interaction with the stroma). Such models also include a “macroscopic" description of specific molecular pathways that are known to have a critical role in carcinogenesis or that are targeted by new drugs. We emphasize that for this purpose, tight interactions with biologists are crucial. Lots of works devoted to the modeling at the cellular level are available in the literature. However, in order to be able to use these models in a clinical context, we also need to describe the tumor at the tissue level. The in vitro mechanical characterization of tumor tissues has been widely studied. However, no description that could be patient specific or even tumor specific is available. It is therefore necessary to build adapted phenomenological models, according to the biological and clinical reality.
-
Stage 2: Data collection. In the clinical context, data may come from medical imaging (MRI, CT-Scan, PET scan) at different time points. It is also a crucial point: we need longitudinal data in time in order to be able to understand the time course of the disease. The data can also be obtained from the analysis of blood samples or biopsies. It is critical to have tight collaborations with clinicians for the selection of the specific cases to focus on, the understanding of the key points and of the key data, the classification of the grades of the tumors, the understanding of the treatment, ...In the preclinical context, data can be macroscopic measurements of the tumor volume for subcutaneous cases, green fluorescence protein (GFP) quantifications for total number of living cells, non-invasive bioluminescence signals or even imaging obtained with devices adapted to small animals.
-
Stage 3: Adaptation of the model to the data. The model has to be adapted to the data: it is useless to have a model taking many biological features of the disease into account if it cannot be reliably parameterized with the data. For example, very detailed description of the angiogenesis process given in the literature cannot be used, even if one has data arising from perfusion MRIs. A pragmatic approach has to be developed for this purpose. On the other hand, one has to try to model any element that can be useful to exploit the image. Parameterizing must be performed carefully in order to achieve an optimal trade-off between the accuracy of the model, its complexity, identifiability and predictive power. Parameter estimation is a critical issue in mathematical biology, which we cannot avoid: if there are too many parameters, it will be impossible to estimate them but if the model is too simple, it will be too far from reality.
-
Stage 4: Data assimilation. Due to the complexity of the data – for example multimodal, longitudinal medical imaging – the data assimilation is a major challenge. Such a process is a combination of methods for solving inverse problems and statistical methods including machine learning strategies. Presently, most of the inverse problems – developed in the team – are solved using a gradient method coupled with some Monte-Carlo type algorithm. More efficient methods could be used as for example the sequential methods, i.e. the Kalman type filters or the so-called Luenberger filter (nudging). Using the sequential methods can also simplify the Stage 3 because they can be used even with complex models. Of course, the strategy used by the team depends on the quantity and the quality of data. It is not the same if we have an homogeneous population of cases or if it is a very specific isolated case.
-
Stage 4': Data assimilation of gene expression. "Omics" data become more and more important in oncology and we aim at developing our models using the available information. For example, in our previous work on GIST, we have taken into account the cases with mutation on Ckit. However, it is still not clear how to use in general gene expression data in our macroscopic models, and particularly how to connect the genotype to the phenotype and the macroscopic growth. We expect to use statistical learning techniques on populations of patients in order to move towards this direction, but we emphasize that this task is very prospective and is a scientific challenge in itself.
-
Stage 5: Simulation. Once the models have been parametrized, the simulation part can be done. We also need to include a quantification of uncertainties and to produce 3D simulations that can be confronted to reality.