Section: Application Domains
Application fields in biology
Our methods are applied in several fields of molecular biology.
Our main application field is marine biology, as it is a transversal field with respect to issues in integrative biology, dynamical systems and sequence analysis. Our main collaborators work at the Station Biologique de Roscoff. We are strongly involved in the study of brown algae: the meneco, memap and memerge tools were designed to realize a complete reconstruction of metabolic networks for non-benchmark species [97], [69]. On the same application model, the pattern discovery tool protomata learner combined with supervised bi-clustering based on formal concept analysis allows for the classification of sub-families of specific proteins [67]. The same tool also allowed us to gain a better understanding of cyanobacteria proteins [4]. At the larger level of 4D structures, classification technics have also allowed us to introduce new methods for the characterization of viruses in marine metagenomic sample [72]. Finally, in dynamical systems, we use asymptotic analysis (tool pogg) to decipher the initiation of sea urchin translation [51] [95]. We are currently involved in two new applications in this domain: the team participates to a Inria Project Lab program with the Biocore and Ange Inria teams, focused on the understanding on green micro-algae; and we are involved in the deciphering of phytoplancton variability at the system biology level in collaboration with the Station Biologique de Roscoff (ANR Samosa).
In micro-biology, our main issue is the understanding of bacteria living in extreme environments, mainly in collaboration with the group of bioinformatics at Universidad de Chile (funded by CMM, CRG and Inria-Chile). In order to elucidate the main characteristics of these bacteria, we develop efficient methods to identify the main groups of regulators for their specific response in their living environment. To that purpose, we use constraints-based modeling and combinatorial optimization. The integrative biology tools meneco bioquali, ingranalysis, shogen, lombarde were designed in this context [77]. In 2016, two applications focused on the study of extremophile consortium of bacteria have been performed with these tools [61], [38]. In parallel, in collaboration with Ifremer (Brest), we have conducted similar work to decipher protein-protein interactions within archebacteria [96]. Our sequence analysis tool (logol) allowed us to build and maintain a very expressive CRISPR database [10] [50].
Similarly, in environmental sciences, our goal is to propose methods to identify regulators of very complex phenotypes related to environmental issues. In collaboration with researchers from Inra/Pegase laboratory, we develop methods to distinguish the response of breeding animals to different diaries or treatments [37] and characterize upstream transcriptional regulators [59], applied to porks [74], [75], [76]. Semantic-based analysis was useful for interpreting differences of gene expression in pork meat [79].
In addition, constraints-based programming also allows us to decipher regulators of reproduction for the pea aphid, an insect that is a pest on plants [85], [120]. This was performed in collaboration with Inra/Igepp. This paved the way to the recent research track initiated in the team about integration of heterogeneous data with RDF-technologies (see AskOmics software) [84], [70] and about graph-compression (see powergrasp software).
In bio-medical applications, we focus our attention on the confrontation of large-scale measurements with large-scale knowledge repositories about regulation pathways such as Transpath, PID or pathway commons. In collaboration with Institut Curie, we have studied the Ewing Sarcoma regulation network to test the capability of our tool bioquali to accurately correct and predict a large-scale network behavior [47]. Our ongoing studies in this field focus on the exhaustive learning of discrete dynamical networks matching with experimental data, as a case study for modeling experimental design with constraints-based approaches. To that purpose, we collaborate with J. Saez Rodriguez group at EBI [115] and N. Theret group at Inserm/Irset (Rennes) [40]. The dynamical system tools caspo and cadbiom were designed within these collaborations. Ongoing studies focus on the understanding of the metabolism of xenobiotics (mecagenotox program) and the filtering of sets of regulatory compounds within large-scale signaling network (TGFSysBio project).