Closing of the ANITI call for chairs – list of selected projects

Uncertainty Quantification for Physical and Artificial Intelligence systems

Facing Low Resource Natural Language Processing

Principal Investigator: Braud, Chloé – CNRS / IRIT

Co-Chair: Benamara, Farah – UT3 / IRIT

Summary

Natural Language Processing is a subfield of Artificial Intelligence that aims at building computational models of human language. Current approaches rely on machine learning algorithms that need data for training and evaluation. However, annotating data is expensive and time-consuming; as a result, annotations are scarce for most languages, for specialized domains, and for specific high-level tasks. This leads to issues with robustness, since systems are unable to generalize to new situations, and with fairness, since NLP applications only work for a limited range of human language productions. This ANITI2.0 Starting Chair proposal intends to tackle these issues by leveraging Weak Supervision, a learning paradigm that allows the development of hybrid systems and has shown promising results on several NLP tasks.

The general idea is to use labeling functions based either on rules derived from expert knowledge or on the predictions of statistical systems, making it possible to take advantage of all available sources of information, even noisy ones. Within our Chair FLowReN (Facing Low-Resource NLP), we will extend this paradigm to the high-level, semantic-pragmatic tasks that suffer most from data scarcity. We will also investigate multilingual, multimodal and multitask approaches, both to enlarge the sources of weak supervision and to enhance performance for low-resource languages and domains. The theoretical results will be applied to real-world use cases provided by the two industrial partners supporting this Chair: Airbus and Liebherr. In this context, the ability to adapt to specialized domains with little annotated data is crucial. Finally, we aim at exploring the explainability of the end systems, through control over the entire learning process. This project based on hybrid learning aims at establishing a new state of the art on major NLP tasks while going a step further toward robust and fair AI.
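As an illustration of the labeling-function idea described above, the following minimal sketch combines two keyword rules and a stand-in for a statistical predictor by majority vote. All names, keywords and labels (`lf_keyword_positive`, `lf_noisy_model`, "pos"/"neg") are hypothetical and are not taken from the FLowReN project; real weak-supervision systems typically replace the majority vote with a learned aggregation model.

```python
from collections import Counter

ABSTAIN = None  # a labeling function may decline to vote


def lf_keyword_positive(text):
    # Hypothetical expert rule: these keywords suggest a positive report.
    return "pos" if any(w in text.lower() for w in ("excellent", "reliable")) else ABSTAIN


def lf_keyword_negative(text):
    # Hypothetical expert rule: these keywords suggest a negative report.
    return "neg" if any(w in text.lower() for w in ("failure", "leak")) else ABSTAIN


def lf_noisy_model(text):
    # Stand-in for the prediction of a statistical system (assumption):
    # a crude heuristic that flags long reports as negative.
    return "neg" if len(text.split()) > 8 else ABSTAIN


LABELING_FUNCTIONS = [lf_keyword_positive, lf_keyword_negative, lf_noisy_model]


def weak_label(text):
    """Aggregate the noisy votes by majority; abstain on ties or when nobody votes."""
    votes = [lf(text) for lf in LABELING_FUNCTIONS]
    votes = [v for v in votes if v is not ABSTAIN]
    if not votes:
        return ABSTAIN
    counts = Counter(votes).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return ABSTAIN
    return counts[0][0]
```

The weak labels produced this way can then serve as noisy training data for a downstream model when no manual annotation is available.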

Mathematical Approaches for Deep Learning, representation Learning, And high-Dimensional Statistics

LArge Tensors for daTa analysIs and maChine lEarning

Hybrid, Interpretable Machine Learning

Principal Investigator: Wilson, Dennis – ISAE / ISAE

Co-Chair: Almar, Rafaël – IRD / LEGOS

Summary

The HIML project is an ambitious initiative aimed at combining the strengths of symbolic regression and deep learning to create Hybrid Symbolic Regression (HSR) for interpretable machine learning models. With a focus on high-stakes domains, such as environmental modeling, the project seeks to develop models that are both accurate and easily understood by domain experts. The research is divided into two main directions.

The first research direction is centered on theoretical advancements in HSR, utilizing Large Language Models (LLMs) to evolve code and deep learning as a surrogate fitness to guide the evolutionary process.

The goal is to improve the accuracy and interpretability of models by working directly with code and exploring parts of the data not yet covered by the current population of functions.

The second research direction emphasizes the practical application of HSR to enhance environmental models in two crucial areas: coastal risk assessment and ENSO (El Niño-Southern Oscillation) modeling. By prioritizing interpretability and working closely with domain experts, the HIML project aims to develop more accurate, actionable models that can inform decision-making in the face of climate change and other environmental challenges.

In summary, the HIML project sets out to make significant contributions to both theoretical and applied aspects of machine learning, particularly in the context of environmental modeling. By leveraging the power of Hybrid Symbolic Regression, the project aspires to create more accurate and interpretable models that can effectively inform high-stakes decisions and foster better communication among scientists, policymakers, and other stakeholders. The success of this project holds the potential to transform the way machine learning is employed in critical domains, enabling more responsible and effective decision-making in the face of global challenges.

Human-Centered AI for Argument-based Deliberation

Principal Investigator(s): Amgoud, Leila – CNRS / IRIT

Co-Chair(s): Lagasquie-Schiex, Marie-Christine – UT3 / IRIT, Tamine, Lynda – UT3 / IRIT, Zarate, Pascale – UT Capitole / IRIT, Ben Kraiem, Ines – Sogeti/Capgemini / SogetiLabs

Summary

Recognized as vital in a group decision-making process, deliberation allows stakeholders to discuss controversial issues and reach agreement on them before making final decisions. It brings several benefits, one of which is ensuring well-informed and well-accepted decisions. Its backbone is argumentation, which consists of justifying claims by arguments, i.e., reasons behind claims. The greatest challenges facing deliberation systems are identifying, analysing, evaluating, and aggregating large sets of interacting arguments, generally of disparate types, and resolving potential disagreements between stakeholders.

The HuCAD project promotes a novel holistic paradigm fostering human-machine collaboration for effective deliberations. It aims to develop AI systems that not only address all of the above challenges, but also enhance argumentation capabilities of stakeholders by suggesting relevant arguments, retrieved from the web, and facilitating a good grasp of a debate thanks to an automatically generated and structured synthesis of the most salient arguments. The end goal of HuCAD is a suite of theoretical developments, namely (i) a sufficiently general formal theory of argumentation which supports real-world arguments, (ii) language models grounded on computational argumentation and tailored for argument retrieval and argumentation synthesis, (iii) an advanced theory of multiple criteria decision making founded on the two previous developments; and an application in real-life scenarios through a collaboration with the industrial partner CAPGEMINI (ex. SOGETI), experienced in collaborative decision making. The consortium includes world-renowned academic experts in computational argumentation, decision theory and information retrieval.

The project has multiple applications with strong social and economic impact, including, but not limited to: debate platforms, which abound on the web, for enhancing opinion formation; new-generation argumentative search engines able to foster debates by providing in-context arguments pro and con, either as a list or as a structured synthesis, beyond the classical ten-blue-links format; and collaborative decision-making platforms, which are increasingly used by companies and institutions.

Frugal Reinforcement Learning for Stochastic Networks

Center for Moral Artificial Intelligence

Disaster risk prediction and multivariate anomaly detection

Principal Investigator(s): Daouia, Abdelaati – TSE GE / TSE-R

Co-Chair(s): Sabourin, Anne – Université Paris Cité, Stupfler, Gilles – Université d’Angers

Summary

Disaster or global risk assessment is concerned with the analysis of rare events that carry the potential of serious impacts on our health, the environment or the economy. This includes systemic risk mitigation which is crucial in finance and insurance, especially with the advent of climate, epidemiological, and cybersecurity risks. Available methods typically break down in realistic settings, where the data can feature heavy tails with various forms of heterogeneity (different sample sizes, different marginal distributions including heteroskedasticity, etc.), dependence across time and/or space, non-stationarity in time due to economic crises and climate change, and intricate covariates representing microeconomic characteristics or describing e.g. climate, biosphere and environmental states. Another major challenge arises when the number of response variables is large in extremal regression models, where computational constraints on the sample size and theoretical difficulties in handling high-dimensional information plague the accuracy of prediction and inference about tail risk. Regression quantiles, which are the usual metrics for quantifying such conditional risk, are themselves often criticized for their lack of alertness and reactivity to the severity of extreme (disastrous) observations. Our project attempts to solve these difficulties through the lens of extreme value theory combined with machine learning methods (random forests, gradient tree boosting and deep neural networks) or with dimension reduction techniques, so as to propose least asymmetrically weighted squares regression models that have the ability to extrapolate beyond the range of observed values and to model complex covariate dependencies in the predictors. Our applications include risk assessment of cyber insurance on data breaches, as well as risk prediction of complex environmental and climatic processes. They are also concerned with anomaly detection in industrial problems.
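To make the notion of least asymmetrically weighted squares concrete, here is a minimal sketch of the simplest case: the sample expectile of a set of values, without covariates and without any extreme-value extrapolation. The function name `expectile`, its parameters and the toy data are illustrative assumptions, not the project's estimators. The level `tau` weights squared errors above the fit by `tau` and those below by `1 - tau`, and the minimiser is found by iteratively reweighted averaging.

```python
def expectile(values, tau, tol=1e-12, max_iter=1000):
    """Minimise sum_i |tau - 1(y_i <= m)| * (y_i - m)**2 over m."""
    m = sum(values) / len(values)  # start from the mean (the tau = 0.5 solution)
    for _ in range(max_iter):
        # Observations above the current fit get weight tau, the others 1 - tau.
        weights = [tau if y > m else 1.0 - tau for y in values]
        new_m = sum(w * y for w, y in zip(weights, values)) / sum(weights)
        if abs(new_m - m) < tol:
            return new_m
        m = new_m
    return m


# Toy loss data with one severe observation (illustrative only).
losses = [0.0, 0.0, 0.0, 10.0]
mean_level = expectile(losses, 0.5)
high_level = expectile(losses, 0.9)
```

On this toy sample the 0.5-expectile coincides with the mean, 2.5, while the 0.9-expectile moves to 7.5. Unlike a quantile, its value depends on the magnitude of the extreme observation and not only on its rank, which is the reactivity to severity that the summary contrasts with regression quantiles.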

Innovations in the Wake of COVID-19

Principal Investigator(s): Chen, Daniel L. – CNRS / TSE-R

Summary

Slow justice delivery is associated with a poor business climate and can have serious economic and welfare consequences (World Bank, 2017). When institutional barriers limit access to judicial resolutions, victims of crimes are incapable of pursuing restitution, beneficiaries of government services cannot benefit from needed resources, and mundane administrative tasks become insurmountable. An estimated 1.5 billion people around the world are unable to obtain justice for administrative, criminal, or civil justice problems (World Justice Project, 2019). This global challenge of unmet justice has been severely exacerbated by the COVID-19 pandemic. This project creates knowledge and evidence on how to build more resilient justice systems in the wake of COVID-19 through tech-enabled innovations.

My team will develop original governance and institutions research with policy implications for COVID-19 recovery efforts of not only judiciaries in Kenya, Peru, Pakistan, Brazil, and India where the research will be conducted, but also other judiciaries around the world. In particular, the insights of our research on various e-justice innovations will be relevant to developing countries where legal capacity has been most affected by COVID-19. Open source tools will be released such that other countries can adapt and use them to support struggling judiciaries.

The questions are threefold: How can we improve productivity and effectiveness of courts dealing with backlogs of cases? How can we expand access to justice for citizens? Can data science and artificial intelligence reduce information frictions and unlock the positive effects of justice on economic development? We will evaluate different e-justice innovations with the goal of strengthening judiciaries to deal with the growing backlog of cases and low citizen access to courts. The results will help us to understand which innovations work to build resilience in justice systems around the world in the wake of COVID-19.

BRAIN

AI for Smart and Sustainable Air Traffic Management and Air Mobility

Principal Investigator: Delahaye, Daniel – ENAC / ENAC LAB

Summary

The proposed chair addresses two strongly related pillars of future air mobility.

AI for Sustainable Air Operations and Air Mobility. Air transportation is currently facing environmental challenges for which AI may bring solutions that reduce the overall impact of aviation. It is considered that optimizing air operations may reduce CO2 emissions by 10%. In addition, the non-CO2 impact of aviation (contrails, which contribute to radiative forcing) may also be reduced by optimizing aircraft trajectories at large scale. Noise abatement will also be considered in this research. We propose in this chair to develop new AI decision support tools for optimizing air operations (trajectory planning, etc.) in order to meet these new challenges. Such sustainable trajectories (continuous climb and descent, fuel-optimal trajectories in the presence of wind, etc.) do not stick to the airway network and will therefore be more difficult for air traffic controllers (ATCOs) to manage. We also propose to develop new AI decision support tools that help ATCOs manage such trajectories by increasing the level of automation of the ground segment. In addition, ML algorithms will be investigated to improve the prediction of air operations (trajectory prediction, etc.) as well as of the areas of the airspace favorable to persistent contrails.

AI for ATM Automation. Comparing the onboard side and the ground side, one can notice that automation has been developed much more in the cockpit than in the control rooms. This is due to the difficulty of automating the controller’s task, which consists of managing conflict detection and resolution between aircraft. Many efforts have been made in the past to develop decision support tools that help controllers manage the traffic and thereby enhance the capacity of the system. Unfortunately, few improvements have been achieved in this direction due to the lack of certification of such algorithms, and we propose to investigate this field with the new developments in artificial intelligence. The main objective of this research is to develop decision support tools that help controllers manage sustainable trajectories (continuous climb, etc.) and thereby improve the overall capacity and sustainability of the air transportation system. To reach this goal, we propose to develop trustworthy AI algorithms with a strong focus on explainability and robustness, as required for such a critical application. In addition, the improvement of trajectory prediction algorithms for conflict detection will also be addressed in this research.

Advances in majorization-minimization algorithms for optimization with non-quadratic loss functions

Principal Investigator(s): Fevotte, Cédric – CNRS / IRIT

Co-Chair(s): Cazelles, Elsa – CNRS / IRIT, Soubies, Emmanuel – CNRS / IRIT

Summary

Many problems in machine learning and signal processing involve the optimisation of a loss function with respect to a set of parameters of interest. A common choice is the quadratic loss because it enjoys convenient mathematical properties that make it amenable to optimisation. However, from a modelling point of view, the quadratic loss corresponds to an underlying Gaussian model that does not always comply with the geometry of the data. This is the case when dealing with nonnegative, integer-valued or binary data, for which non-quadratic losses are more suitable.

The aim of AMINA is to advance the theory and methodology of optimisation with non-quadratic loss functions using the framework of majorisation-minimisation (MM). MM consists in iteratively building and minimising a locally tight upper bound of the loss. In other words, it resorts to the iterative optimisation of a local approximation. This is an intuitive and yet powerful optimisation framework that does not require stringent assumptions. MM algorithms decrease the value of the loss at every iteration and do not require tuning parameters. Well-designed upper bounds can finely capture the local curvature of the loss, resulting in efficient updates. Though MM can be traced back to the 1970s, it has enjoyed a significant revival in the last ten years. I played a part in this revival with highly cited articles about MM for nonnegative matrix factorisation (NMF) with the beta-divergence, a wide class of loss functions of important practical value.
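The MM principle described above can be illustrated with the classical multiplicative updates for NMF under the generalised Kullback-Leibler divergence, which is the beta = 1 member of the beta-divergence family. The sketch below is a plain-Python illustration under stated assumptions: the helper names (`matmul`, `kl_divergence`, `mm_step`), the toy matrix `V` and the rank `K` are hypothetical, and these are the standard alternating updates, not the new algorithms AMINA proposes. Each update minimises a tight upper bound of the loss, so the recorded loss values never increase and no step size has to be tuned.

```python
import math
import random


def matmul(A, B):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def kl_divergence(V, V_hat):
    """Generalised KL divergence D(V | V_hat), summed over all entries (V > 0)."""
    return sum(
        v * math.log(v / vh) - v + vh
        for v_row, vh_row in zip(V, V_hat)
        for v, vh in zip(v_row, vh_row)
    )


def elementwise_ratio(V, V_hat):
    return [[v / vh for v, vh in zip(v_row, vh_row)] for v_row, vh_row in zip(V, V_hat)]


def mm_step(V, W, H):
    """One pass of multiplicative MM updates: H first, then W."""
    K = len(H)
    # H <- H * (W^T (V / WH)) / (W^T 1)
    ratio = elementwise_ratio(V, matmul(W, H))
    W_t = [list(col) for col in zip(*W)]
    num = matmul(W_t, ratio)
    w_col_sums = [sum(col) for col in W_t]
    H = [[H[k][n] * num[k][n] / w_col_sums[k] for n in range(len(H[0]))] for k in range(K)]
    # W <- W * ((V / WH) H^T) / (1 H^T)
    ratio = elementwise_ratio(V, matmul(W, H))
    H_t = [list(col) for col in zip(*H)]
    num = matmul(ratio, H_t)
    h_row_sums = [sum(row) for row in H]
    W = [[W[f][k] * num[f][k] / h_row_sums[k] for k in range(K)] for f in range(len(W))]
    return W, H


random.seed(0)
V = [[1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0], [4.0, 3.0, 2.0, 1.0]]  # toy positive data
K = 2  # factorisation rank (assumption)
W = [[random.uniform(0.5, 1.5) for _ in range(K)] for _ in V]
H = [[random.uniform(0.5, 1.5) for _ in V[0]] for _ in range(K)]

losses = [kl_divergence(V, matmul(W, H))]
for _ in range(50):
    W, H = mm_step(V, W, H)
    losses.append(kl_divergence(V, matmul(W, H)))
```

Because the factors are only ever multiplied by nonnegative ratios, a positive initialisation keeps `W` and `H` nonnegative without any projection step.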

AMINA will tackle challenging problems related to the design and convergence of MM algorithms in four innovative machine learning and signal processing settings: 1) non-alternating updates for NMF, 2) phase retrieval with the beta-divergence, 3) unbalanced optimal transport for audio interpolation, 4) stochastic MM for deep learning. Designing efficient optimisation algorithms with convergence guarantees is a crucial step in building trustworthy AI systems.

Designing Artificial Social Reasoners

Machine Learning for Sustainable International Development (ML4SID)

Guaranteed and frugal deep learning

Principal Investigator: Malgouyres, François – UT3 / IMT

Co-Chair: Landsberg, Joseph – Texas A&M University

Summary

The project aims at providing guarantees for deep learning methods and building new frugal architectures.

It consists of three complementary parts.

In the first part, we will seek to establish theoretical guarantees, which can be computed for moderate-size problems, for the learning of deep ReLU networks. As deep neural networks are used in a context where the number of examples is generally smaller than the number M of parameters of the network, the guarantees are based on properties valid on a subset of parameters, leading to the description of functions whose ‘complexity’ C is much smaller than the number of parameters. In this work, we will study different notions of local complexities, similar to a ‘local pseudo-dimension’, based on a geometric analysis of neural networks.

Another complementary and simpler approach is based on the quantification of predictive uncertainty a posteriori, once the model is trained. The idea is to statistically evaluate the uncertainty in an efficient and guaranteed way. General techniques (called conformal prediction) exist for ‘black-box’ models. We will develop variants that exploit the structure of the models and tasks in a tighter way (guaranteed tuning of multiple hyperparameters, multi-task learning setting).
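For readers unfamiliar with conformal prediction, the following minimal sketch shows the standard split-conformal recipe for a black-box regressor, not the structure-aware variants the project intends to develop. The function names, the toy model and the calibration data are illustrative assumptions. Absolute residuals on held-out calibration data are sorted, and the value of rank ceil((n + 1)(1 - alpha)) is used as the half-width of the prediction interval, which yields coverage of at least 1 - alpha when calibration and test points are exchangeable.

```python
import math


def conformal_quantile(scores, alpha):
    """Finite-sample-corrected quantile of the calibration scores."""
    n = len(scores)
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(scores)[rank - 1]


def predict_interval(model, x, q):
    """Symmetric interval around the black-box prediction."""
    y_hat = model(x)
    return (y_hat - q, y_hat + q)


def model(x):
    # Stand-in for an already trained black-box predictor (assumption).
    return 2.0 * x


# Held-out calibration pairs (x, y), illustrative values only.
calibration = [(0, 0.5), (1, 1.0), (2, 4.25), (3, 6.0), (4, 9.5), (5, 9.0), (6, 12.75)]
scores = [abs(y - model(x)) for x, y in calibration]

q = conformal_quantile(scores, alpha=0.25)
interval = predict_interval(model, 10, q)
```

With these seven calibration points and alpha = 0.25, the selected rank is 6, so `q` equals 1.0 and the interval for x = 10 is (19.0, 21.0). The procedure never inspects the model's internals, which is why it applies to any black-box predictor.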

For embeddability purposes, we will study quantized networks, associated models, and algorithms.

We will generalize and adapt a recent study providing convergence guarantees for the ‘straight-through estimator’, the main algorithm used to optimize quantized weights in deep learning. We will also continue ongoing work on quantization-aware training and robustness, and study new low-bit matrix models, leading to more expressive networks at constant memory or computational cost. The methodological developments will be tested on time series and object-detection tasks.

eXplainability science in artifiCIal intelligENCE

Evolution of galaxies using Machine Learning

Reinforcement Learning on a Diet

User-Centered Interactive Machine Learning for 3D Point Cloud Analysis

Anomaly Detection and Diagnosis

Principal Investigator: Travé-Massuyes, Louise – CNRS / LAAS

Co-Chair(s): Lasserre, Jean Bernard – CNRS / LAAS, Chanthery, Elodie – INSA / LAAS, Jauberthie, Carine – UT3 / LAAS, Pucel, Xavier – ONERA / DTIS

Summary

The AC chair project ADDX aims to bridge model-based and data-based methods for anomaly detection (AD) and diagnosis (DX), drawing mutual benefits and closely integrating them in a hybrid AI framework. It gives pride of place to anomaly detection because anomalies, also defined as outliers or out-of-distribution observations, must be detected in data, as they can indicate data corruption or faulty behavior. Trust in Artificial Intelligence (AI) systems depends on this because their reliability relies on inputs lying in the training distribution. In addition, anomaly detection plays a crucial role in certifying data obtained from sensors or images, as well as in identifying symptoms that can be used to drive diagnosis reasoning and health management. Explicit knowledge extraction from data and knowledge-guided learning, applied to the problems above, will be the keystones of the research for this chair. Approaches from shallow and deep learning will be compared and synergistically integrated.

The proposal is balanced between “blue sky” research and more applied research that meets socio-economic needs. Collaboration is foreseen with other chairs on the themes of polynomial optimization and robustness, as well as with the DEEL project of IRT Saint Exupéry. Industrial partnerships are also planned: several companies have already shown interest in this chair, namely Airbus, Continental, Batconnect, Carl-Berger Levrault and Vitesco Technologies. Applications in the medical domain are also foreseen, with first contacts made with Hopital Purpan CHU Toulouse.

The chair will be carried by a team of five researchers who bring skills in three complementary fields for the targeted tasks: AI, maths and automatic control. This spectrum of expertise will enable the achievement of hybrid AI results that will advance the state of the art.

Hybridizing AI and Large-scale Simulations for Engineering Design

Hybrid Policy Optimization for Safe and Efficient Robotic Manipulation and Locomotion

Principal Investigator: Righetti, Ludovic – New York University / LAAS

Co-Chair: Mansard, Nicolas – CNRS / LAAS

Summary

Robotics has seen tremendous progress in recent years thanks to advances in simulators, optimizers, and reinforcement learning. Despite impressive demonstrations, however, these methods have yet to be fully realized in real-world settings, and scaling beyond the lab still is challenging. What if we could unlock the full potential of constrained optimization and reinforcement learning to train safe, effective policies for dynamic locomotion and manipulation tasks? In HYPOMEL, we propose to achieve this objective by exploiting the results in ANITI 1.0 and of past collaborative research projects. We will first revisit the way reinforcement learning algorithms work by taking advantage of the consolidated expertise gained by years of progress in trajectory optimization, and exploiting recent advances in differentiable simulators. Our aim is to achieve faster, more accurate convergence, reducing the computational burden and providing strict guarantees on the resulting optimal policy. We will then incorporate strategies for planning across hybrid dynamic modes, such as switching between continuous and discrete control, and integrate multi-modal sensory information, including force and tactile sensing. These enhancements will increase the robustness and safety of our policies during physical interactions, enabling effective manipulation and locomotion in complex environments. The experimental capabilities of our team, along with the expertise of Ludovic Righetti in ANITI, will pave the way for a comprehensive and revolutionary framework that hybridizes the advantages of both MPC and RL, ultimately enabling the development of scalable, complex, and reliable robotic behaviors that can be deployed in real-world applications. 
It will benefit from direct interaction with the synergy chair C3PO (should it be accepted) and with other chairs in numerical optimization and machine learning, and from effective collaboration with the robot manufacturer PAL and with the end-user AIRBUS. It will also pursue direct socio-economic impact, with particular attention to dissemination to younger audiences through dedicated robotics activities.

Trust and Responsibility in Artificial Intelligence

Principal Investigator(s): Bolte, Jérome – TSE GE / TSE-R, Smolin, Alexei – TSE GE / TSE-R, Eynard, Jessica – UT Capitole / IDP, Loubes, Jean-Michel – UT3 / IMT

Co-Chair(s): Mangematin, Céline – UT Capitole / IDP, May, Xiaoyi – UT2J / IMT, Rhodes, Andrew – UT Capitole / ?, Pauwels, Edouard – UT3 / IRIT, Renault, Jérôme – TSE GE / TSE-R

Summary

Artificial intelligence (AI) is revolutionizing numerous sectors, leading to significant economic, legal, social, and regulatory consequences. Its transformative potential highlights the importance of addressing both opportunities and challenges. Recently, an open letter by prominent AI experts and industry leaders [1] advocated that “AI research and development should be refocused on making today’s powerful, state-of-the-art systems more accurate, safe, interpretable, transparent, robust, aligned, trustworthy, and loyal.” They also emphasized that “AI developers must work with policymakers to dramatically accelerate development of robust AI governance systems. These should at a minimum include: new and capable regulatory authorities dedicated to AI; … a robust auditing and certification ecosystem; liability for AI-caused harm.” There is indeed an urgent need to develop governing principles for AI development and operations, akin to quality standards for goods and services. However, regulating, auditing, and improving AI systems present unique challenges. First, AI systems are highly complex mathematical processes, making the identification, measurement, and mitigation of risks and biases scientifically and technically demanding tasks. Second, AI is a multi-purpose technology with societal impacts that extend far beyond those of conventional goods and services. Achieving successful AI regulation and fostering trust in AI systems necessitates interdisciplinary collaboration among AI experts, social scientists, and legislators.

Our interdisciplinary research team combines expertise in law, statistics, optimization, and economics, with the goal of cultivating an AI ecosystem that is efficient, economically viable, and respectful of rights, liberties, and social welfare. By addressing the pressing concerns surrounding AI, we aim to establish the foundation for a sustainable, ethical, and responsible future in AI technology, ultimately transforming the way AI systems are managed and governed.

EXPLainablE and physics-informed Ai for Regional weaTHer prediction

Principal Investigator(s): Risser, Laurent – CNRS / IMT, Trojahn, Cassia – UT2J / IRIT, Raynaud, Laure – Météo-France / CNRM

Co-Chair(s): Lapeyre, Corentin – Cerfacs, Mohanamuraly, Pavanakumar – Cerfacs, Masson, Valery – Météo-France / CNRM, Bovalo, Christophe – BULL SAS / ATOS, Kaminski, Gwenael – UT2J / CLLE

Summary

Accurate predictions of future weather conditions are essential for the safety of people and goods, and for the management of a wide range of economic activities. Since the mid-20th century, weather prediction has relied on physical modelling of the atmospheric dynamics, with significant and regular performance improvements. However, the precise prediction of high-impact local phenomena remains difficult and computationally expensive. The rapid progress of Artificial Intelligence (AI) technologies, with impressive early applications to weather forecasting, offers unexpected opportunities for a new generation of weather predictions based on the hybridisation of traditional physical models and AI, allowing for increased accuracy and timeliness in a cost-effective way. This chair will focus on some of the most promising avenues to leverage the potential of AI for very high resolution probabilistic weather forecasts, with efforts dedicated to the development and evaluation of both hybrid approaches and purely data-driven forecasts. Outcomes will provide (1) an in-depth overview of the strengths and weaknesses of these approaches, with an assessment of their possible roles in the future of operational weather forecasting, and (2) an innovative physics-informed AI prediction system that optimally combines the best of the physics and AI worlds. To achieve these ambitious goals, research will be conducted in a holistic framework, supported by a strong multidisciplinary consortium gathering expertise in atmospheric modelling, AI and social sciences. Beyond the methodological and technological breakthroughs that are likely to emerge from this work, particular emphasis will be placed on the development of explainable and robust AI solutions, as well as on their transfer to and acceptance by end-users. This is motivated by legal concerns, given that weather forecasting is likely to be considered a high-risk system under the European Commission AI Act.

Advanced Eddy-resolving Global and Integrated modeling using machine learning for accurate climate predictions

Principal Investigator(s): Zhang, Sixin – INP / IRIT, Renault, Lionel – IRD / LEGOS

Co-Chair(s): Benshila, Rachid – CNRS / LEGOS, Simon, Ehouarn – INP / IRIT

Summary

Accurate prediction of Earth’s potential climate trajectories under human pressure, in both the short term and the long term, relies on correctly representing physics, chemistry and biology in global climate models. In the last two decades, two relevant findings have emerged regarding what determines the oceanic circulation and air-sea fluxes: the key role potentially played by eddy-scale (O(100) km) oceanic processes, and that of their related air-sea interactions. The continued increase in computational resources has now made it possible to run regional mesoscale coupled models, stand-alone global ocean models that permit submesoscale processes at a spatial resolution of ~2 km, and even global coupled ocean-atmosphere models with a spatial resolution of a few kilometers. By nature, future climate is uncharted territory, which precludes any direct assessment of the realism of climate scenarios. To overcome this issue, the climate community can rely on paleoclimate observations and reconstructions, and on the assumption that if one can reproduce a past climate, one should be able to realistically simulate a future climate. To this end, global coupled simulations must be performed over centennial periods of time (e.g., paleoclimatic periods), which implies a computational cost that, for high-resolution simulations, may only become affordable between 2050 and 2080. In terms of oceanic forecasting, high-resolution simulations can be too heavy to run, and there is a need to better account for ocean-atmosphere coupling.

Are we doomed to use very high-resolution global models or can we rely on new generation parameterizations of fine scale oceanic processes and Ocean-Atmosphere interactions? AEGIR proposes a way forward, by developing new methodologies in machine learning to be applied in Earth Science and by developing parameterizations of oceanic mesoscale processes and air-sea fluxes.

Cobots with Conversation, Cognition and PerceptiOn

Principal Investigator(s): Asher, Nicholas – Emérite / IRIT, Serre, Thomas – Brown University, Stasse, Olivier – CNRS / LAAS, VanRullen, Rufin – CNRS / CERCO

Co-Chair(s): Arnold, Alexandre – Airbus, Boutin, Victor – Brown University, Flayols, Thomas – CNRS / LAAS, Hunter, Julie – LINAGORA, Muller, Philippe – UT3 / IRIT, Mansard, Nicolas – CNRS / LAAS, Pellegrini, Thomas – UT3 / IRIT

Summary

Recent AI models have achieved remarkable success in specific domains (e.g. vision, language, robotic agent control), and there is a push towards ever larger models combining multiple input and output modalities. In theory, multimodal representations can help vision scientists by endowing sensory inputs with semantic information; similarly, linguists can use them to ground NLP tokens in the sensorimotor environment and create a form of referential meaning; roboticists can also take advantage of these versatile representations for navigation and action planning. But in practice, current models rely on brute-force training approaches using billions of labelled examples, while the datasets and computing resources available to academic and industrial researchers are typically much smaller. Compared to artificial neural networks, real brains learn much more efficiently; we thus take inspiration from the cognitive science idea of a Global Workspace (GW) to build a novel class of AI systems (PI: VanRullen). The GW, a unique model of multimodal grounding (encompassing perception, action and semantic representations), can promote advances in perceptual models (PI: Serre), and support both top-down interactions (from language and semantics to perception and action) of interest to linguists (PI: Asher), and bottom-up interactions (from active perception and navigation to semantic abstractions) of interest to roboticists (PI: Stasse). The high-risk/high-gain hypothesis is that the modalities complement one another synergistically, such that the whole system is much more efficient than the sum of its parts, not just for multimodal tasks but also when evaluated in the initial domains (vision, NLP, robotics).

Building frugal perceptual and cognitive models that can support language grounding and embodiment and provide semantic representations to robotic agents is expected to have important beneficial consequences for ANITI’s industrial partners (e.g. Airbus, Linagora).

Certified AI for Understanding Intracellular Dynamics

Principal Investigator(s): Cortés, Juan – CNRS / LAAS, Weiss, Pierre – CNRS / IMT

Summary

Looking at cells with a standard microscope might give a feeling of a relatively simple structure.

In reality, understanding the sophisticated molecular activity at the basis of life is arguably one of the greatest current challenges in biology. Joint advances in artificial intelligence, microscopy and structural bioinformatics point to significant breakthroughs in the coming years. However, this requires the development of new theories and techniques that form the core of this project.

We want to devise new tools to visualize, model and understand dynamic intracellular processes.

This usually requires reasoning at multiple spatio-temporal scales. Therefore, different experimental techniques and models are necessary to provide complementary information for the overall understanding of complex processes involving molecular and supra-molecular systems. A clear example is the set of processes related to genome organization and regulation that involve intrinsically disordered protein regions (IDRs), which play an essential role in the cell but still escape current technologies.

Imaging such molecular systems is an intricate problem due to physical barriers such as the diffraction limit in optics or diffusion in live imaging. Only highly noisy and incomplete information can be gathered from techniques such as cryogenic electron microscopy or super-resolution fluorescence microscopy.

By combining carefully designed molecular modeling methods, physics-informed inverse problem solvers and experimental data, we plan to access, in a certified manner, information that is not yet available.

To achieve our goals, we will be assisted by experts in artificial intelligence, cell and molecular biology, physics and optics. The experimental side of the project will rely on two cutting-edge imaging platforms in Toulouse, the METI and the LITC.

This interdisciplinary research project should provide tools for a better understanding of the functional roles of IDRs in genome regulation processes, opening the road to future therapies.

REpresentation Learning for Earth Observation

Principal Investigator(s): Inglada, Jordi – CNES / CESBIO, Dobigeon, Nicolas – INP / IRIT, Fauvel, Mathieu – INRAE / CESBIO, Valero, Silvia – UT3 / CESBIO

Co-Chair(s): Oberlin, Thomas – ISAE / ISAE, Michel, Julien – CNES / CNES, Gürol, Selime – CERFACS

Summary

Recent Earth Observation (EO) systems have opened up new opportunities for land survey systems that provide critical information for climate change monitoring, mitigation, and adaptation. Monitoring Essential Climate and Biodiversity Variables (EVs) provides key information to understand climate, biodiversity and environmental changes. However, retrieving EVs from multi-source data is challenging due to the singularities of EO data, such as indirect observation of interest variables, varying spatial resolution and irregularly sampled time series.

Deep learning (DL) models offer promising solutions to learn complex patterns from huge amounts of data. However, most of the recent models lack physical consistency and interpretability. Furthermore, they are not able to process data with irregular and unaligned sampling, which is common in multi-modal EO.

Training also requires large amounts of labeled data, which are scarce and noisy in EO. Consequently, current models have a restricted usage in large scale EO systems.

This project will develop new self-supervised representation learning methods to produce semantically meaningful probabilistic representations from high-dimensional multi-modal EO data. The originality lies in incorporating prior knowledge from physical models into DL, which should lead to advances in uncertainty estimation and interpretability. The proposed hybrid AI system will blend physical priors and DL to pre-train models that can learn (1) semantically meaningful representations related to EVs and (2) task-agnostic generic embeddings (AI-ready data) that can be used by downstream tasks. The system will process multi-modal data to capture complementary spatio-temporal patterns. Physics-guided DL methods will be designed to condition the decoding of generic embeddings to retrieve and forecast EVs and their uncertainties.

To ensure the continuity of land monitoring, the system will use new data assimilation strategies combining satellite observations with pre-trained model forecasts. Continual learning will be used to update the models in response to new EO data. Non-stationary and long-term trends beyond the temporal range of the initial training will be accounted for. The project raises scientific questions regarding joint probabilistic representation learning, incorporation of physical prior information, efficient use of pre-trained models, and continuous model updating with newly acquired data and new on-orbit sensors.

Combining Polynomial Optimization and Machine Learning: Application to Power System Decision Support Tools

Principal Investigator(s): Magron, Victor – CNRS / LAAS, Panciatici, Patrick – RTE, Lasserre, Jean-Bernard – CNRS / LAAS

Co-Chair(s): Henrion, Didier – CNRS / LAAS, Korda, Milan – CNRS / LAAS, Skomra, Mateusz – CNRS / LAAS, Ruiz, Manuel – RTE, Loho, Georg – University of Twente / University of Twente

Summary

There is an increasing need for efficient methods to approximate values of secure operating conditions for electrical power systems. Indeed, recent and ongoing changes in the European power network, such as the increase in renewable energy sources interfaced by power electronic devices, are bringing up new challenges in terms of power grid security and large-scale stability assessment. The optimal power flow (OPF) problem aims at determining an optimal steady-state operating point for an alternating-current (AC) electric power system in terms of a given objective function, usually the power generation costs or power losses per time unit, subject to both electrotechnical equality constraints and engineering limits. AC-OPF problems can be modeled as nonlinear optimization problems involving polynomials in complex variables. We recently proposed convex relaxations that allowed us to approximate the optimal values of some large-scale AC-OPF instances with thousands of variables.
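As an illustration of the polynomial structure mentioned above, one common textbook way to write an AC-OPF instance is sketched below. This is a generic formulation, not necessarily the one used by the project team, and every symbol is an assumption introduced here: $\mathcal{N}$ is the set of buses, $\mathcal{G} \subseteq \mathcal{N}$ the buses with generators, $V_k$ the complex voltage at bus $k$, $S^g_k$ and $S^d_k$ the complex generated and demanded powers (with $S^g_k = 0$ for $k \notin \mathcal{G}$), $Y$ the bus admittance matrix, and $c_k$ a polynomial generation cost.

```latex
\begin{aligned}
\min_{V,\,S^g}\quad & \sum_{k \in \mathcal{G}} c_k\!\left(\operatorname{Re} S^g_k\right) \\
\text{s.t.}\quad & S^g_k - S^d_k = V_k \sum_{l \in \mathcal{N}} \overline{Y_{kl}}\,\overline{V_l}, && k \in \mathcal{N}, \\
& (V^{\min}_k)^2 \le V_k \overline{V_k} \le (V^{\max}_k)^2, && k \in \mathcal{N}, \\
& P^{\min}_k \le \operatorname{Re} S^g_k \le P^{\max}_k, \quad
  Q^{\min}_k \le \operatorname{Im} S^g_k \le Q^{\max}_k, && k \in \mathcal{G}.
\end{aligned}
```

The power-balance equalities are quadratic in the complex voltages and their conjugates, which is what makes the problem nonconvex and places it within the scope of polynomial optimization and its convex relaxations.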

The goal of this ambitious collaborative project between the academic partner POP from CNRS LAAS and the industrial partner RTE is to combine efficient and accurate polynomial optimization techniques with machine learning (ML) tools to solve AC-OPF instances at global optimality, and to provide decision-making tools for transmission system operators.

The first main research direction is to develop frameworks embedding fast convex optimization algorithms, potentially mixing classical interior-point methods and ML/data-driven schemes.

The second main research direction is to rely on these frameworks to tackle important problems in static and dynamical optimization arising from stability assessment of large power grids and minimization of active power generation with a mixture of continuous and integer variables.

This chair project shall lead us to address both directions by providing fast yet accurate bounds for the underlying optimization problems.

Embeddability and safety assurance of ML-based systems under certification

Physics-Informed Learning Methods for Continental Waters and Marine Risks

Principal Investigator(s): Roustant, Olivier – INSA / IMT, Monnier, Jérôme – INSA / IMT

Co-Chair(s): Baraille, Rémy – SHOM, Bouclier, Robin – INSA / IMT, Garambois, Pierre-André – INRAE / RECOVER, Garnier, Josselin – Ecole Polytechnique / CMAP, Lüthen, Nora – ETH Zürich, Noble, Pascal – INSA / IMT, Sanchez, Eduardo – IRT Saint-Exupéry

Summary

Flooding, whether inland or coastal, is a complex phenomenon that can be described by non-linear differential equations (PDEs, ODEs), but only partially in some situations. Accurately modeling river flows, from low flows (water shortage) to high flows (flooding), as well as marine submersions, is crucial for our societies. Purely physics-based approaches are limited in their completeness and in the running time of the associated computer codes. Purely data-driven methods are complementary but require huge amounts of data. Hybridizing physics-driven and data-driven approaches (hybrid AI) has shown dramatic improvements in idealized contexts. The aim of this project is to investigate and develop hybrid AI algorithms for water-related extreme events (floods, inundations, marine submersions) and their associated risk management. Significant advances in model accuracy and explainability are expected. The research will be guided by challenging real-world cases, using our advanced computational codes to solve a hierarchy of physical models, possibly with their adjoint codes. The databases include in-situ historical measurements and, in some cases, satellite data. The research program is divided into five interconnected axes.

Physics-informed learning methods. Hybridization of two well-known classes of models, neural networks and Gaussian processes, with physical knowledge. Investigation of the resulting surrogate models and data assimilation processes.

Reduced-basis methods. Model reduction based on hybrid PCA-NN-like methods.

Multi-fidelity models. Techniques to account for a hierarchy of computer codes.

Uncertainty quantification. Risk assessment; global sensitivity analysis for model explainability.

Design of experiments. Strategies to create new data from computer codes.

The chair team comprises a panel of researchers and experts from academia and industry, who bring complementary skills in mathematical modeling (PDEs, ODEs, probabilistic), computational sciences and machine learning.

Certifiable Auto-supervised Large Models

Principal Investigator(s): Mamalet, Franck – IRT Saint-Exupéry, Serrurier, Mathieu – UT3 / IRIT

Co-Chair(s): Ducoffe, Mélanie – Airbus, Sengnès, Coralie – INSERM / RESTORE

Summary

The project has a high degree of continuity with the great success of the 3IA ANITI and DEEL program: studying 1-Lipschitz neural networks, a class of robust-by-design neural networks. The team, built in this first round, was composed of mathematicians, computer science researchers, data scientists, and industry experts, and has published several papers in major conferences and journals. They have laid the groundwork for classification with 1-Lipschitz neural networks, covering the theory, the definition of an optimal loss linked to optimal transport, and proofs of robustness, certifiability and explainability.

Additionally, a full library called DEEL-LIP has been developed to train this kind of neural network as classical TensorFlow models.
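To make the 1-Lipschitz property concrete, here is a minimal sketch in pure Python. It does not use or reproduce the DEEL-LIP API; the function names (`spectral_norm`, `lipschitz_layer`) and the example weights are hypothetical. It rescales a weight matrix by its largest singular value, estimated by power iteration, so that the linear map followed by a ReLU cannot increase Euclidean distances between inputs.

```python
import math

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def norm(v):
    """Euclidean norm of a vector."""
    return math.sqrt(sum(c * c for c in v))

def spectral_norm(W, iters=200):
    """Estimate the largest singular value of W by power iteration on W^T W."""
    Wt = [list(col) for col in zip(*W)]
    v = [1.0] * len(W[0])
    for _ in range(iters):
        u = matvec(Wt, matvec(W, v))
        n = norm(u)
        if n == 0.0:
            return 0.0
        v = [c / n for c in u]
    return norm(matvec(W, v))

def lipschitz_layer(W):
    """Return x -> ReLU(W1 x), where W1 is W rescaled to spectral norm <= 1."""
    scale = max(1.0, spectral_norm(W))
    W1 = [[w / scale for w in row] for row in W]
    def layer(x):
        return [max(0.0, z) for z in matvec(W1, x)]
    return layer

def dist(a, b):
    """Euclidean distance between two vectors."""
    return norm([ai - bi for ai, bi in zip(a, b)])

# Hypothetical weights, chosen only for illustration.
W = [[2.0, -1.0, 0.5],
     [0.0, 1.5, 1.0]]
layer = lipschitz_layer(W)
x, y = [1.0, -2.0, 0.5], [0.3, 0.7, -1.2]
# The output distance never exceeds the input distance (up to rounding).
print(dist(layer(x), layer(y)) <= dist(x, y) + 1e-9)
```

Because ReLU is itself 1-Lipschitz, stacking such layers yields a network whose output changes by at most the size of the input perturbation, which is the basis of the robustness certificates mentioned above. Power iteration only estimates the spectral norm, so a production implementation would need a more careful bound.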

In this project, we propose to further investigate 1-Lipschitz neural networks in the scope of self-supervised learning: the objective is to be able to learn large models with unannotated data in several domains (medical/satellite images, time series, natural language processing) while maintaining the guarantees in terms of robustness, certifiability and explainability. Self-supervised learning is a major trend for classical networks, with applications in few-shot learning, semi-supervised learning or as backbones. But, as far as we know, there is no contribution in the literature on self-supervised 1-Lipschitz neural networks.

We propose to tackle the unexplored domain of self-supervised 1-Lipschitz large models with three research axes. In the first axis, we will explore methods for self-supervised learning using an optimal transport loss, to learn from unannotated data while still promoting the robustness of the neural network. In the second, we will investigate more recent and deeper 1-Lipschitz architectures, such as transformers, to enhance the learning capabilities of these networks on very large datasets and their generalization. In the third, we will work to establish the theory and certifiable guarantees for these self-supervised 1-Lipschitz neural networks. For industrial safety-critical applications, we will develop a set of pre-trained 1-Lipschitz networks for various domains, including satellite images, time series, language processing, and medical imaging, where data quantity and annotation are crucial.

Hybridizing lEarning, seaRch and combinatorial Optimization for Industrial deCision-making

Principal Investigator(s): Guillaume, Romain – UT2J / IRIT, Techteil-Koenigsbuch, Florent – Airbus, Gerchinovitz, Sébastien – IRT Saint-Exupéry, Thiebaux, Sylvie – UT / LAAS

Co-Chair(s): Artigues, Christian – CNRS / LAAS, Cesari, Tommaso – University of Ottawa, Fargier, Hélène – CNRS / IRIT, Poveda, Guillaume – Airbus, Roussel, Stéphanie – ONERA / ONERA

Summary

Decision-making problems are ubiquitous in industry, from production optimization to in-service product operation management, including internal project and resource optimization. In the last decades, many data-driven (e.g. Deep Reinforcement Learning) and model-based (e.g. AI Planning, Constraint Programming) approaches have been separately investigated to solve those problems. While the former often need a huge amount of data, which is scarcely available in many industrial problems, the latter are not always suitable for problems that are complex to model with accuracy. Also, both approaches fail to solve large problems within reasonable time and computational resources, especially in the presence of uncertainty, which significantly amplifies the combinatorial explosion of the solution space. This chair will investigate tight hybridization techniques between data-driven and model-based approaches to decision-making by targeting three main objectives: scalability, robustness, and industrial use case representativeness. It aims to open the door to optimized, reactive and robust decision-making in large and complex industrial problem scenarios, while significantly lowering the computation and data cost of current solvers used in industry. The chair will bring together academic researchers from diverse institutions (LAAS, IRIT, IRT, ONERA, University of Ottawa) and scientific fields, including combinatorial optimization, search, and machine learning.

Industrial partners from the aerospace and automotive industries (Airbus, Liebherr, Vitesco) will second engineers and provide challenging use cases where hybrid methods are expected to reduce operational costs due to uncertainty, model inaccuracy or solution suboptimality. This research will also benefit the health sector where we will investigate with Oncopole how to optimally schedule radiotherapy treatments for cancer patients under uncertain medical pathway appointments, with a view to improving remission chances.

Starting chairs

Uncertainty Quantification for Physical and Artificial Intelligence systems

Facing Low Resource Natural Language Processing

Mathematical Approaches for Deep Learning, representation Learning, And high-Dimensional Statistics

LArge Tensors for daTa analysIs and maChine lEarning

Hybrid, Interpretable Machine Learning

Advanced chairs

Human-Centered AI for Argument-based Deliberation

Frugal Reinforcement Learning for Stochastic Networks

Center for Moral Artificial Intelligence

Disaster risk prediction and multivariate anomaly detection

Innovations in the Wake of COVID-19

BRAIN

AI for Smart and Sustainable Air Traffic Management and Air Mobility

Advances in majorization-minimization algorithms for optimization with non-quadratic loss functions

Designing Artificial Social Reasoners

Machine Learning for Sustainable International Development (ML4SID)

Guaranteed and frugal deep learning

eXplainability science in artifiCIal intelligENCE

Evolution of galaxies using Machine Learning

Reinforcement Learning on a Diet

User-Centered Interactive Machine Learning for 3D Point Cloud Analysis

Anomaly Detection and Diagnosis

International attractivity chairs

Hybridizing AI and Large-scale Simulations for Engineering Design

Hybrid Policy Optimization for Safe and Efficient Robotic Manipulation and Locomotion

Synergy chairs

Trust and Responsibility in Artificial Intelligence

EXPLainablE and physics-informed Ai for Regional weaTHer prediction

Advanced Eddy-resolving Global and Integrated modeling using machine learning for accurate climate predictions

Cobots with Conversation, Cognition and PerceptiOn

Certified AI for Understanding Intracellular Dynamics

Industrial chairs

REpresentation Learning for Earth Observation

Combining Polynomial Optimization and Machine Learning: Application to Power System Decision Support Tools

Embeddability and safety assurance of ML-based systems under certification

Physics-Informed Learning Methods for Continental Waters and Marine Risks

Certifiable Auto-supervised Large Models

Hybridizing lEarning, seaRch and combinatorial Optimization for Industrial deCision-making
