Post-doc proposition ANITI, MADLADS CHAIR – 13/11/2024
1-Context
The postdoc candidate is applying to work with the following team of researchers
- CHHAIBI Reda, Professor at Université Côte d’Azur
- GAMBOA Fabrice, Professor at Université Toulouse II,
- LAGNOUX Agnès, Associate professor at Université Toulouse II,
- PELLEGRINI Clément, Associate professor at Université of Toulouse III.
The team specializes in the mathematics of artificial intelligence, and the focus on the project at hand is the use of innovative tools such as Random Matrix Theory.
We are interested in both proving new theorems in the field, for publication in optimization, probability or statistics journals, in practical implementations, for publication in AI conference proceedings.
2-Some relevant literature on RMT and AI :
RMT is a branch of mathematics that studies the statistical properties of large matrices with random entries. It has found applications in a wide range of fields, including physics, finance, and statistics.
Relevance for high dimensional statistics. In the context of covariance matrices, RMT can help us better understand the behavior of these matrices when the dimensionality of the data is high while the number of samples is small. One key result from RMT is the Marchenko-Pastur (MP) theorem, which describes the eigenvalue distribution of large covariance matrices with random entries. In particular, the MP theorem shows that when the dimensionality of the data is much larger than the number of samples, the eigenvalue spectrum of the covariance matrix converges to a limiting distribution that depends only on the ratio of the dimensionality to the number of samples. This limiting distribution is the MP distribution, which has a well-defined shape and can be used to estimate the noise level in the eigenvalues of the covariance matrix.
Relevant applications and their state of the art.
First, in the line of Pennington et al.’s work [8], RMT is a natural tool in order to control the spectra of the Jacobian of Neural Networks at the initial- ization. This approach is a more rich and quantitative version than the seminal paper by Glorot et al. [7] which was mainly based on variance considerations. In [6], we consider a computational solution to the metamodel built from RMT. We use a homotopy method based on the chaining of basins of attraction for the Newton-Raphson algorithm. Not only is the result guaranteed to be correct – unlike the solution previously proposed by Pennington et al., but the method is also very fast.
Second, it is well-known that PCA (Principal Component Analysis) is the initial step of many statistical methods, either for the purpose of dimensionality reduction, or for the detection of a signal of small rank. And PCA is about diagonalizing large covariance matrices, hence the relevance of RMT. In fact, there is a flurry of theoretical works analyzing “signal plus noise” models, where one is interested in the spectrum of a large matrix deformed by some noise. Existing mathematical results are under quite general hypotheses, and describe refined phenomenons such as the celebrated BBP (Baik-Benarous-P ́ech ́e) phase transition [1]. In comparison, there are only a handful of papers giving practical computational solutions which leverage RMT to estimate the spectra of large covariance matrices. We can mention El Karoui’s work [4] as a prime example.
Finally, in the context of kernel learning, it is now common to use random features whose behavior is controlled thanks to RMT. The most known example is certainly that of Random Fourier Features which is accelerated by performing products of structured matrices as in [5].
3-Research topics
Following the three paragraphs describing each a different topic, the candidate will join projects in
- the use of Free Probability Theory for studying Jacobians of Neural Net- works. Basically, we shall aim for the natural follow-up to the paper [6];
- the problem of Free Deconvolution and cleaning large covariance matrices in regards to [2, 4];
- the use of RMT when generating random features for kernel learning as in [3, 5].
References
[1] Baik, Jinho, Gérard Ben Arous, and Sandrine Péché. ”Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices.” (2005): 1643-1697.
[2] Bun, Joël, Jean-Philippe Bouchaud, and Marc Potters. ”Cleaning large correlation matrices: tools from random matrix theory.” Physics Reports 666 (2017): 1-109.
[3] Demni, Nizar, and Hachem Kadri. ”Orthogonal Random Features: Explicit Forms and Sharp Inequalities.” arXiv preprint arXiv:2310.07370 (2023).
[4] El Karoui N. Spectrum estimation for large dimensional covariance matrices using random matrix theory (2008): 2757-2790.
[5] Le, Quoc, Tam ́as Sarl ́os, and Alex Smola. ”Fastfood-approximating kernel expansions in loglinear time.” Proceedings of the international conference on machine learning. Vol. 85. No. 8. 2013.
[6] Chhaibi, Reda, Tariq Daouda, and Ezechiel Kahn. ”Free Probability for predicting the performance of feed-forward fully connected neural networks.” Advances in Neural Information Processing Systems 35 (2022): 2439-2450.
[7] Glorot, Xavier, and Yoshua Bengio. ”Understanding the difficulty of train- ing deep feedforward neural networks.” Proceedings of the thirteenth inter- national conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, 2010.
[8] Pennington J., Schoenholz S., and Ganguli S. The emergence of spectral universality in deep networks. In International Conference on Artificial Intel- ligence and Statistics (2018): 1924–1932.
Contract : post-doc
Duration : 24 months
Salary : according to experience
Location : Toulouse, France. Institut de mathématiques
Advisors :
CHHAIBI Reda, GMABOA Fabrice, LAGNOUX Agnès, PELLIGRINI Clément
Application
Formal applications should include detailed CV, a motivation letter and at least one recommendation letter, directly sent by the supporter to the aforementioned address.
Samples of published research by the candidate will be a plus.
Applications should be send by email to: clement.pellegrini@univ-toulouse. fr and reda.chhaibi@univ-cotedazur.fr
Deadline for application is the 29th of May 2025. Position begins on 1st of September 2025 (2 years).
Two engineering positions on conversational agents for Audio Mobility 2030 – 9/10/2024
These positions on conversational agents are proposed in the framework of the Audio Mobility 2030 (AM2030) project, which started in April 2023. AM2030 aims at enabling car manufacturers to have their own in-car audio application, regardless of the operating system. They will be able to deploy a global audio experience and offer the best content and proactive services to drivers. It is positioned as a true road companion that will help consumers adopt eco-responsible behaviors: vehicle self-diagnosis and maintenance reports, advice on driving and the use of on-board equipment.
Project partners: ETX Studio (Lead), Continental Automotive FRANCE SAS, Université de Toulouse – ANITI, École Polytechnique de Paris.
ANITI’s role in the project is related to working on human-computer interactions, in particular on natural language understanding. This will include a conversational model that can exploit conversational structure as well as content provided by modern transformer-based models. The model will learn constraints on the user’s preferences, from the conversation and from his previous choices.
The conversational assistant will go considerably beyond the art of current finite state dialogue systems but offering a transparency, guarantees and explainability that large transformer models by themselves cannot. It will interact with voice based components as well as a recommendation model for actions based on the information acquired by the conversational assistant.
Required skills
Applicants should have good programming skills. English communication skills are also required.
Contract : post-doc
Duration : 12 months
Salary : according to experience
Location : Computer Science Research Institute of Toulouse (IRIT), Toulouse, France
Advisor : Nicolas Asher
Application
Formal applications should include detailed CV, a motivation letter and reference letters.
Samples of published research by the candidate will be a plus.
Applications should be send by email to: asher@irit.fr
Automatic speech recognition for an in-car voice assistant – 2/10/2024
This PostDoc position is proposed in the framework of the Audio Mobility 2030 (AM2030) project, which started in April 2023. AM2030 aims at enabling car manufacturers to have their own in-car audio application, regardless of the operating system. They will be able to deploy a global audio experience and offer the best content and proactive services to drivers. It is positioned as a true road companion that will help consumers adopt eco-responsible behaviors: vehicle self-diagnosis and maintenance reports, advice on driving and the use of on-board equipment.
Project partners: ETX Studio (Lead), Continental Automotive FRANCE SAS, ANITI, Université de Toulouse, École Polytechnique de Paris.
ANITI’s role in the project is related to working on human-computer interactions, in particular on natural language understanding. The role of the hired PostDoc researcher will be to work more specifically on automatic speech (ASR, Speech-To-Text) in a noisy environment (the interior of a car).
The envisaged line of research focuses on the use of modern text-to-speech systems to generate synthetic speech data. An initial study conducted on the Google Speech Commands dataset demonstrated the feasibility of using 100% synthetic data to train a classifier satisfactorily. This study also revealed that it is still possible to easily distinguish real speech from synthetic speech using representations derived from self-supervised models such as WavLM. We aim to continue this characterization by identifying the dimensions involved in this distinction. Additionally, we seek to optimally align the distributions of real and synthetic speech in the space of self-supervised representations, using GANs or flow matching techniques.
This research will be conducted in connection
with the two other aspects treated by ANITI: 1) the study of the conversational structures between the driver and the assistant and their semantic interpretation, 2) the detection of
emotions and states of mind based on speech and transcription cues.
The hired PostDoc will be based at the Computer Science Research Institute of Toulouse (IRIT, located in the campus of the Toulouse III Paul Sabatier University.
Required skills
Applicants should have a PhD in machine learning, ideally in speech/natural language processing.
Good programming and English communication skills are also required.
Contract : post-doc
Duration : 14 months
Salary : according to experience
Location : Computer Science Research Institute of Toulouse (IRIT), Toulouse, France
Advisor : Thomas Pellegrini
Application
Formal applications should include detailed CV, a motivation letter and reference letters.
Samples of published research by the candidate will be a plus.
Applications should be send by email to Thomas Pelligrini