Welcome to my academic website! I am a Ph.D. student in Statistics and Data Science at Yale University, supervised by Theodor Misiakiewicz. I completed my Master’s degree in Mathematics and Statistics at McGill University, where I had the privilege of working under the co-supervision of Courtney Paquette and Elliot Paquette. Prior to this, I earned my B.Sc. in Mathematics and Computer Science with First-Class Honours, also from McGill University.
My academic interests revolve around the study of large random systems and their role in modern data science and machine learning. I approach these problems from a theoretical perspective, drawing on tools from random matrix theory, high-dimensional probability, optimization, and statistics to understand the behavior of complex high-dimensional models. I am particularly interested in how ideas from probability can shed light on learning algorithms and their fundamental limits. For a more detailed description of my research, please see the “research” tab.
Beyond my academic pursuits, I like to engage in physical activities. Snorkeling, kayaking, and skiing are among my favorite pastimes. In an alternate reality, I might have been known as a “gym bro.”
Preprint
Statistical-Computational Trade-offs in Learning Multi-Index Models via Harmonic Analysis
We study the problem of learning multi-index models (MIMs), where the label depends on the input $x \in \mathbb{R}^d$ only through an unknown $s$-dimensional projection $W_*^\top x \in \mathbb{R}^s$. Exploiting the equivariance of this problem under the orthogonal group $O_d$, we obtain a sharp harmonic-analytic characterization of the learning complexity for MIMs with spherically symmetric inputs, which refines and generalizes previous Gaussian-specific analyses. Specifically, we derive statistical and computational complexity lower bounds within the Statistical Query (SQ) and Low-Degree Polynomial (LDP) frameworks. These bounds decompose naturally across spherical harmonic subspaces. Guided by this decomposition, we construct a family of spectral algorithms based on harmonic tensor unfolding that sequentially recover the latent directions and (nearly) achieve these SQ and LDP lower bounds. Depending on the choice of harmonic degree sequence, these estimators can realize a broad range of trade-offs between sample and runtime complexity. From a technical standpoint, our results build on the semisimple decomposition of the $O_d$-action on $L^2(S^{d-1})$ and the intertwining isomorphism between spherical harmonics and traceless symmetric tensors.
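In schematic form, and with purely illustrative notation (the link function $g_*$ and the additive noise below are placeholders; the precise assumptions are stated in the paper), the setting is

$$
y_i \;=\; g_*\big(W_*^\top x_i\big) + \varepsilon_i, \qquad W_* \in \mathbb{R}^{d \times s}, \quad x_i \in \mathbb{R}^d \text{ spherically symmetric}, \quad i = 1, \dots, n,
$$

and the goal is to recover the $s$-dimensional subspace spanned by the columns of $W_*$ from the samples $(x_i, y_i)$.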
@article{latourelle2026statisticalcomputationaltradeoffslearningmultiindex,
  title         = {Statistical-Computational Trade-offs in Learning Multi-Index Models via Harmonic Analysis},
  author        = {Latourelle-Vigeant, Hugo and Misiakiewicz, Theodor},
  year          = {2026},
  eprint        = {2602.09959},
  archiveprefix = {arXiv},
  primaryclass  = {math.ST},
  url           = {https://arxiv.org/abs/2602.09959},
}
Journal
Dyson Equation for Correlated Linearizations and Test Error of Random Features Regression
This paper develops a theory of the Dyson equation for correlated linearizations and uses it to derive an asymptotic deterministic equivalent for the test error of random features regression. The theory for the correlated Dyson equation covers existence and uniqueness, spectral support bounds, and stability properties, and it is new in that it yields deterministic equivalents for pseudo-resolvents of a class of linearizations with correlated entries. As an application, this theory gives a deterministic equivalent of the test error of random features ridge regression in a proportional scaling regime, conditioned on both the training and test datasets.
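As a rough sketch of the application (the feature normalization and the activation $\sigma$ below are illustrative rather than the exact setup of the paper), random features ridge regression fits

$$
\hat a_\lambda \;=\; \operatorname*{arg\,min}_{a \in \mathbb{R}^m} \; \frac{1}{n} \sum_{i=1}^{n} \big( y_i - a^\top \sigma(W x_i) \big)^2 \;+\; \lambda \|a\|_2^2,
$$

where $W \in \mathbb{R}^{m \times d}$ is a fixed random weight matrix, and the quantity of interest is the test error

$$
\mathcal{R}_{\mathrm{test}} \;=\; \frac{1}{\tilde n} \sum_{j=1}^{\tilde n} \big( \tilde y_j - \hat a_\lambda^\top \sigma(W \tilde x_j) \big)^2
$$

on a held-out set $(\tilde x_j, \tilde y_j)_{j \le \tilde n}$. The deterministic equivalent is a deterministic expression, computed from the conditioned-on datasets and the solution of the correlated Dyson equation, that $\mathcal{R}_{\mathrm{test}}$ approaches in the proportional regime where $n$, $d$, and $m$ grow at comparable rates.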
@article{latourelle2026dyson,
  title     = {Dyson Equation for Correlated Linearizations and Test Error of Random Features Regression},
  author    = {Latourelle-Vigeant, Hugo and Paquette, Elliot},
  year      = {2026},
  journal   = {Random Matrices: Theory and Applications},
  volume    = {15},
  number    = {01},
  pages     = {2550026},
  publisher = {World Scientific},
  article   = {https://www.worldscientific.com/doi/epdf/10.1142/S2010326325500261},
}
Thesis
The matrix Dyson equation for machine learning: Correlated linearizations and the test error in random features regression
Contemporary machine learning models, particularly deep learning models, are frequently trained on large datasets within high-dimensional feature spaces, which presents challenges for traditional analytical approaches. Notably, the effective generalization of highly overparameterized models contradicts conventional statistical wisdom. Furthermore, the presence of non-linear activations in artificial neural networks adds complexity to their analysis. To simplify theoretical analysis, it is often assumed that the training data are sampled from an unstructured distribution. While such analyses offer insights into certain aspects of machine learning, they fall short in elucidating how neural networks extract information from the structure of the data, which is crucial to their success in real-world applications.

Fortunately, random matrix theory has emerged as a valuable tool for theoretically understanding certain machine learning procedures. Various techniques have been employed to explore large random matrices through asymptotic deterministic equivalents. One such approach involves substituting the random resolvent associated with a large random matrix with the solution of a deterministic fixed-point equation known as the matrix Dyson equation. Another effective technique, known as the linearization trick, involves embedding a matrix expression into a larger random matrix, termed a linear matrix pencil, with a simplified correlation structure.

In this thesis, we extend the matrix Dyson equation framework to derive an anisotropic global law for a broad class of pseudo-resolvents with general correlation structures. This extension enables the analysis of spectral properties of a wide range of random matrices via a simpler, deterministic solution of the matrix Dyson equation. Through the development of this theory, we address critical aspects such as existence and uniqueness, spectral support bounds, and stability properties. These considerations are essential for constructing deterministic equivalents for pseudo-resolvents of a class of correlated linear pencils.

Leveraging this theoretical framework, we provide an asymptotically exact deterministic expression for the empirical test error of random features ridge regression. The random features model, characterized by its non-linear activation function and potential for overparameterization, emerges as a powerful model for studying phenomena observed in real-life machine learning models, such as multiple descent and implicit regularization. Our exact expression facilitates a precise characterization of the implicit regularization of the model and unveils connections between random features regression and closely related kernel methods. Since we make no particular assumptions about the distribution of the data and response variable, our work represents a significant step towards understanding how neural networks exploit specific data structures.
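For orientation, the matrix Dyson equation in its standard form (for a self-adjoint random matrix $H$ with expectation $A = \mathbb{E}[H]$ and self-energy operator $\mathcal{S}[R] = \mathbb{E}[(H - A) R (H - A)]$; the thesis extends this mechanism to pseudo-resolvents of correlated linear pencils) reads

$$
I + \big( zI - A + \mathcal{S}[M(z)] \big) M(z) \;=\; 0, \qquad \operatorname{Im} M(z) \succ 0 \ \text{ whenever } \ \operatorname{Im} z > 0,
$$

and its unique solution $M(z)$ plays the role of a deterministic equivalent for the random resolvent $(H - zI)^{-1}$.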
@mastersthesis{latourelle2024matrix,
  title     = {The matrix Dyson equation for machine learning: Correlated linearizations and the test error in random features regression},
  author    = {Latourelle-Vigeant, Hugo},
  year      = {2024},
  publisher = {McGill University},
  eprint    = {https://escholarship.mcgill.ca/concern/theses/m900p100w?locale=en},
  article   = {https://escholarship.mcgill.ca/concern/theses/m900p100w?locale=en},
}