Loucas Pillaud-Vivien

Briefly

Contact

Email loucas.pillaud-vivien [at] enpc [dot] fr
Address Bureau 215, 6 et 8 avenue Blaise Pascal, Cité Descartes — Cermics
Scholar Google Scholar

Research Interests

My research sits at the intersection of optimization, probability and statistics, with a focus on understanding the mathematical mechanisms behind learning algorithms. I am particularly interested in the training dynamics of neural networks: how does gradient descent navigate non-convex landscapes, and why does it end up finding solutions that generalize? Since we barely understand gradient flow in these settings, I tend to start there before worrying about discrete updates.

A recurring theme in my work is the role of structure — geometric, algebraic, stochastic — in shaping the behavior of algorithms. This includes the implicit bias of overparametrized models (why does SGD favor simple solutions without being told to?), the interplay between architecture and the geometry of the loss landscape, and how the noise inherent to stochastic methods can actually help rather than hinder learning. I also have a lasting interest in kernel methods and spectral approaches, which offer a rich functional-analytic framework for these questions.

More recently, I have been working on single- and multi-index models as a natural testbed for feature learning in neural networks, and on variational inference methods. I enjoy problems where continuous-time dynamics, PDEs and stochastic differential equations shed light on fundamentally discrete algorithms.

The Research Web

I built a graph of my papers. Why? Unclear. You could just read the titles, honestly. But there it is — nodes, edges, little colored circles floating in the void — and I'm oddly pleased with myself. Perhaps that's what research is: the fine art of spending unreasonable amounts of time on things nobody asked for, and calling it a contribution.

There's also a co-author graph, which is even more pointless, and therefore even more dear to me. It's just a map of people I've been lucky enough to be confused with — over blackboards, bad coffee, trains to conferences, long emails about notation. The theorems came later, almost by accident. Conversations come first.

Selected Publications

For a complete list, rendez-vous to my Google Scholar page.

See all papers →

Selected Presentations

Label noise (stochastic) gradient descent implicitly solves the Lasso for quadratic parametrisation Slides
COLT. July 2022.
Some results on the role of stochasticity in learning algorithms Slides
Birs-Banff workshop. May 2022.
Some results on the role of stochasticity in learning algorithms Slides
One world ML seminar. March 2022.
Some results on the role of stochasticity in learning algorithms Slides
NYU-Flatiron. January 2022.
Model order reduction by spectral gap optimization Slides
CEMRACS. August 2021.
Implicit Bias of SGD for Diagonal Linear Networks: a Provable Benefit of Stochasticity Slides
Theory of Deep Learning, EPFL. June 2021.
Two results on Stochastic gradient descent in Hilbert spaces for Machine Learning problems Slides
Cermics seminar, Ecole des Ponts. November 2019.
Statistical Estimation of the Poincaré constant and Application to Sampling Multimodal Distributions Slides
Sierra group seminar, INRIA. April 2019.
Statistical Optimality of Stochastic Gradient Descent through Multiple Passes Slides Poster
Optimization and Statistical Learning, Les Houches. March 2019.
Langevin dynamics and applications to Machine Learning Slides
Sierra group seminar, INRIA. February 2019.
Comparing Dynamics: Deep Neural Networks versus Glassy systems Slides
Statistical Machine Learning in Paris (SMILE seminar). December 2018.
Statistical Optimality of Stochastic Gradient Descent through Multiple Passes Slides Poster
NeurIPS. December 2018.
Exponential convergence of testing error for stochastic gradient methods Slides Video Poster
COLT. July 2018.