I am a research scientist at OpenAI, working on the interpretability of language models for AI safety. Before that, I was a postdoc at UC Berkeley in the Gallant Lab. I defended my Ph.D. in 2018 in the Image, Data, Signal department at Telecom ParisTech in France, supervised by Alexandre Gramfort and Yves Grenier. I graduated from Ecole polytechnique in 2013 and from EPFL in 2015. More details can be found in my resume.
My work focuses on developing machine learning and signal processing methods for interpreting human and animal brain recordings (electrocorticography, magnetoencephalography, functional magnetic resonance imaging, neuron spikes) and silicon brain recordings (large language model activations). I was also a core developer of scikit-learn from 2015 to 2022.