Tom Dupré la Tour

About me

I am a research scientist at OpenAI, working on language model interpretability for AI safety. Before that, I was a postdoc in the Gallant Lab at UC Berkeley. I defended my PhD in 2018 in the Image, Data, Signal department at Telecom ParisTech in France, supervised by Alexandre Gramfort and Yves Grenier. I graduated from École polytechnique in 2013 and from EPFL in 2015. More details can be found in my resume.

My work focuses on developing machine learning and signal processing methods for interpreting human and animal brain recordings (electrocorticography, magnetoencephalography, functional magnetic resonance imaging, neuron spikes) and silicon brain recordings (large language model activations). I was also a core developer of scikit-learn from 2015 to 2022.

News

Aug. 2023 - Joined OpenAI
Sep. 2021 - Presented a tutorial at the conference on Cognitive Computational Neuroscience
Jun. 2019 - Won PhD thesis award - 1st prize in Signal, Image and Vision, from the Club EEA, GRETSI and GdR ISIS
Feb. 2019 - Joined the Gallant Lab at UC Berkeley as a postdoc
Nov. 2018 - Won PhD student award - 1st prize at Université Paris-Saclay STIC doctoral school
Nov. 2018 - Defended my PhD - Non-linear models for neurophysiological time series

Blog posts

March 2018 - Dask-Distributed and Joblib

Publications

2025

Persona features control emergent misalignment

Miles Wang*, Tom Dupré la Tour*, Olivia Watkins*, Alex Makelov*, Ryan A. Chi*, Samuel Miserendino, Johannes Heidecke, Tejal Patwardhan, Dan Mossing*

arXiv preprint, 2025

[pdf] [arxiv] [blog]

Individual differences shape conceptual representation in the brain

Matteo Visconti di Oleggio Castello, Tom Dupré la Tour, Jack L. Gallant

bioRxiv preprint, 2025

[pdf] [biorxiv]

The voxelwise encoding model framework: a tutorial introduction to fitting encoding models to fMRI data

Tom Dupré la Tour*, Matteo Visconti di Oleggio Castello*, Jack L. Gallant

Imaging Neuroscience, 2025

[pdf]

2024

Scaling and evaluating sparse autoencoders

Leo Gao*, Tom Dupré la Tour*, Henk Tillman*, Gabriel Goh, Rajan Troll, Alec Radford, Ilya Sutskever, Jan Leike, Jeffrey Wu*

ICLR, 2025

[pdf] [arxiv] [blog] [code]

The cortical representation of language timescales is shared between reading and listening

Catherine Chen, Tom Dupré la Tour, Jack L. Gallant, Dan Klein, Fatma Deniz

Communications Biology, 2024

[pdf] [biorxiv]

A biologically-inspired hierarchical convolutional energy model predicts V4 responses to natural videos

Michael Oliver*, Michele Winter*, Tom Dupré la Tour*, Michael Eickenberg*, Jack L. Gallant

bioRxiv preprint, 2024

[pdf]

Transformer debugger

Dan Mossing, Steven Bills, Henk Tillman, Tom Dupré la Tour, Nick Cammarata, Leo Gao, Joshua Achiam, Catherine Yeh, Jan Leike, Jeff Wu, William Saunders

GitHub repository, 2024

[code]

2023

Model connectivity: leveraging the power of encoding models to overcome the limitations of functional connectivity

Emily X. Meschke*, Matteo Visconti di Oleggio Castello*, Tom Dupré la Tour, Jack L. Gallant

bioRxiv preprint, 2023

[pdf] [biorxiv]

Semantic representations during language comprehension are affected by context

Fatma Deniz, Christine Tseng, Leila Wehbe, Tom Dupré la Tour, Jack L. Gallant

Journal of Neuroscience, 2023

[pdf] [biorxiv]

2022

Benchopt: Reproducible, efficient and collaborative optimization benchmarks

Thomas Moreau, Mathurin Massias, Alexandre Gramfort, Pierre Ablin, Pierre-Antoine Bannier, Benjamin Charlier, Mathieu Dagréou, Tom Dupré la Tour, Ghislain Durif, Cassio F. Dantas, Quentin Klopfenstein, Johan Larsson, En Lai, Tanguy Lefort, Benoit Malézieux, Badr Moufad, Binh T. Nguyen, Alain Rakotomamonjy, Zaccharie Ramzi, Joseph Salmon, Samuel Vaiter

NeurIPS, 2022

[pdf] [arxiv] [poster] [code]

Feature-space selection with banded ridge regression

Tom Dupré la Tour, Michael Eickenberg, Anwar O. Nunez-Elizalde, Jack L. Gallant

NeuroImage, 2022

[pdf] [biorxiv] [code]

2021

A finer mapping of convolutional neural network layers to the visual cortex

Tom Dupré la Tour, Michael Lu, Michael Eickenberg, Jack L. Gallant

NeurIPS workshop SVRHM, 2021

[pdf]

2019

The strength of alpha-beta oscillatory coupling predicts motor timing precision

Laetitia Grabot, Tadeusz W. Kononowicz, Tom Dupré la Tour, Alexandre Gramfort, Valérie Doyère, Virginie van Wassenhove

Journal of Neuroscience, 2019

[pdf]

2018

Non-linear models for neurophysiological time series

Tom Dupré la Tour

PhD Thesis, 2018
PhD student award - 1st prize at Université Paris-Saclay STIC doctoral school [link]
PhD thesis award - 1st prize in Signal, Image and Vision, from the Club EEA, GRETSI and GdR ISIS [link] [news]

[pdf] [book_pdf] [tel] [pastel] [slides]

Multivariate convolutional sparse coding for electromagnetic brain signals

Tom Dupré la Tour*, Thomas Moreau*, Mainak Jas, Alexandre Gramfort

NeurIPS, 2018

[pdf] [arxiv] [poster] [code]

Driver estimation in non-linear autoregressive models

Tom Dupré la Tour, Yves Grenier, Alexandre Gramfort

ICASSP, 2018

[pdf] [poster]