Valentino Maiorca

I’m interested in how the semantics of data shape the latent geometry of neural networks and enable information transfer between them.

I study how to act on this shared geometry, from aligning representational spaces to steering them toward task-relevant properties. The goal is to understand better what models learn and how to control, transfer, or repurpose that knowledge.

I’m always open to collaborations, discussions, and new ideas, so feel free to contact me!

Full CV available here.

selected publications

NeurIPS
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss

Pau Rodriguez, Michael Klein, Eleonora Gualdoni, Valentino Maiorca, Arno Blaas, Luca Zappella, Marco Cuturi, and Xavier Suau

In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025

TL;DR: We introduce LinEAS, a method for controlling generative models by learning affine transformations on internal activations using optimal transport theory. Training end-to-end across all layers with just 32 unpaired samples and sparse regularization for automatic neuron selection, LinEAS achieves effective toxicity mitigation in LLMs and style control in text-to-image models without retraining

arXiv Bib
@inproceedings{lineas, title = {Lin{EAS}: End-to-end Learning of Activation Steering with a Distributional Loss}, author = {Rodriguez, Pau and Klein, Michael and Gualdoni, Eleonora and Maiorca, Valentino and Blaas, Arno and Zappella, Luca and Cuturi, Marco and Suau, Xavier}, booktitle = {The Thirty-ninth Annual Conference on Neural Information Processing Systems}, year = {2025}, url = {https://openreview.net/forum?id=EBONa3tT3K}, }
NeurIPS spotlight
Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Lorenzo Basile, Valentino Maiorca, Diego Doimo, Francesco Locatello, and Alberto Cazzaniga

In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025

TL;DR: We use Simultaneous Orthogonal Matching Pursuit to identify attention heads specialized in narrow semantic domains (colors, countries, toxicity) in large language and vision-language models. Intervening on as few as 1% of heads enables bidirectional concept control—suppressing toxic content by 34-51% or enhancing target attributes—without any training

Bib PDF Blog Code
@inproceedings{headpursuit, title = {Head Pursuit: Probing Attention Specialization in Multimodal Transformers}, author = {Basile, Lorenzo and Maiorca, Valentino and Doimo, Diego and Locatello, Francesco and Cazzaniga, Alberto}, booktitle = {The Thirty-ninth Annual Conference on Neural Information Processing Systems}, year = {2025}, url = {https://neurips.cc/virtual/2025/poster/117597}, }
TMLR
ResiDual Transformer Alignment with Spectral Decomposition

Lorenzo Basile^*, Valentino Maiorca^*, Luca Bortolussi, Emanuele Rodolà, and Francesco Locatello

TMLR, 2025

TL;DR: We discover that attention head representations in vision transformers lie on low-dimensional manifolds where principal components encode specialized semantics (letters, locations, animals, etc.). By selectively amplifying task-relevant principal components through learned anisotropic scaling (ResiDual), we achieve fine-tuning level performance with up to 4 orders of magnitude fewer parameters than full fine-tuning.

arXiv Bib Code
@article{basile2025residual, title = {ResiDual Transformer Alignment with Spectral Decomposition}, author = {Basile, Lorenzo and Maiorca, Valentino and Bortolussi, Luca and Rodol{\`a}, Emanuele and Locatello, Francesco}, journal = {TMLR}, year = {2025}, url = {https://openreview.net/forum?id=z37LCgSIzI}, }
ICLR oral
Relative representations enable zero-shot latent space communication

Luca Moschella^*, Valentino Maiorca^*, Marco Fumero, Antonio Norelli, Francesco Locatello, and Emanuele Rodola

In International Conference on Learning Representations, 2023

TL;DR: We introduce relative representations that make neural network latent spaces invariant to training stochasticity by encoding data points relative to anchor samples using cosine similarity. This enables zero-shot model stitching across different random seeds, architectures, languages, and datasets without any training.

arXiv Bib Code
@inproceedings{moschella2023relative, title = {Relative representations enable zero-shot latent space communication}, author = {Moschella, Luca and Maiorca, Valentino and Fumero, Marco and Norelli, Antonio and Locatello, Francesco and Rodola, Emanuele}, booktitle = {International Conference on Learning Representations}, year = {2023}, url = {https://openreview.net/forum?id=SrC-nwieGJ}, }
NeurIPS
Asif: Coupled data turns unimodal models to multimodal without training

Antonio Norelli, Marco Fumero, Valentino Maiorca, Luca Moschella, Emanuele Rodola, and Francesco Locatello

In Advances in Neural Information Processing Systems, 2023

TL;DR: ASIF creates multimodal models without any training by using relative representations computed from frozen pre-trained unimodal encoders and a small collection of image-text pairs. This training-free approach achieves competitive zero-shot classification with 250× less data than CLIP, while providing built-in interpretability.

arXiv Bib Code
@inproceedings{norelli2023asif, title = {Asif: Coupled data turns unimodal models to multimodal without training}, author = {Norelli, Antonio and Fumero, Marco and Maiorca, Valentino and Moschella, Luca and Rodola, Emanuele and Locatello, Francesco}, booktitle = {Advances in Neural Information Processing Systems}, year = {2023}, url = {https://openreview.net/forum?id=XjOj3ZmWEl}, }
NeurIPS
Latent Space Translation via Semantic Alignment

Valentino Maiorca^*, Luca Moschella^*, Antonio Norelli, Marco Fumero, Francesco Locatello, and Emanuele Rodolà

In Advances in Neural Information Processing Systems, 2023

TL;DR: We enable zero-shot stitching of independently trained encoders and decoders by estimating simple transformations (orthogonal via Procrustes analysis) between their latent spaces using semantically aligned anchor points. This works across architectures, domains, and even modalities without requiring training on relative representations.

arXiv Bib
@inproceedings{maiorca2023latent, title = {Latent Space Translation via Semantic Alignment}, author = {Maiorca, Valentino and Moschella, Luca and Norelli, Antonio and Fumero, Marco and Locatello, Francesco and Rodol{\`a}, Emanuele}, booktitle = {Advances in Neural Information Processing Systems}, year = {2023}, url = {https://neurips.cc/virtual/2023/poster/70426}, }
COSYNE
Multi-subject neural decoding via relative representations

Valentino Maiorca, Simone Azeglio, Marco Fumero, Clémentine Dominé, Emanuele Rodolà, and Francesco Locatello

In COSYNE, 2024

TL;DR: We apply relative representations to neural decoding, mapping fMRI data from different subjects into a common subject-agnostic representational space by leveraging neural encoders and anchor-based similarity functions. On the Natural Scenes Dataset with 8 subjects, our framework achieves substantially higher cross-subject retrieval accuracy than PCA and absolute baselines, enabling generalization without expensive alignment training

Bib
@inproceedings{maiorca2024multi, title = {Multi-subject neural decoding via relative representations}, author = {Maiorca, Valentino and Azeglio, Simone and Fumero, Marco and Domin{\'e}, Cl{\'e}mentine and Rodol{\`a}, Emanuele and Locatello, Francesco}, booktitle = {COSYNE}, year = {2024}, }