Publications
Updated list on Google Scholar
2025
- In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025. TL;DR: We introduce LinEAS, a method for controlling generative models by learning affine transformations on internal activations using optimal transport theory. Trained end-to-end across all layers with just 32 unpaired samples and sparse regularization for automatic neuron selection, LinEAS achieves effective toxicity mitigation in LLMs and style control in text-to-image models without retraining. (An illustrative sketch of affine activation steering appears after this year's entries.)
- NeurIPS spotlight. In The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025. TL;DR: We use Simultaneous Orthogonal Matching Pursuit to identify attention heads specialized in narrow semantic domains (colors, countries, toxicity) in large language and vision-language models. Intervening on as few as 1% of heads enables bidirectional concept control, suppressing toxic content by 34-51% or enhancing target attributes, without any training.
- TMLR, 2025. TL;DR: We discover that attention head representations in vision transformers lie on low-dimensional manifolds whose principal components encode specialized semantics (letters, locations, animals, etc.). By selectively amplifying task-relevant principal components through learned anisotropic scaling (ResiDual), we achieve fine-tuning-level performance with up to four orders of magnitude fewer parameters than full fine-tuning.
- 2025. TL;DR: We adapt relative representations to reinforcement learning, enabling zero-shot composition of encoders and controllers trained independently across different visual variations and task objectives. This achieves a 75% reduction in training time (from N×M to N+M models) while maintaining performance comparable to end-to-end training.
- bioRxiv, 2025. TL;DR: We identify low-complexity repeats (LCRs) as key drivers of RNA-RNA interactions and develop RIME, a deep learning model that uses nucleic acid language model embeddings to predict RNA-RNA interactions. RIME outperforms traditional thermodynamics-based tools and successfully captures LCR-mediated interactions important for gene regulation and neuronal development.
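A minimal, hypothetical sketch of what a per-neuron affine transformation on internal activations looks like at inference time, as referenced in the LinEAS entry above. It assumes PyTorch, invents the `AffineSteer` wrapper name, and omits the optimal-transport objective and sparsity regularization used to actually learn the parameters.

```python
# Illustrative sketch only, not the LinEAS implementation: wrap a hidden
# layer and apply a per-neuron affine map y = gamma * h + beta to its
# output activations.
import torch
import torch.nn as nn

class AffineSteer(nn.Module):  # hypothetical name
    def __init__(self, layer: nn.Module, width: int):
        super().__init__()
        self.layer = layer
        self.gamma = nn.Parameter(torch.ones(width))   # per-neuron scale
        self.beta = nn.Parameter(torch.zeros(width))   # per-neuron shift

    def forward(self, x):
        h = self.layer(x)                # original activations
        return self.gamma * h + self.beta

# Toy usage on a stand-in linear block; in practice the wrapped modules
# would be hidden blocks of a pretrained language or diffusion model.
block = nn.Linear(64, 64)
steered = AffineSteer(block, width=64)
out = steered(torch.randn(2, 64))
print(out.shape)  # torch.Size([2, 64])
```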
2024
- ICLR spotlight. In International Conference on Learning Representations, 2024. TL;DR: We construct a product space of multiple relative representations, each computed with a different distance metric (cosine, Euclidean, L1, L∞), to capture complex transformations between latent spaces without committing to a single invariance a priori. This consistently matches or exceeds the best single metric across vision, text, and graph domains in zero-shot model stitching tasks.
- In ICML AI4Science Workshop, 2024. TL;DR: We transform the computationally intractable Gromov-Wasserstein problem into a scalable, inductive solution by learning embeddings that map domains into a common space where alignment reduces to a single optimal transport problem. This enables handling 45,000+ samples where standard solvers fail beyond 25,000, with extensions to non-metric structures through rank-based matching.
- arXiv, 2024. TL;DR: We formalize the inverse transformation from relative space back to absolute space, enabling zero-shot latent space translation without concurrent access to both models or additional training. Combined with the scale-invariance properties of neural classifiers, this enables practical model compositionality where pre-trained encoders and classifiers can be mixed and matched across architectures and modalities.
- It’s All Relative: Relative Uncertainty in Latent Spaces using Relative Representations. In NeurIPS UniReps Workshop, 2024. TL;DR: We address the reparametrization problem in neural network ensemble uncertainty quantification by transforming latent spaces into relative representations. By sampling models along a curve connecting two independently trained networks and measuring alignment with a Fisher-inspired metric, we show that relative spaces reduce overestimated uncertainty and reveal that most meaningful latent changes occur around the curve midpoint.
- Latent Communication for Zero-shot Stitching in Reinforcement Learning. In Seventeenth European Workshop on Reinforcement Learning, 2024. TL;DR: We enable zero-shot modular policy reuse in reinforcement learning by learning simple affine transformations (via SVD) between independently trained agents’ latent representations using semantically aligned anchor observations. This allows encoders and controllers from different agents to be stitched together without training, achieving performance comparable to end-to-end training for robust control tasks. (A minimal sketch of anchor-based alignment appears after this year's entries.)
- Latent Functional Maps: a spectral framework for representation alignment. In Advances in Neural Information Processing Systems, 2024. TL;DR: We adapt the functional maps framework from 3D geometry processing to neural latent spaces by computing spectral transformations between graph Laplacian eigenbases. This provides a unified framework that can compare representational spaces, find correspondences with minimal supervision (5-10 anchors), and transfer representations while maintaining interpretability through spectral analysis.
- Multi-subject neural decoding via relative representations. In COSYNE, 2024. TL;DR: We apply relative representations to neural decoding, mapping fMRI data from different subjects into a common, subject-agnostic representational space by leveraging neural encoders and anchor-based similarity functions. On the Natural Scenes Dataset with 8 subjects, our framework achieves substantially higher cross-subject retrieval accuracy than PCA and absolute baselines, enabling generalization without expensive alignment training.
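To make the anchor-based alignment used in the stitching entries above concrete, here is a small NumPy sketch under simplifying assumptions: the papers estimate affine maps via SVD from real aligned anchors, while this toy restricts to the purely orthogonal (Procrustes) case and generates synthetic anchors with a known ground-truth rotation.

```python
# Illustrative sketch, not the authors' code: estimate an orthogonal map
# between two latent spaces from semantically aligned anchor embeddings,
# then reuse it to translate unseen embeddings (zero-shot stitching).
import numpy as np

def procrustes_map(anchors_src, anchors_tgt):
    """Orthogonal R minimizing ||anchors_src @ R - anchors_tgt||_F."""
    U, _, Vt = np.linalg.svd(anchors_src.T @ anchors_tgt)
    return U @ Vt

rng = np.random.default_rng(0)
d = 16
Q_true, _ = np.linalg.qr(rng.normal(size=(d, d)))  # hidden rotation between spaces
anchors_src = rng.normal(size=(64, d))             # anchors in the source space
anchors_tgt = anchors_src @ Q_true                 # same anchors in the target space

R = procrustes_map(anchors_src, anchors_tgt)
x_src = rng.normal(size=(3, d))                    # unseen source embeddings
x_stitched = x_src @ R                             # translated into the target space
assert np.allclose(x_stitched, x_src @ Q_true, atol=1e-6)
```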
2023
- ICLR oral. In International Conference on Learning Representations, 2023. TL;DR: We introduce relative representations, which make neural network latent spaces invariant to training stochasticity by encoding data points relative to anchor samples using cosine similarity. This enables zero-shot model stitching across different random seeds, architectures, languages, and datasets without any training. (A minimal sketch of the construction appears after this year's entries.)
- In Advances in Neural Information Processing Systems, 2023. TL;DR: ASIF creates multimodal models without any training by using relative representations computed from frozen pre-trained unimodal encoders and a small collection of image-text pairs. This training-free approach achieves competitive zero-shot classification with 250× less data than CLIP, while providing built-in interpretability.
- In Advances in Neural Information Processing Systems, 2023. TL;DR: We enable zero-shot stitching of independently trained encoders and decoders by estimating simple transformations (orthogonal via Procrustes analysis) between their latent spaces using semantically aligned anchor points. This works across architectures, domains, and even modalities without requiring training on relative representations.
- IEEE Transactions on Artificial Intelligence, 2023. TL;DR: We introduce SAVAGE, an adversarial attack framework that manipulates GNN-based link prediction in social networks by strategically injecting a minimal number of malicious nodes via a sparsity-enforcing mechanism. The attacks achieve an optimal trade-off between success rate and resource usage while transferring effectively across different black-box link prediction methods.
- Bootstrapping Parallel Anchors for Relative Representations. arXiv, 2023. TL;DR: We reduce the number of parallel anchors required for relative representations by one order of magnitude through an optimization-based method that discovers new anchors from a minimal seed set. Starting with just 15 seed anchors, our method discovers 300 parallel anchors and often outperforms using ground-truth anchors.
- ICLR Tiny Papers Track, 2023. TL;DR: We discover that unexpected (low-likelihood) tokens cause transformer models to attend less to information from those tokens themselves when computing representations, particularly in higher layers. This correlation between token likelihood and attention values has implications for assessing LLM robustness in real-world scenarios.
- In Annual Meeting of the Association for Computational Linguistics, 2023. TL;DR: We accelerate transformer inference for translation by reformulating standard greedy autoregressive decoding as a parallel fixed-point iteration using Jacobi and Gauss-Seidel methods, achieving a 38% speedup (nearly 2× on parallel hardware) while maintaining translation quality without any model retraining or architectural changes.
- In ICML TAG-ML Workshop, 2023. TL;DR: We propose a framework for incorporating invariances to targeted transformations (model initialization, architecture, training modality) into neural representations, demonstrating that independently trained networks have structural similarities in latent spaces. Analysis across 8 benchmarks reveals that optimal transformation classes depend on the task at hand.
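To make the relative-representation construction referenced in the ICLR 2023 entry above concrete, here is a small NumPy sketch (not the authors' code): each sample is re-encoded by its cosine similarity to a set of anchor embeddings, which makes the representation invariant to orthogonal transformations of the absolute latent space.

```python
# Illustrative sketch: relative representation = cosine similarity of each
# sample embedding to a fixed set of anchor embeddings.
import numpy as np

def relative_representation(X, anchors, eps=1e-8):
    """X: (n, d) absolute embeddings; anchors: (k, d). Returns (n, k)."""
    Xn = X / (np.linalg.norm(X, axis=1, keepdims=True) + eps)
    An = anchors / (np.linalg.norm(anchors, axis=1, keepdims=True) + eps)
    return Xn @ An.T

# Toy check: two encoders that differ by an orthogonal rotation yield the
# same relative representation for the same data and anchors.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 16))
anchors = rng.normal(size=(8, 16))
Q, _ = np.linalg.qr(rng.normal(size=(16, 16)))  # random rotation
r1 = relative_representation(X, anchors)
r2 = relative_representation(X @ Q, anchors @ Q)
assert np.allclose(r1, r2, atol=1e-6)
```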
2022
- In Learning on Graphs Conference, 2022. TL;DR: We provide a unified evaluation framework for few-shot graph classification, showing that simple metric learning baselines with state-of-the-art graph embedders and task-conditioned embedding spaces outperform complex graph-specific approaches. Our modular framework with MixUp data augmentation achieves the best overall results across benchmarks.
2021
- In Conference on Empirical Methods in Natural Language Processing, 2021. TL;DR: We create high-quality silver (automatically labeled) training data for multilingual Named Entity Recognition by combining knowledge-based approaches from Wikipedia with neural models and a novel domain adaptation technique, achieving a 6-point F1-score improvement over previous state-of-the-art data creation methods.