Data-driven collective variables

Using machine learning to discover CVs for enhanced sampling

A key challenge in enhanced sampling simulations is identifying collective variables (CVs) able to efficiently explore rare events. I developed data-driven approaches that automate this process using machine learning techniques. Notably, I proposed a method to build CVs from metastable states alone via neural networks optimized with Fisher’s discriminant (Bonati et al., 2020) and a deep learning framework to extract slow modes from biased simulations, improving rare-event sampling in diverse applications (Bonati et al., 2021).

Left: DeepLDA. Right: DeepTICA.

Recent advances include a descriptor-free approach leveraging geometric graph neural networks for symmetry-invariant CVs (Zhang et al., 2024) and a multitask approach that can learn CVs from transition path sampling simulations while simultaneously optimizing shooting efficiency (Zhang et al., 2024).

Code. All these techniques are implemented in mlcolvar, a Python library I developed which integrates machine learning-based CVs into enhanced sampling workflows (Bonati et al., 2023).

Reviews. We recently covered these topics in a tutorial-style book chapter (Trizio et al., 2024) and a Chemical Review article (Zhu et al., 2025)

References

2025

  1. Chem. Rev.
    2025-chemrev.jpeg
    Enhanced Sampling in the Age of Machine Learning: Algorithms and Applications
    Kai Zhu, Enrico Trizio, Jintu Zhang, Renling Hu, Linlong Jiang, Tingjun Hou, and Luigi Bonati
    Chemical Reviews, Oct 2025

2024

  1. JCTC
    2024-jctc-gnn.png
    Descriptor-Free Collective Variables from Geometric Graph Neural Networks
    Jintu Zhang, Luigi Bonati, Enrico Trizio, Odin Zhang, Yu Kang, Ting Jun Hou, and Michele Parrinello
    Journal of Chemical Theory and Computation, Dec 2024
  2. JCTC
    2024-jctc-tps.png
    Combining Transition Path Sampling with Data-Driven Collective Variables through a Reactivity-Biased Shooting Algorithm
    Jintu Zhang, Odin Zhang, Luigi Bonati, and Ting Jun Hou
    Journal of Chemical Theory and Computation, Jun 2024
  3. arXiv
    Advanced simulations with PLUMED: OPES and Machine Learning Collective Variables
    Enrico Trizio, Andrea Rizzi, Pablo M. Piaggi, Michele Invernizzi, and Luigi Bonati
    arXiv, Oct 2024

2023

  1. JCP
    2023-jcp-mlcolvar.png
    A unified framework for machine learning collective variables for enhanced sampling simulations: mlcolvar
    Luigi Bonati, Enrico Trizio, Andrea Rizzi, and Michele Parrinello
    The Journal of Chemical Physics, Jul 2023

2021

  1. PNAS
    2021-pnas-tica.png
    Deep learning the slow modes for rare events sampling
    Luigi Bonati, GiovanniMaria Piccini, and Michele Parrinello
    Proceedings of the National Academy of Sciences, 16 2021

2020

  1. J. Phys. Chem. Lett.
    2020-jpcl-deeplda.png
    Data-Driven Collective Variables for Enhanced Sampling
    Luigi Bonati, Valerio Rizzi, and Michele Parrinello
    Journal of Physical Chemistry Letters, 16 2020