Mechanistic Machine Learning

MInD Harvard Team Visit

George G. Vega Yon, Ph.D.

2023-05-19

Overview

Slides can be downloaded from
https://ggv.cl/slides/mind2023

Machine Learning is Broken

Mechanistic Machine Learning

Mechanistic Machine Learning [MechML]–a.k.a. theory-guided data science/machine learning: A hybrid between theory and data-driven prediction.

Mechanistic models

  • Inference-driven (causality).
  • Great for small datasets.
  • Knowledge beyond the observed data.

Machine Learning

  • Data-driven (prediction).
  • Great for big data.
  • Finds hidden knowledge in observed data.

ML can help explain what theory hasn’t… but we still need theory (Lazer et al. 2014)!

MechML: State-of-the-art

  • Creating a loss function with a mechanistic penalty for modeling tumor cell density (Gaw et al. 2019)

Warning

  1. Mechanistic Machine Learning is not domain-knowledge-aided feature engineering. You need a whole other model to complement the ML algorithm.

  2. This isn’t just an ML ensemble; you must have an ML and a Mech model.

How to “MechML”?

MechML: Three strategies

  • ML Correction: Use machine learning to learn the errors of a mechanistic model.
  • Mechanistic Feature: Add mechanistic predictions as a feature of a machine learning model.
  • Mechanistic Penalty: Add constraints to the ML algorithm based on a mechanistic model.