Action-Angle Networks

Physics Insight

There is a subclass of Hamiltonian systems called Liouville integrable systems.
The Liouville-Arnold theorem guarantees that, for these systems, there exists a canonical transformation to a set of coordinates called Action-Angle coordinates, with which the phase space is greatly simplified and allowing for easier prediction.

The Architecture

Daigavane et al.1 introduce Action-Angle networks, an architecture that takes advantage of the linear evolution of the angle variable for Liouville-Arnold theorem to evolve the system forward in time.

The paper adopts an autoencoder-type architecture:

  • Encode step: they use a GSympNet2 network to do the canonical transformation from $(q(t), p(t))$ to $(I(t), \theta (t))$,
  • Evolve step: they take advantage of the fact that the angle coordinate evolves linearly with time to use the Euler update rule to get $\theta (t + \Delta t)$*, and
  • Decode step: they use the inverse of the GSympNet network to go back to position-momentum at the new time step
    N.B.: GSympNets are very easily invertible by just reserving order of the gradient modules and negating their learnable parameters. For more details, check again2

*: the $\dot{\theta} (t)$ (used in the Euler update rule in the evolve step) is approximated using a multi-layer perceptron.

This is the schematic of the architecture (image taken from the paper1):
ActionAngleNetworkArchitecture.png

The loss function of the action-angle network has the following two terms:

  • $L_{\text{predict}} = \frac{1}{1 + \Delta t} \sum_{t_0 \in \text{Tr}} ||\mathcal{M} (u(t_0), \Delta t) - u(t_0 + \Delta t)||^2$, and
  • $L_{\text{action}} = \frac{1}{T} \left (\hat{I} (t_0) - \frac{1}{T} \sum_{t_0 \in \text{Tr}} \hat{I}(t_0) \right)$ (this term is to enforce the constancy of the action variable),
    where $\text{Tr}$ is the training data composed of position $q(t)$ and momentum $p(t)$ data for a number of time steps.
1

Daigavane, A., Kosmala, A., Cranmer, M., Smidt, T., & Ho, S. (2022). Learning Integrable Dynamics with Action-Angle Networks. https://arxiv.org/abs/2211.15338

2

Jin, P., Zhang, Z., Zhu, A., Tang, Y., & Karniadakis, G. E. (2020). SympNets: Intrinsic structure-preserving symplectic networks for identifying Hamiltonian systems. https://arxiv.org/abs/2001.03750 jin2020 SympNets