Theory#
A parametric potential typically takes the form
where and
are the
coordinates and species of a system of
atoms, respectively, and
denotes a set of fitting parameters.
For notational simplicity, in the following discussion, we assume that the atomic
species information is implicitly carried by the coordinates and thus we can exclude
from the functional form, and use
to denote the
coordinates of all atoms in the configuration. Then we have
A potential parameterization process is typically formulated as a weighted
least-squares minimization problem, where we adjust the potential parameters
so as to reproduce a training set of reference data obtained from
experiments and/or first-principles computations. Mathematically, we hope to minimize a loss function
with respect to , where
is
a training set of
reference data,
is the corresponding
prediction for
computed from the potential (as indicated by its
argument),
denote the
norm, and
is the
weight for the
-th data point. We call
the residual function that characterizes the difference between the potential predictions and the reference data for a set of properties.
Generally speaking, can be a collection of any material properties
considered important for a given application, such as the cohesive energy,
equilibrium lattice constant, and elastic constants of a given crystal phase.
These materials properties can be obtained from experiments and/or
first-principles calculations.
However, nowadays, most of the potentials are trained using the force-matching
scheme, where the potential is trained to a large set of forces on atoms
(and/or energies, stresses) obtained by first-principles calculations for a
set of atomic configurations. This is extremely true for machine learning
potentials, where a large set of training data is necessary, and it seems impossible
to collect sufficient number of material properties for the training set.
The reference and the prediction
are typically
represented as vectors such that
is the
-th reference property and
is the
corresponding
-th prediction obtained from the potential.
Assuming we want to fit a potential to energy and forces, then
is a vector of size
, in which
is the number
of atoms in a configuration, with
where is the reference energy, and
,
, and
denote the
-,
-, and
-component of reference force on atom
, respectively.
In other words, we put the energy as the 0th component of
, and
then put the force on the first atom as the 1st to 3rd components of
,
the force on the second atom the next three components till the forces on all
atoms are placed in
.
In the same fashion, we can construct the prediction vector
, and
then to compute the residual vector.
Note
We use boldface with subscript to denote a data point (e.g.
means the
-th data point in the training set), and use normal text
with square bracket to denote the component of a data point (e.g. :
indicates the
-th component of a general data point
.
If stress is used in the fitting, to
will store
the reference Voigt stress
,
and, of course,
to
are the corresponding
predictions computed from the potential.
The objective of the parameterization process is to find a set of parameters
of potential that reproduce the reference data as well as
possible.