source: src/documentation/constructs/potentials.dox@ 13e5be

stable v1.7.0
Last change on this file since 13e5be was 3cfb31, checked in by Frederik Heber <frederik.heber@…>, 8 years ago

DOCU: Updated potentials.dox.

  • Property mode set to 100644
File size: 14.9 KB
Line 
1/*
2 * Project: MoleCuilder
3 * Description: creates and alters molecular systems
4 * Copyright (C) 2010 University of Bonn. All rights reserved.
5 * Please see the LICENSE file or "Copyright notice" in builder.cpp for details.
6 */
7
8/**
9 * \file potentials.dox
10 *
11 * Created on: Nov 28, 2012
12 * Author: heber
13 */
14
15/** \page potentials Empirical Potentials and FunctionModels
16 *
17 * On this page we explain what is meant with the Potentials sub folder.
18 *
19 * First, we are based on fragmenting a molecular system, i.e. dissecting its
20 * bond structure into connected subgraphs, calculating the energies of the
21 * fragments (ab-initio) and summing up to a good approximation of the total
22 * energy of the whole system, \sa fragmentation.
23 * Second, having calculated these energies, there quickly comes up the idea
24 * that one actually calculates quite similar systems all time and if one could
25 * not cache results in an intelligent (i.e. interpolating) fashion ...
26 *
27 * That's where so-called empirical potentials come into play. They are
28 * functions depending on a number of "fitted" parameters and the variable
29 * distances within a molecular fragment (i.e. the bond lengths) in order to
30 * give a value for the total energy without the need to solve a complex
31 * ab-initio model (essentially, not solving the electronic Schrödinger equation
32 * anymore). And they are accompanied by a specific binding model that
33 * represents what kind of many-body force is represented by the potential.
34 *
35 * Empirical potentials have been thought of by fellows such as Lennard-Jones,
36 * Morse, Tersoff, Stillinger and Weber, etc. And in their honor, most of the
37 * potential forms are named after its inventor. Hence, we speak e.g. of a
38 * Lennard-Jones potential.
39 *
40 * So, what we have to do in order to cache results is the following procedure:
41 * -# gather similar fragments
42 * -# perform a fit procedure to obtain the parameters for the empirical
43 * potential
44 * -# evaluate the potential instead of an ab-initio calculation
45 *
46 * However, we need more: What are similar fragments? How do we perform the
47 * fitting procedure? And as the potentials are mathematical functions, what
48 * arguments do they depend on?
49 *
50 * Similar fragments are those that share the same bond graph, i.e. they have
51 * the same number of nodes and the same number of edges. And each edge is
52 * between the same two elements.
53 *
54 * The fitting procedure works by looking at a training set, i.e. a list of
55 * elements where each contains an energy and a number of arguments, namely
56 * pair-wise distances. The error is then the difference between the energies
57 * in the set and all the energy values that we obtain when we feed the
58 * arguments into the fitted potentials. This error is minimized in the
59 * euclidian norm, i.e. least squares regression. But other norms might be
60 * possible in the future, too.
61 *
62 * And the pair-wise distances, we mentioned are the arguments.
63 *
64 * The terms, that we use, model the classes that are implemented:
65 * -# EmpiricalPotential: Contains the interface to a function that can be
66 * evaluated given a number of arguments_t, i.e. distances. Also, one might
67 * want to evaluate derivatives.
68 * -# FunctionModel: Is a function that can be fitted, i.e. it depends on a
69 * set of internal parameters that can be set and got.
70 * -# argument_t: The Argument stores not only the distance but also the index
71 * pair of the associated atoms and also their charges, to let the potential
72 * check on validity.
73 * -# SerializablePotential: Eventually, one wants to store to or parse from
74 * a file all potential parameters. This functionality is encoded in this
75 * class.
76 * -# HomologyGraph: "Similar" fragments in our case have to have the same bond
77 * graph. It is stored in the HomologyGraph that acts as representative.
78 * -# HomologyContainer: This container combines, in a ultimap fashion, all
79 * similar fragments with their energies together, with the HomologyGraph
80 * as their "key".
81 * -# TrainingData: Here, we combine InputVector and OutputVector that contain
82 * the set of distances required for the FunctionModel (e.g. only a single
83 * distance/argument for a pair potential, three for an angle potential,
84 * etc.) and also the expected OutputVector, i.e. the energy of the specific
85 * configuration in our case. This in combination with the FunctionModel is
86 * the basis for the non-linear regression used for the fitting procedure.
87 * -# Extractors: These set of functions yield the set of distances from a
88 * given fragment that is stored in the HomologyContainer.
89 * -# FunctionApproximation: Contains the interface to the levmar package where
90 * the Levenberg-Marquardt (Newton + Trust region) algorithm is used to
91 * perform the fit.
92 *
93 * \section potentials-fit-potential-action What happens in FitPotentialAction.
94 *
95 * First, charges and a potential type is used from the given options. This
96 * is used to instantiate EmpiricalPotentials via the PotentialFactory, stored
97 * within the PotentialRegistry. This is the available set of potentials
98 * (without requiring any knowledge as to the nature of the fragment employed
99 * in fitting).
100 *
101 * Second, the given fragment is used to get a suitable HomologyGraph from
102 * the World's HomologyContainer. This is given to a CompoundPotential, that in
103 * turn browses through the PotentialRegistry, picking out those
104 * EmpiricalPotential's that match with a subset of the FragmentNode's of the
105 * given graph. These are stored as a list of FunctionModel's within the
106 * CompoundPotential instance. Here comes the specific fragment into play,
107 * picking a subset from the available potentials.
108 *
109 * Third, we need to setup the training data. For this we need vectors of input
110 * and output data that are obtained from the HomologyContainer with the
111 * HomologyGraph as key. The output vector in our case is simply a number
112 * (although the interface allows for more). The input vector is the set of
113 * distances. In order to pre-process the input data for the specific model
114 * a filter is required in the TrainingData's constructor. The purpose of the
115 * filter is to pick out the subset of distance arguments for each model one
116 * after the other and concatenate them to a list. On evaluation of the model
117 * this concatenated list of distances is given to the model and it may easily
118 * dissect the list and hand over each contained potential its subset of
119 * arguments. See Extractors for more information.
120 *
121 * Afterwards, training may commence: The goal is to find a set of parameters
122 * for the model such that it as good as possible reproduces the output vector
123 * for a given input vector. This non-linear regression is contained in the
124 * levmar package and its functionality is wrapped in the FunctionApproximation
125 * class. An instance is initialized with both the gathered training data and
126 * the model containing a list of potentials. See
127 * [FunctionApproximation-details] for more details.
128 *
129 * During the fitting procedure, first the derivatives of the model is checked
130 * for consistency, then the model is initialized with a sensible guess of
131 * starting parameters, and afterwards the Levenberg-Marquardt algorithm
132 * commences that makes numerous calls to evaluate the model and its derivative
133 * to find the minimum in the L2-norm.
134 *
135 * This is done more than once as high-dimensional regression is sensitive to
136 * the starting values as there are possible numerous local minima. The lowest
137 * of the found minima is taken, either via a given threshold or the best of a
138 * given number of attempts.
139 *
140 * Eventually, these parameters of the best model are streamed via
141 * PotentialSerializer back into a potential file. Each EmpiricalPotential in
142 * the CompoundPotential making up the whole model is also a
143 * SerializablePotential. Hence, each in turn writes a single line with its
144 * respective subset of parameters and particle types, describing this
145 * particular fit function.
146 *
147 * \section potentials-function-evaluation How does the model evaluation work
148 *
149 * We now come to the question of how the model and its derivative are actually
150 * evaluated. We have an input vector from the training data and we have the
151 * model with a specific set of parameters.
152 *
153 * FunctionModel is just an abstract interface that is implemented by the
154 * potential functions, such as CompoundPotential, that combines multiple
155 * potentials into a single function for fitting, or PairPotential_Harmonic,
156 * that is a specific fit function for harmonic bonds.
157 *
158 * The main issue with the evaluation is picking the right set of distances from
159 * ones given in the input vector and feed it to each potential contained in
160 * CompoundPotential.
161 *
162 * Initially, the HomologyGraph only contains a list of configurations of a
163 * specific fragments (i.e. the position of each atom in the fragment) and an
164 * energy value. These first have to be converted into distances.
165 *
166 * These distances are prepared by the TrainingData instantiation, i.e. a
167 * fragment with all its atomic positions has already been converted to the
168 * set of all pair-wise interatomic distances.
169 *
170 * \section potentials-distance-picking How does the distance picking work
171 *
172 * Given a set of pair-wise distances, how do we pick the subset of distances
173 * needed by a particular potential.
174 *
175 * Let's make an example first: Imagine a water molecule, i.e. one oxygen and
176 * and two hydrogen atoms with two O-H bonds. Naturally, we obtain three pair-
177 * wise distances, OH1, OH2 and H1H2. Now, we would like to fit a Morse
178 * potential that just depends on a single interatomic distance. We would like
179 * it to represents the O-H bond energy. Hence, the distance picker, namely
180 * the Extractor function, needs to pick any subset of distance that contains
181 * a unique single O-H distance. In effect, it needs to return a list
182 * containing OH1 and OH2 as the Morse potential needs to represent both bond
183 * energies together.
184 *
185 * Now, this is really still quite simple as long as the potential only
186 * depends on a single distance. However, what if we continue and look at a
187 * angle potential, requiring three atoms, i.e. H-O-H?
188 *
189 * Or even more complicated: Imagine an ethane molecule (C2H6) and we would
190 * to represent the H-C-C angular interaction by a harmonic angle potential.
191 * Now, there are multiple of these at the same time, namely six angular
192 * interactions.
193 *
194 * What have to do is look for subgraphs inside a graph. Each potential comes
195 * with a small graph that represents the binding structure, in our terms
196 * the bond model, that we expect. And we need to find the all matching
197 * subgraphs in the whole graph being the fragment itself. Then, for each
198 * subgraph the potential tells us in what order the pair-wise distances
199 * associated with the subgraph are required to be. All of these subset of
200 * distances are eventually concatenated and fed into the model on evaluation.
201 *
202 * \section potentials-howto-use Howto use the potentials
203 *
204 * We just give a brief run-down in terms of code on how to use the potentials.
205 * Here, we just describe what to do in order to perform the fitting. This is
206 * basically what is implemented in FragmentationFitPotentialAction.
207 *
208 * \code
209 * // we need the homology container and the representative graph we want to
210 * // fit to.
211 * HomologyContainer homologies;
212 * const HomologyGraph graph = getSomeGraph(homologies);
213 * Fragment::atomicnumbers_t h2o;
214 * h2o += 8,1,1;
215 * // TrainingData needs so called Extractors to get the required distances
216 * // from the stored fragment. These functions are bound via boost::bind.
217 * TrainingData AngleData(
218 * boost::bind(&Extractors::gatherDistancesFromFragment,
219 * boost::bind(&Fragment::getPositions, _1),
220 * boost::bind(&Fragment::getAtomicNumbers, _1),
221 * boost::cref(h2o),
222 * _2)
223 * );
224 * // now we extract the distances and energies and store them
225 * AngleData(homologies.getHomologousGraphs(graph));
226 * // give ParticleTypes of this potential to make it unique
227 * PairPotential_Angle::ParticleTypes_t types =
228 * boost::assign::list_of<PairPotential_Angle::ParticleType_t>
229 * (8)(1)(1)
230 * ;
231 * PairPotential_Angle angle(types);
232 * // give initial parameter
233 * FunctionModel::parameters_t params(PairPotential_Angle::MAXPARAMS, 0.);
234 * // ... set some potential-specific initial parameters in params struct
235 * angle.setParameters(params);
236 *
237 * // use the potential as a FunctionModel along with prepared TrainingData
238 * FunctionModel &model = angle;
239 * FunctionApproximation approximator(AngleData, model);
240 * approximator(FunctionApproximation::ParameterDerivative);
241 *
242 * // obtain resulting parameters and check remaining L_2 and L_max error
243 * angleparams = model.getParameters();
244 * LOG(1, "INFO: L2sum = " << AngleData(model)
245 * << ", LMax = " << AngleData(model) << ".");
246 * \endcode
247 *
248 * The evaluation of the fitted potential is then trivial, e.g.
249 * \code
250 * // constructed someplace
251 * PairPotential_Angle angle(...);
252 *
253 * // evaluate
254 * FunctionModel::arguments_t args;
255 * // .. initialise args to the desired distances
256 * const double value = angle(args)[0]; // output is a vector!
257 * \endcode
258 *
259 * \section potentials-stability-of-fit note in stability of fit
260 *
261 * As we always start from random initial parameters (within a certain sensible
262 * range at least), the non-linear fit does not always converge. Note that the
263 * random values are drawn from the defined distribution and the uniform distribution
264 * engine is obtained from the currently set, see \ref randomnumbers. Hence, you
265 * can manipulate both in order to get different results or to set the seed such that
266 * some "randomly" drawn value always work well (e.g. for testing).
267 *
268 * In any case, the FragmentationFitPotentialAction has the option
269 * "take-best-of" to allow for multiple fits where the best (in terms of l2 error)
270 * is taken eventually. Furthermore, you can use the "set-threshold" option to
271 * repeat the fit procedure until the L2 error has dropped below the given
272 * threshold.
273 *
274 * \section potentials-howto-add Howto add new potentials
275 *
276 * Adding a new potential requires the following:
277 * -# Add the new modules to Potentials/Specifics
278 * -# Add a unit test on the potential in Potentials/Specifics/unittests
279 * -# Give the potential a type name and add it to PotentialTypes.def. Note
280 * that the name must not contain white space.
281 * -# Add the potential name as case to PotentialFactory such that it knows
282 * how to instantiate your new potential when requested.
283 * -# Remember to use the the RandomNumberGenerator for getting random starting
284 * values!
285 *
286 * PotentialTypes.def contains a boost::preprocessor sequence of all
287 * potential names. PotentialFactory uses this sequence to build its enum to
288 * type map and inverse which the user sees when specifying the potential to
289 * fit via PotentialTypeValidator.
290 *
291 *
292 * \date 2017-05-14
293 */
Note: See TracBrowser for help on using the repository browser.