Commit 4f52538: Update 4.6MBAR.md
KuangYu authored Nov 7, 2023 · 1 parent 899d050
docs/user_guide/4.6MBAR.md

## 1. Theory

In molecular dynamics (MD) simulations, the deep computational graph spanning the entire trajectory incurs significant temporal and computational costs. This limitation can be circumvented through trajectory reweighting schemes. In DMFF, the reweighting algorithm is incorporated into the MBAR method, extending the differentiable estimators for average properties and free energy. Although differentiable estimation of dynamic properties remains a challenge, introducing the reweighted MBAR estimator has largely eased the fitting of thermodynamic properties.

In the MBAR theory, it is assumed that there are $K$ ensembles defined by the (effective) potential energies

```math
\tag{1}
u_{i}(x)\ (i=1,2,3,\ldots,K)
```

For each ensemble, the Boltzmann weight, partition function, and probability function are defined as:

```math
\tag{2}
\begin{align}
w_i(x) &= \exp(-\beta_i u_i(x)) \\
c_i &= \int dx \cdot w_i(x) \\
p_i(x) &= \frac{w_i(x)}{c_i}
\end{align}
```

For each ensemble $i$, select $N_{i}$ configurations, represented by $\{x_{in}\}$ ($n=1,2,3,\ldots,N_i$); the pooled set of configurations across all ensembles is represented by $\{x_{n}\}$ ($n=1,2,3,\ldots,N$), where $N$ is:

```math
\tag{3}
N = \sum_{i=1}^{K} N_i
```
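As a concrete illustration of this data layout (a minimal sketch with made-up numbers, not DMFF code), the pooled samples are typically organized as a matrix of reduced energies together with the per-ensemble sample counts:

```python
import jax
import jax.numpy as jnp

# Hypothetical setup: K = 3 ensembles, with N_k[i] samples drawn from ensemble i.
K = 3
N_k = jnp.array([100.0, 150.0, 200.0])
N = int(N_k.sum())                       # Eqn (3): N = 450

# u_kn[i, n] holds u_i(x_n): every pooled sample x_n evaluated under every
# ensemble's (effective) potential; random placeholders stand in for real data.
key = jax.random.PRNGKey(0)
u_kn = jax.random.normal(key, (K, N))

# Boltzmann weights of Eqn (2) (beta folded into u), known only up to the c_i:
w_kn = jnp.exp(-u_kn)
```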

Within the context of MBAR, the partition function estimate of any ensemble $i$ is given by:

```math
\tag{4}
\hat{c}_i = \sum_{n=1}^{N} w_{i}(x_n) \cdot \left(\sum_{k=1}^{K} N_{k} \hat{c}_k^{-1} w_{k}(x_n)\right)^{-1}
```

To compute the average of a physical quantity $A$ in ensemble $i$, one can utilize the above values to define a virtual ensemble $j$, with its corresponding Boltzmann weight and partition function:

```math
\tag{5}
\begin{align}
w_j(x) &= w_i(x)A(x) \\
c_j &= \int dx \cdot w_j(x)
\end{align}
```

Thus, the ensemble average of $A$ is:

```math
\tag{6}
\langle A \rangle_i = \frac{\hat{c}_j}{\hat{c}_i} = \frac{\int dx \cdot w_i(x)A(x)}{\int dx \cdot w_i(x)}
```

Thus, the MBAR theory provides a method to estimate ensemble averages using samples drawn from multiple ensembles.
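To make the estimator concrete, below is a self-consistent iteration of Eqn (4), followed by the ensemble average of Eqns (5)-(6). This is a minimal sketch in plain JAX, not the DMFF implementation, and it omits the log-space accumulation a production solver would use for numerical stability:

```python
import jax.numpy as jnp

def solve_mbar(w_kn, N_k, n_iter=1000):
    """Self-consistent iteration of Eqn (4).

    w_kn[k, n]: Boltzmann weight w_k(x_n) of pooled sample n in ensemble k
    N_k[k]:     number of samples drawn from ensemble k
    Returns the partition function estimates c_k (up to a common factor).
    """
    c_k = jnp.ones(w_kn.shape[0])
    for _ in range(n_iter):
        # denominator of Eqn (4): sum_k N_k * c_k^{-1} * w_k(x_n)
        denom = jnp.einsum("k,k,kn->n", N_k, 1.0 / c_k, w_kn)
        c_k = jnp.sum(w_kn / denom, axis=1)
        c_k = c_k / c_k[0]  # fix the arbitrary overall scale
    return c_k

def ensemble_average(A_n, w_in, c_k, w_kn, N_k):
    """Eqns (5)-(6): <A>_i = c_j / c_i, with w_j(x) = w_i(x) * A(x)."""
    denom = jnp.einsum("k,k,kn->n", N_k, 1.0 / c_k, w_kn)
    c_i = jnp.sum(w_in / denom)        # Eqn (4) applied to the ensemble i
    c_j = jnp.sum(w_in * A_n / denom)  # same estimator for the virtual ensemble j
    return c_j / c_i
```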

In the MBAR framework, $\hat{c}_i$ in Eqn (4) needs to be solved iteratively; however, the differentiable reweighting algorithm can simplify this estimation process. During gradient descent parameter optimization, the parameters undergo only small changes in each training cycle. This allows samples from previous cycles to be reused to evaluate the target ensemble being optimized, so resampling is not necessary until the target ensemble deviates significantly from the sampling ensemble. This reduces the time and computational cost of the optimization considerably.

In the reweighted MBAR estimator, we define two types of ensembles: the sampling ensembles, from which all samples are extracted (labeled as $m=1,2,3,\ldots,M$), and the target ensembles, which are being optimized (labeled as $p, q$, corresponding to the indices $i, j$ in Eqn (6)). The sampling ensembles are updated only when necessary and do not need to be differentiable; their data can be generated by external samplers such as OpenMM. Hence, $\hat{c}_i$ in Eqn (4) can be transformed into:

```math
\tag{7}
\hat{c}_p = \sum_{n=1}^{N} w_{p}(x_n) \left( \sum_{m=1}^{M} N_{m} \hat{c}_m^{-1} w_{m}(x_n) \right)^{-1}
```

When resampling happens, Eqn (4) is solved iteratively using standard MBAR to update $\hat{c}_m$, which is stored and used to evaluate $\hat{c}_p$ until the next resampling. During the parameter optimization process, Eqn (7) is then employed to compute $\hat{c}_p$, serving as a differentiable estimator.
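Schematically, the differentiable part then reduces to Eqn (7) with all sampling-ensemble quantities held fixed. Below is a minimal sketch (the function name `u_p_fn` and the argument layout are assumptions for illustration, not the actual DMFF API):

```python
import jax
import jax.numpy as jnp

def c_p_hat(params, u_p_fn, x_n, denom_n, beta):
    """Differentiable Eqn (7).

    denom_n[n] = sum_m N_m * c_m^{-1} * w_m(x_n) is computed once by a
    standard (non-differentiable) MBAR solve at resampling time and then
    held fixed; only the target energy u_p depends on the parameters.
    """
    w_p = jnp.exp(-beta * u_p_fn(params, x_n))  # target-ensemble weights
    return jnp.sum(w_p / denom_n)

# The gradient w.r.t. the force field parameters is then available for training:
grad_c_p = jax.grad(c_p_hat)
```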

Below, we illustrate the workflow of using the MBAR estimator in DMFF through a case study.

If all samples are drawn from a single sampling ensemble $w_{0}(x)$, and the target ensemble is defined as $w_{p}(x)$, then for a physical quantity $A$, we have:

```math
\tag{8}
w_q(x) = w_p(x) A(x)
```

and define:

```math
\tag{9}
\Delta u_{p_0} = u_p(x) - u_0(x)
```

then:

```math
\tag{10}
\langle A \rangle_p = \frac{\hat{c}_q}{\hat{c}_p} = \left(\sum_{n=1}^{N} A(x_n) \exp(-\beta \Delta u_{p_0}(x_n))\right) \cdot \left(\sum_{n=1}^{N} \exp(-\beta \Delta u_{p_0}(x_n))\right)^{-1}
```

Comparing with the equations above, this shows that the trajectory reweighting algorithm is a special case of the reweighted MBAR estimator.
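In this single-sampling-ensemble limit, Eqn (10) reduces to a few lines of code (a sketch; `A_n` and `du_n` stand for $A(x_n)$ and $\Delta u_{p_0}(x_n)$ evaluated on the stored samples):

```python
import jax.numpy as jnp

def reweighted_average(A_n, du_n, beta):
    """Eqn (10): <A>_p estimated from samples of the sampling ensemble w_0."""
    log_w = -beta * du_n
    w = jnp.exp(log_w - jnp.max(log_w))  # shift by the max for numerical stability
    return jnp.sum(A_n * w) / jnp.sum(w)
```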

In DMFF, when calculating the average of a physical quantity $A$, the formula is expressed as:

```math
\tag{11}
\langle A \rangle_p = \sum_{n=1}^{N} W_n A(x_n)
```

where

```math
\tag{12}
\Delta U_{mp} = U_m(x_n) - U_p(x_n)
```

```math
\tag{13}
W_n = \left[\sum_{m=1}^{M} N_m e^{\hat{f}_m -\beta \Delta U_{mp}(x_n)}\right]^{-1} \cdot \left(\sum_{n=1}^{N} \left[ \sum_{m=1}^{M} N_m e^{\hat{f}_m -\beta \Delta U_{mp}(x_n)} \right]^{-1}\right)^{-1}
```

$\hat{f}_m$ is the dimensionless free energy of sampling state $m$ (i.e., $\hat{f}_m = -\ln \hat{c}_m$), and $W_n$ is the MBAR weight of each sample. Finally, the effective sample size is given, based on which one can judge the deviation of the sampling ensemble from the target ensemble:

```math
\tag{14}
n_{\text{eff}} = \left(\sum_{n=1}^{N} W_n\right)^2 \cdot \left(\sum_{n=1}^{N} W_n^2\right)^{-1}
```

When $n_{\text{eff}}$ is too small, it indicates that the current sampling ensemble deviates too much from the target ensemble, and resampling is needed.
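Putting Eqns (11)-(14) together, the MBAR weights, the observable average, and the resampling check can be sketched as follows (illustrative only; the threshold on $n_{\text{eff}}$ is an assumption, not a DMFF default):

```python
import jax.numpy as jnp

def mbar_weights(f_m, dU_mn, N_m, beta):
    """Eqn (13): per-sample weights W_n.

    f_m[m]:      free energy of sampling state m
    dU_mn[m, n]: U_m(x_n) - U_p(x_n), Eqn (12)
    N_m[m]:      number of samples drawn from sampling state m
    """
    # inner sum over sampling states m, for each pooled sample n
    s_n = jnp.einsum("m,mn->n", N_m, jnp.exp(f_m[:, None] - beta * dU_mn))
    return (1.0 / s_n) / jnp.sum(1.0 / s_n)  # normalized weights

def n_eff(W_n):
    """Eqn (14): effective sample size."""
    return jnp.sum(W_n) ** 2 / jnp.sum(W_n ** 2)

# Usage sketch:
#   A_p = jnp.sum(W_n * A_n)            # Eqn (11)
#   if n_eff(W_n) < 0.5 * len(W_n):     # threshold chosen for illustration
#       ...regenerate samples with the external sampler...
```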

Here is a graphical representation of the workflow mentioned above:
