Commit d43da75 (1 parent: d384f62): 8 changed files with 205 additions and 6 deletions.
New file (1 line added): `model.ckpt-1.pt`
New documentation file (192 lines added):
# Fit other properties {{ pytorch_icon }} {{ jax_icon }} {{ dpmodel_icon }}

:::{note}
**Supported backends**: PyTorch {{ pytorch_icon }}, JAX {{ jax_icon }}, DP {{ dpmodel_icon }}
:::

Here we present an API to the DeepProperty model, which can be used to fit other properties like band gap, bulk modulus, critical temperature, etc.

In this example, we will show you how to train a model to fit the properties `humo`, `lumo` and `band gap`. A complete training input script for the example can be found in

```bash
$deepmd_source_dir/examples/property/train
```

The training and validation data are also provided in our examples. But note that **the data provided along with the examples are of limited amount and should not be used to train a production model.**

Similar to the `input.json` used in `ener` mode, the training JSON is also divided into {ref}`model <model>`, {ref}`learning_rate <learning_rate>`, {ref}`loss <loss>` and {ref}`training <training>`. Most keywords remain the same as in `ener` mode, and their meaning can be found [here](train-se-atten.md). To fit the `property`, one needs to modify {ref}`model[standard]/fitting_net <model[standard]/fitting_net>` and {ref}`loss <loss>`.

## The fitting network

The {ref}`fitting_net <model[standard]/fitting_net>` section tells DP which fitting net to use.

The JSON for the `property` type should be provided as follows:

```json
    "fitting_net": {
        "type": "property",
        "intensive": true,
        "property_name": "band_prop",
        "task_dim": 3,
        "neuron": [240, 240, 240],
        "resnet_dt": true,
        "fparam": 0,
        "seed": 1
    },
```

- `type` specifies which type of fitting net should be used. It should be `property`.
- `intensive` indicates whether the fitted property is intensive. If `intensive` is `true`, the model output is the average of the property contributions of each atom; if `intensive` is `false`, the model output is the sum of the property contributions of each atom (see the sketch after this list).
- `property_name` is the name of the property to be predicted. It should be consistent with the property name in the dataset. In each system, the code will read the `set.*/{property_name}.npy` file as the prediction label if you use NumPy-format data.
- `fitting_net/task_dim` is the dimension of the model output. It should be consistent with the property dimension in the dataset: if the shape of the data stored in `set.*/{property_name}.npy` is `batch_size * 3`, `fitting_net/task_dim` should be set to 3.
- The remaining arguments have the same meaning as they do in `ener` mode.

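To make the `intensive` flag concrete, here is a minimal NumPy sketch of how per-atom contributions would be aggregated into a frame-level property under the two settings. It is illustrative only (the array values and names are made up), not DeePMD-kit code:

```python
import numpy as np

# Hypothetical per-atom property contributions for one frame
# with 4 atoms and task_dim = 3 (illustrative values only).
atomic_contrib = np.array(
    [
        [0.1, 0.2, 0.3],
        [0.0, 0.1, 0.1],
        [0.2, 0.0, 0.4],
        [0.1, 0.3, 0.2],
    ]
)

# intensive = true: the frame-level property is the average over atoms.
prop_intensive = atomic_contrib.mean(axis=0)

# intensive = false: the frame-level property is the sum over atoms.
prop_extensive = atomic_contrib.sum(axis=0)

print(prop_intensive)  # shape (3,), matches task_dim
print(prop_extensive)  # shape (3,), matches task_dim
```
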
## Loss

DeepProperty supports training on global labels (one or more global labels are provided for each frame). For example, when fitting `property`, each frame provides a `1 x task_dim` vector that gives the fitted properties.

The loss section should be provided as follows:

```json
    "loss": {
        "type": "property",
        "metric": ["mae"],
        "loss_func": "smooth_mae"
    },
```

- {ref}`type <loss/type>` should be written as `property` to distinguish it from `ener` mode.
- `metric`: the metrics to display, which will be printed in `lcurve.out`. This list can include `smooth_mae`, `mae`, `mse` and `rmse`.
- `loss_func`: the loss function to minimize; available options are `mae`, `smooth_mae`, `mse` and `rmse` (a conceptual sketch of `smooth_mae` follows this list).

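For readers unfamiliar with `smooth_mae`, the sketch below shows the standard smooth-L1 (Huber-style) form of the loss: quadratic for small errors, linear for large ones. The `beta` threshold and the example values are assumptions for illustration, not necessarily the exact DeePMD-kit implementation:

```python
import numpy as np


def smooth_mae(pred: np.ndarray, label: np.ndarray, beta: float = 1.0) -> float:
    """Smooth-L1 (Huber-style) MAE: quadratic below beta, linear above it."""
    diff = np.abs(pred - label)
    loss = np.where(diff < beta, 0.5 * diff**2 / beta, diff - 0.5 * beta)
    return float(loss.mean())


# Illustrative values only.
pred = np.array([[-0.23, 0.06, 0.29]])
label = np.array([[-0.24, 0.06, 0.31]])
print(smooth_mae(pred, label))
```
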
## Training Data Preparation

The label file should be named `{property_name}.npy` (or `{property_name}.raw`), where `property_name` is defined by `fitting_net/property_name` in `input.json`.

To prepare the data, you can use the `dpdata` tools, for example:

```python
import dpdata
import numpy as np
from dpdata.data_type import (
    Axis,
    DataType,
)

property_name = "band_prop"  # fitting_net/property_name
task_dim = 3  # fitting_net/task_dim

# Register the property data type so that dpdata can read and write it.
datatypes = [
    DataType(
        property_name,
        np.ndarray,
        shape=(Axis.NFRAMES, task_dim),
        required=False,
    ),
]
# Mark energies and forces as optional (required=False), since this dataset
# provides no energy or force labels.
datatypes.extend(
    [
        DataType(
            "energies",
            np.ndarray,
            shape=(Axis.NFRAMES, 1),
            required=False,
        ),
        DataType(
            "forces",
            np.ndarray,
            shape=(Axis.NFRAMES, Axis.NATOMS, 1),
            required=False,
        ),
    ]
)
for datatype in datatypes:
    dpdata.System.register_data_type(datatype)
    dpdata.LabeledSystem.register_data_type(datatype)

ls = dpdata.MultiSystems()
frame = dpdata.System("POSCAR", fmt="vasp/poscar")
labelframe = dpdata.LabeledSystem()
labelframe.append(frame)
labelframe.data[property_name] = np.array([[-0.236, 0.056, 0.292]], dtype=np.float32)
ls.append(labelframe)
ls.to_deepmd_npy_mixed("deepmd")
```

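After writing the data, it can help to verify that the label files ended up where the model expects them (`set.*/{property_name}.npy`) and have the shape `(nframes, task_dim)`. A small sketch, assuming the `deepmd` output directory produced above:

```python
from pathlib import Path

import numpy as np

property_name = "band_prop"
task_dim = 3

# Look for the label files written by dpdata under the "deepmd" directory.
for label_file in sorted(Path("deepmd").rglob(f"set.*/{property_name}.npy")):
    labels = np.load(label_file)
    print(label_file, labels.shape)
    assert labels.ndim == 2 and labels.shape[1] == task_dim
```
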
## Train the Model

The training command is the same as in `ener` mode, i.e.

::::{tab-set}

:::{tab-item} PyTorch {{ pytorch_icon }}

```bash
dp --pt train input.json
```

:::

::::

The detailed loss can be found in `lcurve.out`:

```
# step      mae_val      mae_trn         lr
# If there is no available reference data, rmse_*_{val,trn} will print nan
      1     2.72e-02     2.40e-02    2.0e-04
    100     1.79e-02     1.34e-02    2.0e-04
    200     1.45e-02     1.86e-02    2.0e-04
    300     1.61e-02     4.90e-03    2.0e-04
    400     2.04e-02     1.05e-02    2.0e-04
    500     9.09e-03     1.85e-02    2.0e-04
    600     1.01e-02     5.63e-03    2.0e-04
    700     1.10e-02     1.76e-02    2.0e-04
    800     1.14e-02     1.50e-02    2.0e-04
    900     9.54e-03     2.70e-02    2.0e-04
   1000     1.00e-02     2.73e-02    2.0e-04
```

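If you want to monitor the training curve, the `lcurve.out` columns can be read directly with NumPy and plotted; a minimal sketch, assuming matplotlib is available and using the column names from the header above:

```python
import matplotlib.pyplot as plt
import numpy as np

# Columns follow the header line: step, mae_val, mae_trn, lr.
data = np.genfromtxt("lcurve.out", names=True)

plt.plot(data["step"], data["mae_val"], label="mae_val")
plt.plot(data["step"], data["mae_trn"], label="mae_trn")
plt.xlabel("step")
plt.ylabel("MAE")
plt.yscale("log")
plt.legend()
plt.savefig("lcurve.png")
```
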
## Test the Model

We can use `dp test` to infer the properties for given frames.

::::{tab-set}

:::{tab-item} PyTorch {{ pytorch_icon }}

```bash
dp --pt freeze -o frozen_model.pth
dp --pt test -m frozen_model.pth -s ../data/data_0/ -d ${output_prefix} -n 100
```

:::

::::

If `dp test -d ${output_prefix}` is specified, the predicted properties for each frame are output in the working directory:

```
${output_prefix}.property.out.0  ${output_prefix}.property.out.1  ${output_prefix}.property.out.2  ${output_prefix}.property.out.3
```

Each `*.property.out.*` file contains the reference and predicted properties: the first column is `data_property` and the second is `pred_property`, with one row per component of the property (`task_dim` rows for each frame):

```
# ../data/data_0 - 0: data_property pred_property
-2.449000030755996704e-01 -2.315840660495154801e-01
6.400000303983688354e-02 5.810663314446311983e-02
3.088999986648559570e-01 2.917143316092784544e-01
```

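To post-process these files, the two columns can be loaded with NumPy and compared. A small sketch, assuming the two-column layout shown above and an `${output_prefix}` of `results` (the prefix name is an assumption for illustration):

```python
import glob

import numpy as np

# Collect all output files written by `dp test -d results`.
files = sorted(glob.glob("results.property.out.*"))

data, pred = [], []
for fname in files:
    # Column 0: reference property, column 1: predicted property.
    arr = np.loadtxt(fname, ndmin=2)
    data.append(arr[:, 0])
    pred.append(arr[:, 1])

data = np.concatenate(data)
pred = np.concatenate(pred)
print("MAE:", np.mean(np.abs(pred - data)))
```
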
## Data Normalization

When `fitting_net/type` is `ener`, the energy bias layer $e_{bias}$ adds a constant bias to the atomic energy contribution according to the atomic number, i.e.,

$$e_{bias}(Z_i)\bigl(\mathrm{MLP}(D_i)\bigr) = \mathrm{MLP}(D_i) + e_{bias}(Z_i)$$

But when `fitting_net/type` is `property`, the property bias layer is used to normalize the property output of the model, i.e.,

$$p_{bias}\bigl(\mathrm{MLP}(D_i)\bigr) = \mathrm{MLP}(D_i) \cdot \mathrm{std} + \mathrm{mean}$$

1. `std`: the standard deviation of the property label
2. `mean`: the average value of the property label

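As an illustration of this normalization, the sketch below computes `mean` and `std` from a set of property labels and maps a normalized network output back to the physical scale. It is a conceptual NumPy sketch with made-up values, not the DeePMD-kit implementation:

```python
import numpy as np

# Property labels of the training set, shape (nframes, task_dim); illustrative values.
labels = np.array(
    [
        [-0.245, 0.064, 0.309],
        [-0.236, 0.056, 0.292],
        [-0.251, 0.071, 0.315],
    ]
)

# Statistics used by the property bias layer.
mean = labels.mean(axis=0)
std = labels.std(axis=0)

# A hypothetical raw network output (roughly zero mean, unit variance).
mlp_output = np.array([0.3, -0.5, 0.1])

# Map the normalized output back to the physical property scale.
prediction = mlp_output * std + mean
print(prediction)
```
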
New file (4 lines added):
Some explanations of the parameters in `input.json`:

1. `fitting_net/property_name` is the name of the property to be predicted. It should be consistent with the property name in the dataset. In each system, the code will read the `set.*/{property_name}.npy` file as the prediction label if you use NumPy-format data.
2. `fitting_net/task_dim` is the dimension of the model output. It should be consistent with the property dimension in the dataset: if the shape of the data stored in `set.*/{property_name}.npy` is `batch_size * 3`, `fitting_net/task_dim` should be set to 3.
3. `fitting_net/intensive` indicates whether the fitted property is intensive. If `intensive` is `true`, the model output is the average of the property contributions of each atom. If `intensive` is `false`, the model output is the sum of the property contributions of each atom.