BASE/BC_H13 v20220210
Cryolite edited this page Mar 10, 2022 · 2 revisions
- Type: Transformer encoder layers (the same network structure as the one used for BERT-Base)
- Dimension: 768
- # of heads: 12
- Dimension of feedforward networks: 3072
- # of layers: 12
- Activation function: GELU
- Dropout rate in training: 0.1
- Initialization: Random
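
As a rough sanity check on the sizes listed above, the following sketch derives the per-head dimension and an approximate parameter count for the encoder stack. It assumes a textbook Transformer encoder layer (four attention projections with biases, a two-layer feedforward block, and two LayerNorms) and leaves out any embedding or input layers. The helper `encoder_layer_params` is hypothetical, and the actual implementation may count differently.

```python
# Hyperparameters taken from the list above.
DIM = 768
NUM_HEADS = 12
FF_DIM = 3072
NUM_LAYERS = 12


def encoder_layer_params(dim: int, ff_dim: int) -> int:
    """Count weights and biases in one textbook Transformer encoder layer.

    Assumption: biases everywhere and two LayerNorms per layer.
    """
    attention = 4 * (dim * dim + dim)  # Q, K, V, and output projections
    feedforward = (dim * ff_dim + ff_dim) + (ff_dim * dim + dim)
    layer_norms = 2 * (2 * dim)  # each LayerNorm has a scale and a shift vector
    return attention + feedforward + layer_norms


head_dim = DIM // NUM_HEADS
per_layer = encoder_layer_params(DIM, FF_DIM)
total = NUM_LAYERS * per_layer

print(head_dim)   # 64
print(per_layer)  # 7087872
print(total)      # 85054464
```

Under these assumptions the encoder stack holds roughly 85 million parameters, which is in line with the encoder portion of a BERT-Base-sized model.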
- Type: Single-layer position-wise feedforward network
- Dimension: 3072
- Activation function: GELU
- Dropout rate in training: 0.1
- Initialization: Random
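
Both networks above use GELU as their activation. The page does not say whether the exact form or the common tanh approximation is used, so the sketch below shows both for comparison. The function names are illustrative only.

```python
import math


def gelu_exact(x: float) -> float:
    """GELU(x) = x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Widely used tanh approximation of GELU."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


for value in (-2.0, -1.0, 0.0, 1.0, 2.0):
    print(value, gelu_exact(value), gelu_tanh(value))
```

The two variants agree closely over typical activation ranges, so the choice rarely matters for a description at this level of detail.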
- Type: Behavioral cloning (BC)
- Loss function: Cross entropy
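
In behavioral cloning with a cross-entropy loss, the model is trained to assign high probability to the action the recorded player actually chose. The following minimal sketch computes that loss for a single sample in plain Python. The function name, the logits, and the action index are all made up for illustration and are not taken from the actual training code.

```python
import math
from typing import Sequence


def bc_cross_entropy(logits: Sequence[float], expert_action: int) -> float:
    """Negative log-probability of the expert's action under softmax(logits)."""
    max_logit = max(logits)  # subtract the maximum for numerical stability
    log_sum_exp = max_logit + math.log(
        sum(math.exp(z - max_logit) for z in logits)
    )
    return log_sum_exp - logits[expert_action]


# Hypothetical scores over four candidate actions; the expert chose index 2.
example_logits = [0.5, -1.0, 2.0, 0.0]
print(bc_cross_entropy(example_logits, expert_action=2))
```

The loss shrinks toward zero as the score of the expert's action grows relative to the alternatives, and equals the log of the number of candidates when all scores are equal.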
Crawled Game Records v202007_202107
43,002,752 samples, each an action selection made by a Saint 2, Saint 3, or Celestial player, randomly sampled from the crawled data and shuffled.
- Optimizer: LAMB
- Learning rate: 0.001
- ε: 1.0e-6
- Batch size: 4096
- # of training epochs: 1
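
The sample count, batch size, and epoch count together determine how many optimizer steps the run takes. The short calculation below works this out. Whether the final partial batch is used or dropped is not stated on this page, so both figures are shown.

```python
import math

NUM_SAMPLES = 43_002_752
BATCH_SIZE = 4096
NUM_EPOCHS = 1

full_batches, leftover = divmod(NUM_SAMPLES, BATCH_SIZE)
steps_keep_last = NUM_EPOCHS * math.ceil(NUM_SAMPLES / BATCH_SIZE)
steps_drop_last = NUM_EPOCHS * full_batches

print(full_batches)     # 10498
print(leftover)         # 2944
print(steps_keep_last)  # 10499
print(steps_drop_last)  # 10498
```

Either way, the single epoch amounts to roughly 10,500 LAMB updates at a batch size of 4096.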