What would be the slightly bigger student base model configuration?
This was our 6-2 student base system from En-De WMT22.
    # Model options
    dec-cell-base-depth: 2
    dec-cell-high-depth: 1
    dec-cell: ssru
    dec-depth: 2
    dim-emb: 512
    enc-cell-depth: 1
    enc-cell: gru
    enc-depth: 6
    enc-type: bidirectional
    tied-embeddings-all: true
    transformer-decoder-autoreg: rnn
    transformer-dim-ffn: 2048
    transformer-ffn-activation: relu
    transformer-ffn-depth: 2
    transformer-guided-alignment-layer: last
    transformer-heads: 8
    transformer-postprocess: dan
    transformer-preprocess: ""
    transformer-tied-layers: []
    transformer-train-position-embeddings: false
    type: transformer
This is essentially the same as: https://github.com/browsermt/students/blob/master/train-student/models/student.base/student.base.yml
For comparison, here is our 6-2 student tiny, which looks to be the same as the tiny11 you posted in #3:
    # Model options
    dec-cell-base-depth: 2
    dec-cell-high-depth: 1
    dec-cell: ssru
    dec-depth: 2
    dim-emb: 256
    enc-cell-depth: 1
    enc-cell: gru
    enc-depth: 6
    enc-type: bidirectional
    tied-embeddings-all: true
    transformer-decoder-autoreg: rnn
    transformer-dim-ffn: 1536
    transformer-ffn-activation: relu
    transformer-ffn-depth: 2
    transformer-guided-alignment-layer: last
    transformer-heads: 8
    transformer-postprocess: dan
    transformer-preprocess: ""
    transformer-tied-layers: []
    transformer-train-position-embeddings: false
    type: transformer
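Reading the two listings side by side, only two keys differ, so the step from tiny to base can be summarized as the following fragment. This is just a diff of the options quoted above, not an additional recommendation; every other option is identical in both configurations.

```yaml
# Keys whose values differ between the base and tiny listings above.
# All other model options are the same in both.
dim-emb: 512               # tiny uses 256
transformer-dim-ffn: 2048  # tiny uses 1536
```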
Then decide between tied-embeddings-all: true and tied-embeddings: true, as required by your setup.
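As a rough guide to that choice, in Marian tied-embeddings-all ties the source embeddings, target embeddings and output layer together, which presupposes a vocabulary shared between source and target, while tied-embeddings ties only the target embeddings and the output layer. A minimal sketch of the two variants follows; the separate-vocabulary case is an assumed scenario for illustration and is not described in this thread.

```yaml
# Option A: one vocabulary shared by source and target
# (this is what both listings above use).
tied-embeddings-all: true

# Option B (assumed scenario): separate source and target vocabularies.
# Only the target embeddings and output layer are tied.
# tied-embeddings-all: false
# tied-embeddings: true
```

Option B is left commented out so the fragment stays a single valid configuration; swap the comments to switch variants.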