Model converter needed #36

dhdaines · 2022-11-30T04:03:31Z

The acoustic model format has diverged somewhat from CMU Sphinx, and is expected to diverge further. Supporting multiple model formats is not consistent with the goal of making the smallest possible library, so we need a converter in order to use publicly available models. Currently this means:

Convert text to binary model definition
Convert mixture_weights to sendump
Rename text files to include ".txt" extension
Convert feat_params to JSON
Include default dictionary
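
As a rough illustration of the two simplest steps above (the ".txt" rename and the feat_params conversion), here is a minimal Python sketch. It is not the project's converter. It assumes feat_params is a plain-text file of whitespace-separated "-name value" pairs, as in CMU Sphinx models, and that the JSON side is a flat object keyed by the names without the leading dash. The function names, the choice to keep values as strings, and the list of files to rename are all hypothetical.

```python
import json
import shlex
from pathlib import Path


def feat_params_to_dict(text):
    """Parse "-name value" pairs into a dict, dropping the leading dash.

    Assumption: tokens strictly alternate between flag and value.
    Values are kept as strings; the target JSON schema is not specified here.
    """
    tokens = shlex.split(text, comments=True)
    params = {}
    for flag, value in zip(tokens[::2], tokens[1::2]):
        params[flag.lstrip("-")] = value
    return params


def add_txt_extension(model_dir, names):
    """Rename each listed file in model_dir to name + ".txt", if it exists.

    `names` is supplied by the caller; which files count as text files
    is an assumption left to whoever runs the conversion.
    """
    renamed = []
    for name in names:
        src = Path(model_dir) / name
        if src.is_file():
            dst = src.with_name(src.name + ".txt")
            src.rename(dst)
            renamed.append(dst.name)
    return renamed
```

Something like `json.dumps(feat_params_to_dict(Path("feat_params").read_text()), indent=2)` would then give the JSON text to write out. The binary model definition and sendump conversions depend on the actual file formats and are deliberately left out of this sketch.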

In the future it may mean (but this is not in the scope of this issue):

Dictionary is an FST and may be a G2P model
Model definition is also an FST (i.e. the "HC" in "HCLG")
GMMs are also quantized
GMMs might be replaced by DNNs


dhdaines modified the milestones: 0.5.0, 0.4.2 Nov 30, 2022

dhdaines added the enhancement New feature or request label Dec 1, 2022
