You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The CLAP model version 2023 process audios sampled at 44100 Hz according to this configuration file).
However, when we initialise the model, the htsat module still uses this other config file to define some of its parameters, as we can see here. In this config file, the sampling rate is set to 32000. As a result, the LogmelFilterBank is initialised with a sampling rate of 32000.
Is this behaviour expected ?
Thanks!
The text was updated successfully, but these errors were encountered:
Could you, please, have a look at this issue? It is not only about Sampling Rate, but also about Fmax. The values used in the code (from config.pyhere) are different from those in yml model config and the original 2023 version paper
The CLAP model version 2023 process audios sampled at 44100 Hz according to this configuration file).
However, when we initialise the model, the htsat module still uses this other config file to define some of its parameters, as we can see here. In this config file, the sampling rate is set to 32000. As a result, the
LogmelFilterBank
is initialised with a sampling rate of 32000.Is this behaviour expected ?
Thanks!
The text was updated successfully, but these errors were encountered: