Normalize path_score as confidence score for utterance? #18

chinshr · 2015-05-05T05:17:33Z

How can I normalize a path_score (like in Pocketsphinx::Decoder::Hypothesis) of an utterance to a relative confidence probability?

The text was updated successfully, but these errors were encountered:

nshmyrev · 2015-05-05T17:15:23Z

You can not do that. Confidence is available in C api with ps_get_prob call which is not used in bindings.

watsonbox · 2015-05-12T16:22:31Z

I've made it possible to get at this value using Pocketsphinx::Decoder::Hypothesis#posterior_prob. Does this help? Is there some additional normalization calculation that would be worth adding to the hypothesis?

@nshmyrev Does calling ps_get_prob every time I call ps_get_hyp have any negative performance implications?

chinshr · 2015-05-13T04:02:04Z

@watsonbox IMO a step in the right direction, but the posterior probability is logarithmic and needs to be converted to a decimal probability in order to get to a more meaningful confidence score, e.g. .81 = 81% confidence, etc. According to my investigation, I think an approach worth investigating is the following:

...
# Add inside Pocketsphinx::API::Pocketsphinx
typedef :pointer, :logmath
attach_function :ps_get_logmath, [:decoder], :logmath
attach_function :logmath_get_base, [:logmath], FFI::NativeType::FLOAT64
attach_function :logmath_exp, [:logmath, :int], FFI::NativeType::FLOAT64
...
# Pocketsphinx::Decoder
logmath = ps_api.ps_get_logmath(ps_decoder)
logbase = ps_api.logmath_get_base(logmath)
log_prob = ps_api.ps_get_prob(ps_decoder) # -> -9834
dec_prob = ps_api.logmath_exp(logmath, log_prob) # => 0.83111

Something similar needs to happen per word within an utterance:

# Inside Pocketsphinx::Decoder
def words
  ...
  acoustic_score = FFI::MemoryPointer.new(:int32, 1)
  language_score = FFI::MemoryPointer.new(:int32, 1)
  language_backoff = FFI::MemoryPointer.new(:int32, 1)

  ps_api.ps_seg_prob(seg_iter, acoustic_score, language_score, language_backoff)
  ...

Again, these scores are logarithmic and need to be converted before they are meaningful.

@chinshr

This is based on the comment by @chinshr on watsonbox#18 (comment) 1506602 This conversion makes posterior_prob a usable confidence value between 0 and 1.

watsonbox · 2015-08-10T17:21:21Z

Sorry for the delay! Please let me know if my commit resolves these issues.

chinshr changed the title ~~Normalize path_score for an utterance~~ Normalize path_score as confidence score for utterance? May 5, 2015

watsonbox added a commit that referenced this issue May 12, 2015

Add Pocketsphinx::Decoder::Hypothesis#posterior_prob [#18]

93ca565

joepestro mentioned this issue Jun 26, 2015

Convert posterior probability using logmath_exp. #21

Open

watsonbox added a commit that referenced this issue Aug 10, 2015

Convert logarithmic values when calculating a hypothesis [#18]

9958a43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalize path_score as confidence score for utterance? #18

Normalize path_score as confidence score for utterance? #18

chinshr commented May 5, 2015

nshmyrev commented May 5, 2015

watsonbox commented May 12, 2015

chinshr commented May 13, 2015

watsonbox commented Aug 10, 2015

Normalize path_score as confidence score for utterance? #18

Normalize path_score as confidence score for utterance? #18

Comments

chinshr commented May 5, 2015

nshmyrev commented May 5, 2015

watsonbox commented May 12, 2015

chinshr commented May 13, 2015

watsonbox commented Aug 10, 2015