ngram

Here are 145 public repositories matching this topic...

Colibri Core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, of either fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate a…

  • Updated Nov 25, 2024
  • C++
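To make the terms in the description above concrete, here is a minimal plain-Python sketch of n-grams and fixed-size skipgrams. It does not use Colibri Core or reflect its API; the function names `ngrams` and `skipgrams` and the gap marker `{*}` are assumptions chosen for illustration, and only interior positions are treated as possible gaps.

```python
from itertools import combinations

GAP = "{*}"  # hypothetical gap marker, chosen for this sketch only


def ngrams(tokens, n):
    """Return every contiguous window of n tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n):
    """Return n-token patterns in which one or more interior tokens
    are replaced by a gap marker (first and last token are kept)."""
    results = []
    for gram in ngrams(tokens, n):
        interior = range(1, n - 1)
        # Try every number of gaps from 1 up to all interior positions.
        for k in range(1, n - 1):
            for positions in combinations(interior, k):
                results.append(
                    tuple(GAP if i in positions else tok
                          for i, tok in enumerate(gram))
                )
    return results


tokens = "to be or not".split()
print(ngrams(tokens, 2))     # contiguous bigrams
print(skipgrams(tokens, 3))  # trigram patterns with the middle token skipped
```

A dedicated library would typically store such patterns in a compact encoded form rather than as Python tuples; the sketch only shows what the patterns are.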

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.

  • Updated Apr 25, 2022
  • Scala
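As an illustration of one of the measures named above, the following is a rough sketch of the Dice coefficient computed over character n-grams. It is written in plain Python and is not the Scala/Java library's implementation; the helper names `char_ngrams` and `dice` are assumptions, and the sketch uses sets of n-grams (repeated n-grams count once), which is only one of several common conventions.

```python
def char_ngrams(s, n=2):
    """Return the set of character n-grams in s."""
    return {s[i:i + n] for i in range(len(s) - n + 1)}


def dice(a, b, n=2):
    """Dice coefficient: 2 * |X & Y| / (|X| + |Y|) over n-gram sets."""
    x, y = char_ngrams(a, n), char_ngrams(b, n)
    if not x and not y:
        # Both strings are shorter than n: fall back to exact equality.
        return 1.0 if a == b else 0.0
    return 2 * len(x & y) / (len(x) + len(y))


# "night" and "nacht" each have four bigrams and share only "ht".
print(dice("night", "nacht"))  # 0.25
```

The score ranges from 0.0 (no shared n-grams) to 1.0 (identical n-gram sets).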
