We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
145f4e0
In [1]: from tiny_tokenizer import WordTokenizer In [2]: tk = WordTokenizer("whitespace") In [3]: tk.tokenize("γγγ γ― η«") Out[3]: [η§, γ―, η«]