Documentation
¶
Overview ¶
Package textutil provides text processing utilities for form classification.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func NormalizeWhitespaces ¶
NormalizeWhitespaces replaces newlines and multiple whitespace with a single space.
func NumberPattern ¶
NumberPattern replaces digits with X and letters with C if the digit ratio >= threshold. Returns empty string otherwise.
func TokenNgrams ¶
TokenNgrams returns n-grams from a list of tokens, joined by space.
Types ¶
This section is empty.
Click to show internal directories.
Click to hide internal directories.