Documentation
¶
Overview ¶
Package simhash implements SimHash algorithm for near-duplicate detection.
The original algorithm is taken from: https://github.com/yahoo/gryffin/blob/master/html-distance/feature.go Optimized implementation with performance improvements.
Original Copyright 2015, Yahoo Inc. All rights reserved. Use of this source code is governed by a BSD-style license that can be found in the LICENSE file.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
Types ¶
Click to show internal directories.
Click to hide internal directories.