author | Russell Belfer <rb@github.com> | 2013-02-14 17:25:10 -0800 |
---|---|---|
committer | Russell Belfer <rb@github.com> | 2013-02-20 15:09:40 -0800 |
commit | 5e5848eb15cc0dd8476d1c6882a9f770e6556586 (patch) | |
tree | 953fd30d6360b67c2174b6c03fd2984561c84cf6 /tests-clar/diff/blob.c | |
parent | 99ba8f2322eaa2df51ace9782b8eadc8c5a6e8b8 (diff) | |
download | libgit2-5e5848eb15cc0dd8476d1c6882a9f770e6556586.tar.gz |

Change similarity metric to sampled hashes

This moves the similarity metric code out of buf_text and into a
new file. Also, this implements a different approach to similarity
measurement based on a Rabin-Karp rolling hash where we only keep
the top 100 and bottom 100 hashes. In theory, that should be
sufficient samples to give a fairly accurate measurement while
limiting the amount of data we keep for file signatures no matter
how large the file is.
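
The sampling idea described above can be sketched roughly as follows.
This is an illustration only, not the code from this commit: the names
sketch_sig, sketch_sig_build and sketch_similarity, the 8-byte window,
the base 257, and the overlap score are all assumptions made for the
example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define SIG_WINDOW 8     /* hypothetical rolling window size, in bytes */
#define SIG_KEEP   100   /* samples kept at each extreme */
#define SIG_BASE   257u  /* hypothetical polynomial base */

/* Fixed-size signature: its size does not depend on the file size. */
typedef struct {
    uint32_t lo[SIG_KEEP]; size_t nlo; /* lowest hashes, ascending */
    uint32_t hi[SIG_KEEP]; size_t nhi; /* highest hashes, stored as ~h, ascending */
} sketch_sig;

/* Insert v into a sorted, duplicate-free set that keeps only the
 * SIG_KEEP smallest values seen so far. */
void keep_smallest(uint32_t *set, size_t *n, uint32_t v)
{
    size_t pos = 0, i;

    while (pos < *n && set[pos] < v)
        pos++;
    if (pos < *n && set[pos] == v)
        return;                     /* already sampled */
    if (pos == SIG_KEEP)
        return;                     /* set is full and v is too large */
    if (*n < SIG_KEEP)
        (*n)++;                     /* otherwise the last entry is dropped */
    for (i = *n - 1; i > pos; i--)
        set[i] = set[i - 1];
    set[pos] = v;
}

/* Hash every SIG_WINDOW-byte window with a Rabin-Karp style rolling
 * hash (arithmetic wraps modulo 2^32) and sample both extremes. */
void sketch_sig_build(sketch_sig *sig, const unsigned char *data, size_t len)
{
    uint32_t h = 0, top = 1;
    size_t i;

    assert(sig != NULL && (data != NULL || len == 0));
    sig->nlo = sig->nhi = 0;

    for (i = 0; i + 1 < SIG_WINDOW; i++)
        top *= SIG_BASE;            /* SIG_BASE^(SIG_WINDOW - 1) */

    for (i = 0; i < len; i++) {
        if (i >= SIG_WINDOW)
            h -= top * data[i - SIG_WINDOW];  /* drop the outgoing byte */
        h = h * SIG_BASE + data[i];           /* shift in the new byte */
        if (i + 1 >= SIG_WINDOW) {            /* a full window is hashed */
            keep_smallest(sig->lo, &sig->nlo, h);
            keep_smallest(sig->hi, &sig->nhi, ~h); /* smallest ~h == largest h */
        }
    }
}

/* Count values present in both sorted, duplicate-free arrays. */
size_t count_common(const uint32_t *a, size_t na, const uint32_t *b, size_t nb)
{
    size_t i = 0, j = 0, common = 0;

    while (i < na && j < nb) {
        if (a[i] < b[j])
            i++;
        else if (a[i] > b[j])
            j++;
        else {
            common++; i++; j++;
        }
    }
    return common;
}

/* Rough overlap score from 0 to 100; 0 if neither side has samples. */
int sketch_similarity(const sketch_sig *a, const sketch_sig *b)
{
    size_t total = a->nlo + a->nhi + b->nlo + b->nhi;
    size_t common = count_common(a->lo, a->nlo, b->lo, b->nlo)
                  + count_common(a->hi, a->nhi, b->hi, b->nhi);

    return total ? (int)(200 * common / total) : 0;
}
```

Build one sketch_sig per file with sketch_sig_build and compare the two
with sketch_similarity. Each signature holds at most 200 hashes however
large the input is, which is the property the message above is after.
Inputs shorter than the window produce no samples and score 0.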
Diffstat (limited to 'tests-clar/diff/blob.c')
0 files changed, 0 insertions, 0 deletions